AI Evaluation Engineer - Software Engineering Domain
Egypt - Remote
R
A USD 120K-175K (estimate) Mid-level Contract Full Time
Tasks
- Collaborate on AI evaluation workflows
- Create debugging and investigation scenarios
- Design multi step reasoning challenges
- Design terminal based benchmark tasks for AI evaluation systems
- Develop task specifications for infrastructure workflows and pipelines
- Identify edge cases failure modes and system constraints
- Refine benchmark quality difficulty and validation logic
- Write solution approaches and deterministic evaluation criteria
Perks/Benefits
- N/A
Skills/Tech-stack
Automation | Benchmarking | Command Line | Command-line Interface | Data Pipelines | Debugging | DevOps | Evaluation Frameworks | Infrastructure | LLM | Linux | MLOps | Model Evaluation | Python | System Architecture
Education
N/A
Roles
AI | AI Evaluation Engineer | Engineer | Evaluation Engineer | Software Engineer
Related jobs
-
L2 Data Engineer - Remote USD 143K-174KAccess Control | Azure | Azure Data | Azure Data Factory | Azure SynapseMentorship | Remote workSenior-level Full TimeEgypt - Remote R5d ago
-
API Integration | Agentic AI | Automation | Generative AI | LLMFlexible schedule | Remote workMid-level Full TimeEgypt R5d ago
-
SASE Automation Engineer USD 151K-237KAnsible | CI/CD | DevOps | Docker | NetskopeFlexible working hours | Hybrid work | Internal training sessions | Remote work | Training budgetMid-level Full TimeCairo, Egypt R8d ago
-
Senior Applied ML Engineer (Speech & Audio) USD 140K-200KActivity Detection | Audio codecs | Audio preprocessing | Automatic Speech Recognition | ConformerAccommodation allowance | Career Development Programs | Career growth opportunities | Coffee | Daily DrinksSenior-level Full TimeEgypt - Remote R12d ago
-
AI Engineer (Hybrid) USD 125K-175KAgent Orchestration | Agent systems | Autogen | CI/CD | ChromaDBHybrid workMid-level Full TimeGiza, El Omraniya, Egypt R13d ago
-
API Integration | Automation | CRM | Claude | DashboardsFlexible work schedule | Fully remoteMid-level Full TimeEgypt R18d ago
-
AI Agent Engineer & Team Lead INR 2000K-5000KAutogen | CI/CD | Computer Vision | CrewAI | DockerFlexible working hours | Fully remote | Work from anywhereSenior-level Full TimeCairo Governorate, Cairo, Egypt R19d ago
-
AI Engineer USD 102K-135KAPI Integration | Automation | Cloud Computing | Data Ingestion | Data PipelinesPay transparencyMid-level Full TimeCairo, Cairo Governorate, Egypt - Remote R25d ago
-
AWS | Apache Spark | Azure | Compliance | Data GovernanceRemote workSenior-level Full TimeEgypt - Remote R26d ago
-
Senior AI Backend Engineer - Remotely USD 140K-180KAWS | Amazon SQS | Asyncio | Authentication | CI/CDRemote workSenior-level Full TimeCairo, Egypt (Remote) R26d ago
-
Senior MLOps Engineer - Remote - Robusta USD 150K-200KAWS | Alerting | CI/CD | Cloud platform | Data PipelinesRemote workSenior-level Full TimeEgypt - Remote R28d ago
-
Full-stack Data Engineer II USD 136K-170KAutogen | Databricks | Databricks Mosaic | Databricks Mosaic AI | Databricks WorkflowsBonuses | Hybrid work environmentMid-level Full TimeRemote, Cairo, Egypt R1mo ago
-
Data Engineer USD 115K-165KAWS Athena | AWS QuickSight | Amazon Web Services | Apache Airflow | Apache IcebergMid-level Full TimeCairo, Egypt, Egypt (Hybrid) R1mo ago
-
AI Team Lead PHP 150K-250KAutonomous Agents | Azure | Azure Machine Learning | BERT | CI/CDPaid time off | Performance bonus | Training and developmentSenior-level Full TimeEgypt - Remote R1mo ago
-
Senior-level Full TimeRemote , Egypt R1mo ago