Machine Learning Engineer - Evaluation
Tasks
- Audit evaluation for bias
- Build LLM powered evaluation pipelines
- Build RAG pipelines
- Build model fine tuning workflows
- Define evaluation benchmarking infrastructure
- Design and run evaluation experiments
- Detect evaluation drift and regressions
- Own evaluation methodology end to end
- Translate model behavior into product outcomes
Perks/Benefits
Skills/Tech-stack
Benchmarking | Bias detection | Evaluation methodology | Experiment design | Fine Tuning | LLM Evaluation | Language Models | Large Language Models | Machine Learning | Prompt engineering | RAG
Education
N/A
Regions
Countries
States
Cities
Related jobs
-
Machine Learning Engineer USD 131K-178KAWS | Cassandra | Convolutional Neural Networks | Data Lakes | Data PipelinesMid-level Full TimeRemote, NY, US R14h ago
-
Software Engineer, Machine Learning USD 213K-293KAPI Design | Agent Orchestration | Artificial Intelligence | Bias Mitigation | C++Senior-level Full TimeSunnyvale, CA | Remote, US | … R17h ago
-
Staff Machine Learning Engineer USD 189K-389KCalibration | Contextual Bandits | Contextual Decisioning | Data Validation | EmbeddingsEquity eligible | In Office 1 Day Per WeekSenior-level Full TimeSan Francisco, CA, US; Remote, US R1d ago
-
Agile | C++ | Deep learning | Distributed Computing | GPU ComputingDiscretionary bonus | Flexible time off | Healthcare | Leave benefits | Retirement benefitsExecutive-level Full TimeNY7 - 50 Hudson Yards, New … R1d ago
-
Senior Software Engineer, AI USD 171K-210KAirflow | Amazon Web Services | Apache Hive | Apache Impala | C#Career development access | Employee resource groups | Flexible WFH | Generous PTO | Internet reimbursementSenior-level Full TimeUS-California-Remote, United States R1d ago
-
Senior Software Engineer USD 144K-192KAWS | Angular | Apache Spark | Azure | BuildahCareer development | Employee resource groups | Flexible WFH | Generous PTO | Paid volunteer timeSenior-level Full TimeUS-California-Remote, United States R1d ago
-
Senior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAblation Studies | Accelerator hardware | Computer Vision | Data Quality | Data labelingCareer growth | Full-time employment | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KActive Learning | Apache Beam | CI/CD | Code review | Data GovernanceCareer growth | Health benefits | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapter methods | Attention Optimization | DPO | Deep learning | FSDPBenefits package | Career growth potential | Full-time employment | Remote work | W2 employmentMid-level Full TimeUnited States - Remote R1d ago
-
Senior Machine Learning Engineer USD 156K-211KAPI Development | AWS | Agentic Workflows | CI/CD | Cloud ArchitectureAward-winning time-off plans | Comprehensive health, dental, vision coverage | Flexible work models | Life and disability insurance | Retirement and savings planSenior-level Full TimeUS - California - Thousand Oaks … R1d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | CUDA | Compiler optimization | Continuous batchingCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering Architect USD 100K-150KAgentic Systems | Chunking | Cost Optimization | Embeddings | Evaluation Frameworks100 percent remote | Career growth | MentorshipSenior-level Full TimeUnited States - Remote R1d ago
-
Software Engineer AI/ML USD 112K-150KA/B | A/B Testing | AWS | Anomaly Detection | Automated testingDental benefits | Employee assistance program | Health Coach | Health benefits | Retirement benefitsMid-level Full TimeEvendale, United States R1d ago
-
Sr. Data Engineer USD 93K-124KAWS CloudFormation | AWS DMS | AWS Glue | AWS Lambda | AWS X-Ray401k matching | Adoption Assistance | Dental & vision insurance | Health benefits | Paid parental leaveSenior-level Full TimeRemote, United States R1d ago
-
Sr Staff Gen AI Application Engineer USD 174K-210KAPI Development | Agentic Workflows | Application Security | CI/CD | Claude CodeAdoption Assistance | Disability insurance | Employee assistance program | Health Coach | HealthAhead programsSenior-level Full TimeRemote, United States R1d ago
-
Staff Backend AI Engineer, Remote USD 140K-215KAPI Gateway | AWS CDK | AWS ECS | AWS EKS | AWS Fargate401k matching | Dental insurance | Flexible time off | Flexible work schedule | Medical insuranceSenior-level Full TimeUnited States, UNITED STATES, United States R1d ago
-
Data Engineer USD 160K-210KAPI Integration | AWS | Amazon Kinesis | Artificial Intelligence | CI/CDSenior-level Full TimeUS - Remote R1d ago
-
Senior Machine Learning Engineer, Agentic USD 163K-245KA/B | A/B Testing | B testing | Collaborative Filtering | Content-Based Filtering401-K matching | Fertility benefits | Health insurance | Life and disability insurance | Mental health benefitsSenior-level Full TimeBellevue, WA; Menlo Park, CA R1d ago
-
Robotics Engineer USD 137K-187KBenchmarks | Calibration | Computer Vision | Data logging | Data synchronizationEntry-level Full TimeSF Bay Area, CA, Remote, International, … R1d ago
-
Senior Solution Engineer USD 165K-216KAPIs | AWS | Apache Airflow | Apache Kafka | Apache Spark401k | Flexible PTO | Health/Dental/Vision | Professional development budgetSenior-level Full TimeUS-TX-Remote R1d ago
-
Principal Machine Learning Engineer USD 245K-393KDistributed Systems | Infrastructure as Code | Lifecycle Management | ML pipelines | Machine LearningSenior-level Full TimeChicago, Illinois, USA R1d ago
-
Agentic AI Engineer USD 130K-201KAI Assistant | AI coding | AI coding tools | Automated testing | CI/CDSenior-level Full TimeChicago, IL R1d ago
-
Senior Software Engineer (Backend) - AI/ML USD 141K-232KAPI Design | AWS | Azure | Cloud Computing | Cloud platformEmployer health care contributions | Flexible time off | Global company offsites | Home office setup reimbursement | Remote-friendlySenior-level Full TimeUnited States (remote) R1d ago
-
Principal AI/ML Researcher / Engineer Reasoning, Planning, and Decision-making systems USD 296K-370KAgent systems | Belief State Tracking | C++ | Decision Making | Distributed Reinforcement LearningSenior-level Full TimeUnited States R1d ago