AI Engineer, Quality
San Francisco, CA or Remote (US)
USD 170K-220K | Entry-level | Full Time
Tasks
- Build a unified evaluation platform for AI agents
- Create automated model evaluation pipelines
- Create evaluation datasets from production traces
- Define evaluation standards and best practices
- Design guardrails and monitoring for quality regressions
- Develop evaluation harnesses and comparison frameworks
- Implement observability and tracing for agent execution
- Integrate and orchestrate LLMs, tools, and retrieval systems
Perks/Benefits
- 401k
- Flexible PTO
- Flexible work schedules
- Technology reimbursement
- Therapy sessions
- Wellness benefits
- Work from home reimbursement
Skills/Tech-stack
Embeddings | Evaluation Frameworks | LLM orchestration | Langfuse | LangGraph | LangSmith | Large Language Models | Monitoring | Observability | PostgreSQL | Python | Retrieval-Augmented Generation (RAG) | React | Tracing | TypeScript | Vector Databases
Education
N/A