Member of Technical Staff
Tasks
- Analyze AI model performance
- Assess AI systems across models, tools, and hardware
- Build evaluation datasets
- Collaborate with AI labs on model evaluation
- Communicate analysis through visualization
- Create analytical frameworks
- Design and execute AI benchmarking projects
- Develop AI evaluation methodologies
- Identify gaps in AI evaluation systems
- Improve benchmarking infrastructure
- Produce strategic evaluation reports
Skills/Tech-stack
Agentic Systems | Benchmarking | Data Analysis | Data Visualization | Dataset Construction | Evaluation Pipelines | Experimentation | GitHub | Language Models | Language Processing | Large Language Models | Machine Learning | Model Evaluation | Multimodal Models | Natural Language | Natural Language Processing | Python | Version control
Education
Bachelor of Engineering | Bachelor of Science | Master of Science