Applied AI Researcher, Benchmarking
Tasks
- Analyze noisy empirical results
- Build prototypes
- Conduct human-in-the-loop assessments
- Construct benchmarks
- Design evaluation frameworks
- Develop test suites
- Measure model or system performance
- Quantify emergent capability
- Run experimental evaluations
- Test adversarial robustness
- Track longitudinal performance
Perks/Benefits
- 100% covered health insurance
- 401k matching
- Access to state-of-the-art models
- Commuter benefits
- Dental insurance
- Equity
- In-office lunch
- Vision insurance
Skills/Tech-stack
A/B Testing | Adversarial Robustness | Deep learning | Experiment design | Human-in-the-loop | Machine Learning | Model Evaluation | Natural Language Processing | Python | Statistics | Time Series Analysis
Education
N/A