Member of Technical Staff
Tasks
- Analyze AI model performance
- Assess AI systems across models, tools, and hardware
- Build evaluation datasets
- Collaborate with AI labs on model evaluation
- Communicate analysis through visualization
- Create analytical frameworks
- Design and execute AI benchmarking projects
- Develop AI evaluation methodologies
- Identify gaps in AI evaluation systems
- Improve benchmarking infrastructure
- Produce strategic evaluation reports
Skills/Tech-stack
Agentic Systems | Benchmarking | Data Analysis | Data Visualization | Dataset Construction | Evaluation Pipelines | Experimentation | GitHub | Language Models | Language Processing | Large Language Models | Machine Learning | Model Evaluation | Multimodal Models | Natural Language | Natural Language Processing | Python | Version Control
Education
Bachelor of Engineering | Bachelor of Science | Master of Science