Member of Technical Staff
Tasks
- Analyze AI model performance
- Assess AI systems across models tools and hardware
- Build evaluation datasets
- Collaborate with AI labs on model evaluation
- Communicate analysis through visualization
- Create analytical frameworks
- Design and execute AI benchmarking projects
- Develop AI evaluation methodologies
- Identify gaps in AI evaluation systems
- Improve benchmarking infrastructure
- Produce strategic evaluation reports
Perks/Benefits
Skills/Tech-stack
Agentic Systems | Benchmarking | Data Analysis | Data Visualization | Dataset Construction | Evaluation Pipelines | Experimentation | GitHub | Language Models | Language Processing | Large Language Models | Machine Learning | Model Evaluation | Multimodal Models | Natural Language | Natural Language Processing | Python | Version control
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Regions
Countries
States
Related jobs
-
GenAI | Generalized Linear Models | Linear Models | Machine Learning | Model DeploymentFlexible time offExecutive-level Full TimeRemote, Remote, United States R6h ago
-
Databricks Solution Architect USD 180K-247KAWS S3 | Apache Spark | Autoscaling | Azure Data | Azure Data LakeSenior-level Full TimeUnited States R16h ago
-
Staff Data Scientist USD 220K-250KAutomation | Dashboards | Data Governance | Data Modeling | Data pipeline401k plan | Company paid life insurance | Dental insurance | Family building benefits | Gym discountsSenior-level Full TimeAustin, Texas R20h ago
-
C++ | Cloud Computing | Code Reviews | Deployment Automation | Distributed Systems401k match | Caregiving support | Family planning support | Flexible vacation | Gender-affirming careSenior-level Full TimeRemote - United States R20h ago
-
Data Scientist, Reuters USD 82K-152KAWS | Agile | Azure | Computer Vision | Deep learningContinuous learning | Flexible work-life balance | Mental health days | Retirement savings | Tuition reimbursementEntry-level Full TimeUnited States of America, New York, … R21h ago
-
Lead Data Scientist- Recommendation Systems USD 140K-188KAzure | Client Communication | Cloud platform | Data Ingestion | DatabricksCareer development | Individual responsibilitySenior-level Full TimeNew York, New York, United States … R21h ago
-
APIs | Agentic Workflows | CI/CD | Cost Management | GeminiSenior-level Full TimeRemote - USA, United States R21h ago
-
Manager, AI Engineering - Analytics USD 197K-267KAgent systems | Artificial Intelligence | Data Modeling | Data Warehousing | EvalsHybrid work flexibility | Professional growth opportunities | Stock equity | Work-life balanceMid-level Full TimeHybrid - San Francisco R21h ago
-
Machine Learning Operations Engineer USD 133K-167KAWS SageMaker | Docker | GitHub Actions | Machine Learning | NumPyCareer development | Communities | Commuting cost coverage | Corporate giving programs | Daily free lunchMid-level Full TimeBoston, Massachusetts, United States R22h ago
-
Applied AI Engineer, Investments USD 134K-183KAPIs | Artificial Intelligence | Cloud technologies | Data Pipelines | Data Processing401k match | Family-forming benefits | Paid time off | Relocation support | Volunteer time offEntry-level Full TimeRedwood City, CA (Hybrid) R22h ago
-
Senior-level Full TimeRemote - United States R23h ago
-
AI Solutions Manager, Digital Native USD 140K-175KAI Observability | AI systems | CRM | Customer Success | Data Infrastructure401k | Medical/Dental/Vision | Mental wellness support | Parental leave | Unlimited paid time offMid-level Full TimeRemote (San Francisco) R23h ago
-
Senior Data Scientist USD 172K-225KA/B | A/B Testing | Amplitude | Attribution Modeling | B testing401k | Equity | Flexible time off | Health benefitsSenior-level Full TimeSan Francisco, CA; New York City, … R1d ago
-
Forward Deployed AI Engineer, West USD 125K-175KAWS | Azure | Docker | GCP | Generative AI401k plan | Dental insurance | Medical insurance | Parental leave | Unlimited paid time offMid-level Full TimeRemote (San Francisco) R1d ago
-
Senior Machine Learning Engineer, Reinforcement Learning USD 150K-250KDomain Randomization | Embedded Systems | Gazebo | Isaac-Gym | Mujoco401k retirement plan | Dental insurance | Employee referral bonus | Flexible PTO | Free lunchSenior-level Full TimeColumbus, Ohio or Remote R1d ago
-
Senior Data Platform Engineer USD 140K-220KApache Hudi | Apache Spark | CI/CD | Delta Lake | Distributed StorageSenior-level Full TimePittsburgh, PA or Remote R1d ago
-
Staff Software Engineer, Data Platform USD 170K-240KAPI Design | Backend Services | Frontend Development | JavaScript | Node.js401k match | Dental insurance | Equity stock options | Health insurance | Learning GrantSenior-level Full TimeRemote - USA R1d ago
-
Senior Engineer - Data Platform USD 148K-201KAirgapped environments | CI/CD | CRD | ConnectRPC | Consistency models401k retirement plan | Conference support | Dental insurance | Disability insurance | Flexible time offSenior-level Full TimeRemote, United States R1d ago
-
Staff Data Scientist - Product USD 205K-295KA/B | A/B Testing | Amplitude | B testing | Cohort Analysis401k | AD D Insurance | Employee assistance program | Equity | FSA HSA benefitsSenior-level Full TimeRemote, United States R1d ago
-
Associate Engineer - Data Platform USD 102K-138KAI Assisted Development | Command Line | Command-line Interface | Containerization | Docker401k retirement plan | Conferences travel lodging fees | Dental & vision insurance | Disability insurance | Flexible time offMid-level Full TimeRemote, United States R1d ago
-
Data Science Engineer (Shreveport, LA) USD 37K-40KData Historian | Data Visualization | Data analytics | Excel | Machine Learning401k match | Dental insurance | Disability insurance | Health insurance | Life insuranceMid-level Full TimeAtlanta, GA, United States R1d ago
-
C++ | CUDA | CUDA kernels | Concurrency | Distributed SystemsSenior-level Full TimePittsburgh, PA or Remote R1d ago
-
C++ | CUDA | Data parallelism | GRPC | GoSenior-level Full TimePittsburgh, PA or Remote R1d ago
-
Data Scientist 1 USD 95K-115KAWS | Azure | BigQuery | Cloud platform | Data Engineering401k match | Continuing education reimbursement | Employer Paid Parental Leave | Flexible working hours | Paid time offMid-level Full TimeUnited States - Remote R1d ago
-
Staff Machine Learning Engineer, Underwriting and Credit USD 276K-415KA/B | A/B Testing | AWS | Airflow | B testingFlexible time off | Medical insurance | Modern family planning | Remote work | Retirement savings plansSenior-level Full TimeBay Area, CA, United States of … R1d ago