Staff ML Engineer, Generative Model Performance & Efficiency

Mountain View, California, United States; New York City, New York, United States

USD 251K-310K Senior-level Full Time

@ W...

Found 21d ago

Tasks

Analyze model architectures for training performance bottlenecks
Build performance profiling and debugging tools
Design low-latency, high-throughput serving systems
Develop quantization and model compression techniques
Implement model partitioning and sharding strategies
Optimize model code for TPUs and GPUs
Optimize training and inference performance

Related jobs

Principal Knowledge & Data Architect USD 174K-284K

AWS Neptune | Canonicalization | Chunking | Cypher | DBT

Benefits including health and wellness programs | Health, wellness, and retirement plans

Senior-level Full Time

Headquarters - Chevy Chase, MD R

12d ago
Sr Data Engineer USD 120K-164K

AWS S3 | Data Quality | Data pipeline | DevOps | ETL

Senior-level Contract

Raleigh, United States

2h ago
SYSTEM ENGINEER - Computer Network Support - AI/ML - 6+ yrs of Experience - TS/SCI w/Poly clearance is required - ES - 032726-2 USD 136K-140K

AI/ML | Agile | Confluence | Jira | LLM

401k retirement plan | Dental insurance | Disability insurance | Federal Holidays | Floating holidays

Mid-level Full Time

Fort George G Meade, United States

2h ago
Senior Data Management Professional - Data Engineering - Private Credit USD 110K-190K

AI experimentation | Alerting | Artificial Intelligence | Data Annotation | Data Architecture

Senior-level Full Time

New York

3h ago
Software Engineer - TikTok AI Search Infrastructure USD 212K-389K

Algorithm Design | C++ | DAG | Data Architecture | Data Processing

Senior-level Full Time

San Jose, California, United States

4h ago
Machine Learning Engineer / AI Model Developer (Mid to Senior)(Top Secret Clearance Required) (Hybrid) USD 78K-176K

APIs | Agile methodologies | Backend Development | Cloud Computing | Data Preparation

401k | Employee discount program | Employee referral rewards | Flexible spending account | Flexible work schedule

Mid-level Full Time

Fort Belvoir, VA, US R

5h ago
Software Engineer, Systems ML Engineering USD 170K-251K

Alerting | Benchmarking | C++ | CUDA | Dashboarding

Senior-level Full Time

Sunnyvale, CA | Bellevue, WA | …

5h ago
AI/HPC Network Performance Engineer USD 147K-226K

AI Training | Alerting | Auto-remediation | C++ | Configuration Management

Oncall rotation

Senior-level Full Time

Menlo Park, CA

5h ago
Forward Deployed Engineer III, Google Cloud, Applied AI USD 174K-253K

API Integration | Agent systems | Agentic Workflows | Chatbots | Cloud platform

Benefits | Bonus | Equity | Travel up to 50 percent

Senior-level Full Time

San Francisco, CA, USA; Atlanta, GA, …

5h ago
Research Software Engineer, Multimodal AI USD 174K-253K

Agent Orchestration | Audio | C++ | Few-Shot Learning

Mid-level Full Time

San Jose, CA, USA

5h ago
ML Engineer, GenAI Ads, Search Personalization USD 207K-301K

C++ | Clustering Algorithms | Data Processing | Data Structures | Data Structures and Algorithms

Senior-level Full Time

Mountain View, CA, USA

5h ago
Senior Software Engineer, AI/ML, Geo and Gemini App USD 174K-253K

A/B Testing | C++ | Data Analysis

Senior-level Full Time

New York, NY, USA

5h ago
Senior Software Engineer, AI/ML, Ads Training USD 174K-253K

C++ | Distributed Systems | GPU | JAX | Machine Learning

Senior-level Full Time

Mountain View, CA, USA

5h ago
Principal Machine Learning Engineer, Content Safety USD 295K-345K

Computer Vision | Content Moderation | Data Pipelines | Deep learning | Language Models

Equity compensation

Senior-level Full Time

San Mateo, CA, United States R

10h ago
Software Engineer, Product USD 208K-269K

Agentic Frameworks | Anthropic | Claude Code | Cursor | Documentation

Dental insurance | Health insurance | Paid time off | Retirement plan | Vision insurance

Mid-level Full Time

San Francisco, CA

10h ago
Staff/Sr. ML Infrastructure Engineer, Foundation Model Compute Infra USD 175K-312K

C++ | Containers | Distributed Systems | Distributed Training | Distributed inference

Senior-level Full Time

Cupertino

11h ago
Gen AI Developer Associate USD 125K-170K

APIs | AWS | CI/CD | Docker | GraphQL

Senior-level Full Time

United States

12h ago
Lead Software Engineer - Applied AI ML Lead USD 170K-195K

Apache Airflow | Apache Iceberg | Apache Kafka | Apache Spark | Automated testing

Backup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centers

Senior-level Full Time

Palo Alto, CA, United States

14h ago
Senior AI and Machine Learning Engineer, Maps Services Evaluation USD 174K-253K

Data Analysis | Data Visualization | GenAI | Java | Jupyter Notebook

Senior-level Full Time

Cupertino

15h ago
Forward Deployed AI Engineer USD 180K-230K

API | AWS | Azure | CI/CD | Cloud platform

5-day workweek | Collaborative work culture | Flexible working hours | Supportive work environment

Senior-level Full Time

New York, New York, United States

16h ago
Principal AI Engineer USD 200K-245K

Agent-based | Agent-based architecture | Amazon Web Services | Azure | CI/CD

5 days per week | Collaborative culture | Flexible working hours | Supportive work environment

Senior-level Full Time

New York, New York, United States

16h ago
Machine Learning Engineer, Sensor Pipelines USD 175K-215K

C++ | Computer Vision | JAX | Machine Learning | PyTorch

Hybrid work schedule

Mid-level Full Time

Mountain View, CA, USA

16h ago
AI Engineer (Temporary) USD 85K-95K

AWS | Agents SDK | Anthropic API | Azure | Data Risk

Access to training and enablement resources | Hybrid work environment | Temporary contract with benefits

Entry-level Full Time Temporary

Ithaca (Main Campus), United States

16h ago
Principal Embedded Software Engineer - OS Build USD 152K-229K

Artifactory | BSP | Bamboo | Bash | Bitbucket

Senior-level Full Time

USA-CO Lafayette Bldg 2, United States

16h ago
Mission Planning and Optimization Engineer USD 110K-165K

Astrodynamics | C# | C++ | Combinatorial Optimization | Decision Making

401k matching | Education assistance | Paid Holidays | Relocation assistance | Sick time

Senior-level Full Time

El Segundo, United States

16h ago
