AI Inference Engineer - Model Optimization & Deployment

Foster City, CA

USD 205K-303K (estimate) Senior-level Full Time

@ Z...

Found 12h ago

Tasks

Accelerate inference with mixed precision
Benchmark accuracy and latency
Build TensorRT deployment pipelines
Create TensorRT plugins
Develop concurrent, memory-safe inference code
Implement model conversion pipelines
Optimize large-scale models
Perform parity checking between frameworks and binaries
Quantize models for inference
Recover accuracy after optimization
Write custom CUDA kernels

Perks/Benefits

Skills/Tech-stack

Education

N/A

Roles

Related jobs

Staff Software Engineer, Applied AI, Commerce AI USD 207K-300K

API Design | Agile | Application Programming | Application Programming Interfaces | Data Processing

Senior-level Full Time

Sunnyvale, CA, USA; Kirkland, WA, USA

1h ago
Senior Software Engineer, Applied AI, Commerce AI USD 174K-252K

API Design | Application Programming | Application Programming Interfaces | Data Processing | Debugging

Senior-level Full Time

Sunnyvale, CA, USA; Kirkland, WA, USA

1h ago
Data Engineer, Anti Scraping, Third Party Data Operations USD 156K-226K

AI | Adversarial Machine Learning | Data Integrity | Data Quality | Data logging

Mid-level Full Time

Mountain View, CA, USA

1h ago
Manufacturing Test Development Engineer, Data Center USD 120K-172K

Automated testing | Cause analysis | Electronics testing | Functional testing | Manufacturing test

Collaborate with international teams | International travel as needed | Work with global manufacturing partners

Mid-level Full Time

Sunnyvale, CA, USA

1h ago
Software Engineer, Applied AI, Commerce AI USD 147K-211K

API Design | Agile | Data Processing | Deep learning | Distributed Computing

Mid-level Full Time

Sunnyvale, CA, USA; Kirkland, WA, USA

1h ago
Staff Software Engineer, DevIntel Data Warehouse USD 207K-300K

C++ | Data Analysis | Data Structures | Data Warehouse | Data Warehouse Design

Senior-level Full Time

Seattle, WA, USA; Kirkland, WA, USA

1h ago
Staff Software Engineer, Cluster Management USD 207K-300K

C++ | Compute Technologies | Data Structures | Data Structures and Algorithms | Distributed Systems

Senior-level Full Time

Sunnyvale, CA, USA

1h ago
Software Engineer, Gemini App, Info Seek, Quality, DeepMind USD 147K-211K

A/B | A/B Testing | Agentic Workflows | B testing | Context engineering

Mid-level Full Time

Mountain View, CA, USA

1h ago
Associate Data Engineer USD 46K-111K

Automated testing | Azure DevOps | BigQuery | CI/CD | Cloud Storage

Company-Paid Holidays | Employee assistance program | Life and disability insurance | Medical, dental, and vision coverage | Paid time off

Mid-level Full Time

Nashville, TN, US

11h ago
Software Engineer, Infrastructure - Analytics Platform USD 230K-385K

Asynchronous programming | Backpressure | C++ | Concurrency | Consistency

Hybrid work model | On Call Pay N/A | Relocation assistance

Senior-level Full Time

San Francisco

12h ago
Robotics Autonomy Engineer-Planning and Control (Federal) USD 70K-300K

C++ | Control Systems | Depth cameras | GPS | IMU

Hybrid remote option | Onsite collaboration

Mid-level Full Time

Irvine, CA

12h ago
Data Ops Engineer USD 128K-170K

Apache Airflow | Cloud platform | Data Warehouse | DevOps | ETL

Dental insurance | Life insurance | Long-term disability insurance | Medical insurance | Paid time off

Senior-level Full Time

Miami, Florida

12h ago
SAP iXp Intern - AI Scientist USD 30K-124K

CrewAI | Deep learning | Fine Tuning | Graph Machine Learning | Langchain

Beverages and coffee | Expert presentations | Free lunch | Ice cream | Intern community events

Entry-level Internship

Palo Alto, CA, US, 94304

13h ago
Analytics, Finance & Strategy USD 270K-320K

AWS | Apache Airflow | Cloud platform | DBT | Dashboards

Flexible working hours | Generous vacation | Optional equity donation matching | Parental leave

Mid-level Full Time

San Francisco, CA | New York …

13h ago
Senior Data Engineer USD 100K-125K

AWS | Agile | Azure | C# | CI/CD

Remote work

Senior-level Full Time

Denver, Colorado, United States R

13h ago
Robotics Infrastructure Engineer USD 94K-220K

Alerting | Build Automation | CI runners | CI/CD | Camera Models

High autonomy | High output environment

Mid-level Full Time

Watertown, MA

13h ago
Staff Data Analyst USD 154K-239K

Data Modeling | Data Pipelines | Data Quality | Data Transformation | Data Visualization

Diversity, equity, inclusion and belonging | Social impact | Well-being programs

Senior-level Full Time

San Francisco, California

13h ago
Senior Software Engineer, AI/ML USD 142K-203K

Agent systems | Algorithms | Anthropic API | Backend APIs | CI/CD

Senior-level Full Time

Austin, Texas, United States

13h ago
Data Platform Engineer USD 235K-376K

CI/CD | Data Contracts | Data Modeling | Data Products | Data pipeline

Cell phone reimbursement | Health/Dental/Vision | Learning & development stipend | Mental Health & Wellness | PTO

Mid-level Full Time

San Francisco, CA • New York, … R

14h ago
Senior Software AI Engineer USD 149K-186K

.NET | AWS | Agentic Systems | Cloud Computing | Code review

401k match | Commuter benefits | Fertility and Family Planning Programs | Fitness benefits | Health insurance

Senior-level Full Time

Remote US R

14h ago
Agentic Analytics Engineer USD 186K-256K

AI Agents | Airflow | Automation | BigQuery | DBT

401k matching | Life insurance | Medical/Dental/Vision insurance | Unlimited PTO

Mid-level Full Time

Seattle, Washington, United States

14h ago
AI Engineer II USD 100K-215K

API Design | APIs | Azure | C# | C++

Mid-level Full Time

Redmond, WA, US

14h ago
Sr. Data Engineer - US (Remote) USD 159K-254K

AWS | Alerting | Apache Airflow | Apache Kafka | Apache Spark

Remote work | Unlimited AI token budget

Senior-level Full Time

United States R

14h ago
Staff Machine Learning Engineer- AI Governance USD 169K-270K

AI Governance | Agentic Frameworks | Bias | Data Drift | Docker

Senior-level Full Time

Foster City, CA, United States

15h ago
Executive Director, AI/ML Drug Discovery Analytics USD 326K-383K

Cloud Computing | Deep learning | Diffusion Models | GPU Computing | Generative Models

Executive-level Full Time

Redwood City, California, United States

15h ago
