Senior Inference Engineer, AIConfigurator for Dynamo

US, CA, Santa Clara, United States

USD 184K-356K Senior-level Full Time

@ N...

Apply Save

Found 1mo ago

Tasks

Build LLM serving optimization engine
Build Python and Rust APIs
Create CLIs and SDK surfaces
Develop configuration search and SLA aware ranking
Develop schema tests documentation and automation
Estimate efficiency and latency
Generate backend specific deployment artifacts
Implement inference runtime abstractions for parallelism and batching
Integrate performance databases and profiling data
Perform Pareto frontier analysis
Validate simulated performance against deployed results

Perks/Benefits

Skills/Tech-stack

Apply Save

Language: en Views: 5

Clicks: 1

Saves: 0

Related jobs

Featured Feat. Principal Knowledge & Data Architect USD 174K-284K

AWS Neptune | Canonicalization | Chunking | Cypher | DBT

Benefits including health and wellness programs | Health, wellness, and retirement plans

Senior-level Full Time

Headquarters - Chevy Chase, MD R

3d ago
Featured Feat. Associate Director, Data Labs USD 167K-167K

AWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM Governance

Conference speaking opportunities | Hybrid work schedule | Media appearances

Senior-level Full Time

Washington, District of Columbia, 20004, United … R

17d ago
Data Engineer USD 144K-180K

APIs | AWS | Alerting | Amazon Kinesis | Batch Processing

Senior-level Full Time

US - Remote; Canada - Remote R

10h ago
Software Engineer, AI Integrations USD 130K-170K

AWS | Alerting | C++ | CI/CD | CSS

Senior-level Full Time

Mountain View, CA

12h ago
Staff Software Engineer, AI Integrations USD 170K-240K

Agile | Alerting | Amazon Web Services | C++ | CSS

Senior-level Full Time

Mountain View, CA

12h ago
Senior Software Engineer Remote/Travel- Active Secret -Top Secret clearance(SCI-eligibility) USD 145K-204K

API Testing | Cypher | Data Processing | DataOps | DevOps

Competitive benefits | Growth opportunity | Remote work | Travel

Senior-level Full Time

Reston, VA, United States R

22h ago
Production AI Engineer Owning High-Scale Cloud Infrastructure USD 145K-165K

AWS | AWS CloudFormation | Amazon ECS | Automation | CI/CD

401-k plan | Company match | Employee stock purchase plan | Healthcare benefits | Life insurance

Mid-level Full Time

Atlanta, GA, United States

1d ago
Senior Staff Generative AI Scientist USD 185K-210K

AWS | Apache Spark | Artificial Intelligence | Azure | Cloud platform

401k matching | Dental insurance | Disability insurance | Health insurance | Life insurance

Senior-level Full Time

Remote, United States R

1d ago
Machine Learning Engineer USD 117K-177K

AWS | Azure | Cloud Computing | Data Visualization | Google Cloud

Mid-level Full Time

St. Louis, Missouri, United States

1d ago
Software Engineer - X Data USD 125K-400K

Apache Flink | Apache Kafka | Apache Spark | BigQuery | Cause analysis

401k retirement plan | Dental insurance | Discounts | Health insurance | Life insurance

Mid-level Full Time

Palo Alto, CA

1d ago
Embedded Software Engineer Intern (Fall 2026) USD 108K-108K

ADC | Antenna Testing | C# | C++ | CAN

Housing stipend | Overtime pay | Paid sick time | Relocation support

Entry-level Internship

South San Francisco, California, USA

1d ago
Site Reliability Engineer, Inference Infrastructure USD 165K-267K

Amazon Web Services | C plus plus | Cloud platform | Distributed Systems | GPU

401k matching | Co working Benefit | Company offsite | Education and learning stipend | Health and dental benefits

Mid-level Full Time

Toronto

1d ago
Staff Software Engineer, Inference Infrastructure USD 160K-220K

Amazon Web Services | Azure | C++ | Cloud platform | Distributed Systems

401k | Annual company offsite | Arts and culture budget | Education and learning stipend | Health and dental benefits

Senior-level Full Time

San Francisco

1d ago
AI Engineer – Decision & Optimization Systems USD 80K-210K

AMPL | C++ | CPLEX | Constraint Satisfaction | Go

401k | Full healthcare coverage | Unlimited PTO

Senior-level Full Time

El Segundo, CA

1d ago
AI Engineer - Allocation and Packing Systems USD 80K-210K

A/B | A/B Testing | Approximation algorithms | Assignment Algorithms | B testing

401k | Equity grant | Full healthcare coverage | Unlimited PTO

Senior-level Full Time

El Segundo, CA

1d ago
AI Engineer – Routing & Network Optimization USD 80K-210K

A Star | A Star Algorithm | AMPL | Bellman Ford Algorithm | CPLEX

401k | Equity grant | Full healthcare coverage | Unlimited PTO

Senior-level Full Time

El Segundo, CA

1d ago
AI Software Engineer USD 80K-210K

AWS Kinesis | Apache Airflow | Apache Kafka | C# | C++

401k | Equity grant | Full healthcare coverage | Unlimited PTO

Senior-level Full Time

El Segundo, CA

1d ago
Deployment Strategist - North America USD 150K-245K

AI Agent | AI Agent Frameworks | API Integration | Agent Frameworks | Agent Orchestration

Annual company offsite | Annual travel stipend | Co-working stipend | Professional development stipend

Mid-level Full Time

United States R

1d ago
Applied AI Engineer USD 204K-352K

Backend Development | Cloud infrastructure | Deep learning | Docker | Fine Tuning

Annual membership | Centralized coffee stipend | Flexible PTO | Health and wellness stipend | Health, dental, and vision insurance

Senior-level Full Time

New York, NY (HQ)

1d ago
Senior Applied Scientist - Search USD 200K-200K

Data Science | Embedding | Fine Tuning | Hybrid search | Information Retrieval

401k retirement | Annual leave | Equity package | Health coverage | Hybrid schedule

Senior-level Full Time

New York City R

1d ago
ML Research Engineer - Hardware Codesign USD 185K-455K

C++ | CUDA | Floating point | Floating point numerics | Functional simulation

Hybrid work schedule | Relocation assistance

Senior-level Full Time

San Francisco

1d ago
Staff AI Engineer USD 140K-210K

API Development | AWS | Access Management | Bedrock AgentCore | CI/CD

401k | All-hands meetings | Dental insurance | Employee equity program | Health insurance

Senior-level Full Time

New York City

1d ago
Senior Quantum Embedded Engineer USD 142K-175K

10G Ethernet | AMD Xilinx | AMD Xilinx toolchain | AMD/Xilinx SoC | Bash

Hybrid work options | Remote work options

Senior-level Full Time

New Haven, CT

1d ago
Associate Quantum Engineer USD 130K-184K

Cryogenic Systems | Data Analysis | Data acquisition | Instrument Control | Laboratory Instrument Control

Interdisciplinary team | Mentorship | State-of-the-art facilities

Mid-level Full Time

New Haven, CT

1d ago
Quantum Engineer (Physicist) USD 136K-187K

Circuit-QED | Cryogenics | Data Analysis | Error correction | Low temperature physics

Mid-level Full Time

New Haven, CT

1d ago

Senior Inference Engineer, AIConfigurator for Dynamo

Tasks

Perks/Benefits

Skills/Tech-stack

Education

Roles

Regions

Countries

States

Cities

Related jobs