Principal LLM Inference Engineer

Santa Clara

USD 195K-285K | Senior-level | Full Time

@ d...

Found 21d ago

Tasks

Build and maintain inference runtimes and serving frameworks
Build proof of concept systems
Create technical publications and open source contributions
Develop and tune custom kernels
Drive quantization, sparsity, and batching strategies
Manage distributed inference (tensor, pipeline) and KV cache
Optimize operators for throughput and latency
Prototype LLM inference use cases
Provide workload insights to hardware, firmware, and compiler teams
Translate POCs into customer demonstrations

Perks/Benefits

Skills/Tech-stack

