LLM Inference Frameworks and Optimization Engineer

San Francisco, Singapore, Amsterdam

USD 160K-230K Mid-level Full Time

@ T...

Apply Save

Found 1mo ago

Tasks

Analyze inference performance bottlenecks
Apply CUDA graph optimizations
Design distributed inference engines
Develop model execution plans
Implement distributed inference strategies
Implement speculative decoding
Optimize GPU TPU and accelerator performance
Optimize TensorRT and TRT-LLM graphs
Optimize end to end model serving pipelines
Optimize inference latency and throughput
Perform software hardware co design
Use torch compile for model execution

Perks/Benefits

Skills/Tech-stack

Education

N/A

Roles

Apply Save

Language: en Views: 2

Clicks: 0

Saves: 0

Related jobs

Featured Feat. Principal Knowledge & Data Architect USD 174K-284K

AWS Neptune | Canonicalization | Chunking | Cypher | DBT

Benefits including health and wellness programs | Health, wellness, and retirement plans

Senior-level Full Time

Headquarters - Chevy Chase, MD R

21d ago
Consultant- Marketing Data Science & AI USD 98K-122K

AWS | Azure | Cloud platform | Clustering | Data Engineering

401k match | Cell phone stipend | Employee assistance program | FSA | HSA

Mid-level Full Time

San Francisco, CA, United States

8h ago
Senior Palantir Data Engineer USD 135K-200K

AWS Glue | AWS Lambda | AWS S3 | Agile | Amazon Athena

Senior-level Contract Full Time

Lavallette, NJ, United States

9h ago
Senior AI/ML Data Engineer USD 162K-226K

AWS | Apache Airflow | BigQuery | CI/CD | Cloud Composer

Career growth | Flexible remote days | In office five days per week

Senior-level Full Time

Frisco, TX, United States R

10h ago
Embedded Software Engineer– Technical Lead USD 150K-175K

ARM | Bring-up | C++ | CI/CD | Debugging

401k match | Health insurance | Paid time off

Senior-level Full Time

Waltham, MA, United States

11h ago
Data Engineer USD 95K-178K

AWS | Airflow | Azure | Data Modeling | Databricks

Mid-level Contract

Beaverton, OR, US

12h ago
Manager, AI Engineering USD 49K

Artificial Intelligence | CI/CD | Computer Vision | Deep learning | Docker

Mid-level Full Time

Morrisville, North Carolina, États-Unis d’Amérique

14h ago
Senior Software Engineer USD 137K-220K

AWS | Automated testing | CI/CD | Docker | GCP

Senior-level Full Time

Tel Aviv-Yafo, Tel Aviv District, IL

15h ago
MLOps Engineer ID72409 USD 125K-193K

CI/CD | Cloud Computing | Concept drift | Concept drift detection | Data Drift

Annual learning budget | Collaborative culture | Flexible hours | Internal TechTalks | Mentorship

Mid-level Full Time

Arlington, United States R

15h ago
Database Developer A USD 120K-150K

Azure DevOps | Data Modeling | Data integration | ETL | GitHub

Remote work

Senior-level Contract

Saint Paul, United States

15h ago
Senior Data Management Professional - Data Engineering - Private Funds USD 110K-190K

Agile | Data analytics | Distributed Systems | ETL | Machine Learning

401k match | Dental insurance | Life insurance | Long-term disability | Medical insurance

Senior-level Full Time

New York

15h ago
Applied AI Engineer III USD 139K-232K

.NET | AI Agent | AI agent orchestration | AWS | AWS Bedrock

Senior-level Full Time

Boston, Massachusetts, United States

15h ago
Lead Applied AI Site Reliability Engineer II - PxE A&A USD 113K-232K

.NET | ArgoCD | Automation | Bash | C#

Mid-level Full Time

Dallas, Texas, United States; Hermitage, Tennessee, …

15h ago
Delivery Senior Consultant, Data Engineering and Gen AI Conversion Solutions USD 155K-265K

.NET | AWS | Agile | Amazon Web Services | Angular

Senior-level Full Time

Gilbert, Arizona, United States; Lake Mary, …

15h ago
Data Engineer II USD 84K-140K

Agile | Amazon Redshift | Apache Airflow | CI/CD | Cloud Data

Mid-level Full Time

Arlington/Rosslyn, Virginia, United States; Baltimore, Maryland, …

15h ago
Palantir Foundry Data Engineer USD 84K-140K

Agile | Batch Processing | Cloud Computing | Continuous integration | Data Lineage

Mentorship | Professional development | Travel 10% average

Mid-level Full Time

Arlington/Rosslyn, Virginia, United States; Baltimore, Maryland, …

15h ago
Lead Applied AI Engineer II USD 124K-255K

.NET | AWS | AWS Bedrock | Agent Orchestration | Agentic AI

Senior-level Full Time

Dallas, Texas, United States; Hermitage, Tennessee, …

15h ago
Lead Applied AI Site Reliability Engineer II - ServiceNow USD 113K-232K

.NET | AI | AI SSDLC | Agentic Workloads | Argo CD

Mid-level Full Time

Dallas, Texas, United States; Hermitage, Tennessee, …

15h ago
Applied AI SRE III - PxE GPS USD 102K-210K

.NET | Amazon Web Services | ArgoCD | Automation | Bash

Senior-level Full Time

Dallas, Texas, United States; Hermitage, Tennessee, …

15h ago
Senior AI Engineer USD 142K-186K

AWS | Agent systems | Azure | Containerization | Database Management System

Senior-level Full Time

Tel Aviv-Yafo, Tel Aviv District, IL

16h ago
Senior Software Engineer, Fleet-level ML Performance USD 174K-252K

C++ | Computer Architecture | Deep learning | Embedding architectures | Machine Learning

Senior-level Full Time

Sunnyvale, CA, USA

17h ago
Forward Deployed Engineer, Higher Education, Google Public Sector USD 207K-300K

API Integration | Agent systems | BigQuery | CI/CD | CrewAI

Equity compensation | Health insurance | Paid time off | Retirement plan

Senior-level Full Time

Sunnyvale, CA, USA

17h ago
Senior Data Engineer, Trust and Safety, Technology and Data Enablement USD 156K-226K

Big Data | Data Governance | Data Lifecycle Management | Data Management | Data Modeling

Senior-level Full Time

Austin, TX, USA

17h ago
Forward Deployed Engineer IV, GenAI, Google Cloud USD 207K-300K

APIs | Agent systems | Cloud platform | Cost Per | Cost Per Request

Senior-level Full Time

New York, NY, USA; Austin, TX, …

17h ago
Software Engineer, Infrastructure, Ads Privacy Data Governance USD 147K-210K

Agentic Workflows | C++ | Data Processing | Data Storage | Data Structures

Mid-level Full Time

Mountain View, CA, USA; New York, …

17h ago

LLM Inference Frameworks and Optimization Engineer

Tasks

Perks/Benefits

Skills/Tech-stack

Education

Roles

Regions

Countries

States

Cities

Related jobs