Software Engineer, Inference Platform

San Francisco, CA

USD 200K-250K Mid-level Full Time

@ F...

Apply Save

Found 1mo ago

Tasks

Build and operate KV cache and scheduling infrastructure
Contribute to inference platform architecture and roadmap
Drive improvements in throughput TTFT and cost per token
Implement and validate disaggregated prefill and decode pipelines
Own inference deployments end-to-end
Participate in on-call rotation to maintain system reliability
Partner with customers to optimize deployment configurations
Profile and resolve bottlenecks across compute memory and communication

Perks/Benefits

Skills/Tech-stack

Apply Save

Language: en | Views: 2 | Clicks: 0 | Saves: 0

Related jobs

Staff Machine Learning Engineer- AI Governance USD 169K-270K

AI Governance | Agentic Frameworks | Bias | Data Drift | Docker

Senior-level Full Time

Foster City, CA, United States

7h ago
Generative AI Consultant USD 94K-114K

AWS | Anthropic | Azure | CI/CD | Chroma

401k plan | Dental insurance | Flexible spending account | Flexible work environment | Gym reimbursement

Mid-level Full Time

New York, NY, United States

8h ago
Principal AI Engineer USD 115K-160K

API Design | Agentic Systems | Artificial Intelligence | Backend Development | Data Pipelines

Business travel insurance | Dental insurance | Disability insurance | Employee assistance program | Employee stock purchase plan

Senior-level Full Time

Dallas, TX, United States

9h ago
Data Engineer (remote) USD 85K-100K

Agile | Apache Spark | Artificial Intelligence | Azure Data | Azure Data Factory

401k match | Employee assistance program | Flexible schedule | Health insurance | Paid parental leave

Mid-level Full Time

Work From Home, United States R

13h ago
Software Engineer - Dragonfly Portfolio USD 160K-215K

Cryptography | Distributed Systems | Event Ingestion | Onchain Event Ingestion | Performance optimization

Onsite work location

Mid-level Full Time

San Francisco

13h ago
Tech Lead, GTM Applied AI and Analytics USD 138K-225K

Airflow | Amazon SageMaker | DBT | Databricks | Deep learning

Senior-level Full Time

San Francisco, CA, United States

14h ago
Software Integration Engineer II USD 101K-219K

Ansible | Bash | C# | CentOS | Confluence

Mid-level Full Time

Salt Lake City, Utah

15h ago
Data Engineer ID50062 USD 148K-164K

AWS | AWS SageMaker | AWS SageMaker Studio | Airflow | Apache Spark

Education budget | Fitness budget | Flexible schedule | Mentorship | Office options

Senior-level Full Time

Blacksburg, United States

15h ago
Full Stack Developer - Cloud Engineer USD 107K-160K

API Design | Agile | Amazon SageMaker | Analytical Data | Analytical Data Warehouse

Mid-level Full Time

Columbus, Ohio, United States

16h ago
Sr. Tech Lead, GTM Applied AI & Analytics USD 150K-243K

Airflow | Data Warehousing | Databricks | Fine Tuning | LLM APIs

Senior-level Full Time

San Francisco, CA, United States

17h ago
Data Engineer, Analytics USD 205K-235K

Data Governance | Data Modeling | Data Quality | Data Security | Data Visualization

Entry-level Full Time

Seattle, WA

18h ago
Software Engineer, Machine Learning USD 185K-200K

Classification | Computer Vision | Data Mining | Data Regression | Deep learning

Mid-level Full Time

Menlo Park, CA

18h ago
Data Engineer (Analytics) USD 191K-235K

Big Data | Data Modeling | Data Warehousing | Data integration | Dimensional Modeling

Domestic and international travel | Telecommuting

Mid-level Full Time

Menlo Park, CA | Remote, US R

18h ago
Robotics Manipulation Engineer USD 157K-240K

Adaptive Control | C plus plus | Control Systems | Deep learning | GPU

Senior-level Full Time

Fremont, CA

18h ago
Robotics Engineer - Logistics and Material Flow USD 170K-240K

AGV | Automation | C++ | Cause analysis | Computer Vision

Travel to data centers for engineering studies

Senior-level Full Time

Fremont, CA

18h ago
Software Engineer III, AI/ML, YouTube Ads, User Experiences USD 147K-211K

Ad Ranking | Algorithms | C++ | Data Processing | Data Structures

Senior-level Full Time

Mountain View, CA, USA

18h ago
Research Engineer, World Models, DeepMind USD 147K-211K

Accelerator Training | C++ | Deep learning | Distributed Training | GPU Computing

Mid-level Full Time

London, UK; New York, NY, USA

18h ago
AI Engineer, Professional Services, Google Cloud USD 183K-265K

Apache Beam | Apache Spark | C++ | Data Validation | Data Warehousing

Technical workshops | Travel opportunities

Senior-level Full Time

Austin, TX, USA; Atlanta, GA, USA

18h ago
Senior Software Engineer, AI/ML GenAI, Core USD 174K-252K

Algorithms | C++ | Computer Vision | Data Processing | Data Structures

Senior-level Full Time

Kirkland, WA, USA; Sunnyvale, CA, USA

18h ago
Software Engineer III, AI/ML, Core USD 147K-211K

Algorithms | Data Processing | Data Storage | Data Structures | Debugging

Senior-level Full Time

Sunnyvale, CA, USA

18h ago
Software Engineer III, AI/ML, Google Workspace USD 147K-211K

C++ | Data Processing | Debugging | Language Processing | ML Infrastructure

Senior-level Full Time

Boulder, CO, USA

18h ago
Product Software Modernization Engineer, Quantum AI USD 147K-211K

Bazel | Cloud Spanner | Cloud Storage | Cloud platform | Distributed cloud

Mid-level Full Time

Seattle, WA, USA; Goleta, CA, USA

18h ago
Software Engineer III, Infrastructure, GDC AI Storage USD 147K-211K

CSI | Data Structures | Data Structures and Algorithms | Distributed Systems | Go

Senior-level Full Time

Kirkland, WA, USA

18h ago
Partner Solutions Engineer, Machine Learning and Digitization Operations USD 127K-183K

Automation | C++ | CSS | Database Design | HTML

Mid-level Full Time

Ann Arbor, MI, USA

18h ago
Software Engineering - DataStage Developer USD 112K-129K

Axway | Axway Secure SFTP | Azure | Azure DevOps | CA7 Scheduler

Hybrid work schedule | Remote work

Mid-level Full Time

Syracuse, New York, United States

21h ago

Software Engineer, Inference Platform

Tasks

Perks/Benefits

Skills/Tech-stack

Education

Roles

Regions

Countries

States

Cities

Related jobs