Staff Software Engineer, Inference
Tasks
- Design cross team initiatives
- Drive distributed systems performance improvements
- Implement request routing and scheduling
- Improve system reliability
- Lead inference platform architecture
- Manage GPU resources
- Optimize batching and memory usage
- Optimize inference performance
Perks/Benefits
- 401k employer match
- Dental insurance
- Employee stock purchase program
- Flexible PTO
- Flexible spending account
- Health insurance
- Health savings account
- Life insurance
- Long-term disability insurance
- Paid parental leave
- Paid sick leave
- Short-term Disability Insurance
- Tuition reimbursement
- Vision insurance
Skills/Tech-stack
BF16 | C++ | CUDA | Distributed Systems | FP8 | GPU interconnects | Go | Inference Server | Kubernetes | Latency optimization | Mixed Precision | NCCL | NUMA | Networking | Performance optimization | Python | RDMA | Ray Serve | Streaming inference | TensorRT-LLM | Throughput Optimization | Torchserve | Triton Inference | Triton Inference Server | VLLM
Education
N/A
Roles
Regions
Countries
States
Related jobs
-
Summer 2026 Data Engineer USD 41K-50KAPIs | Agile | Azure Data | Azure Data Factory | Azure Data LakeExposure to real-world projects | Learning and development opportunities | MentorshipEntry-level InternshipBoston, MA, United States9h ago
-
Early-Career Network Engineer (RAN Optimization) USD 85K-130K4G | 5G | Automation | C Band | CBRS401k match | Dental insurance | Disability insurance | Educational assistance | Financial wellness programsMid-level Full TimePlano,Texas,United States R9h ago
-
Senior Embedded Software Engineer USD 146K-196KARM Cortex | C# | Digital Signal | Digital Signal Processing | Embedded Linux401k match | Dental insurance | Employee assistance program | FSA | Flexible scheduleSenior-level Full TimeCamarillo, CA, United States9h ago
-
Data Engineer USD 126K-208KAPI Integration | Airflow | Amazon Web Services | BigQuery | CCPADEI initiatives | Dental benefits | Employee rewards program | Medical benefits | Mental health supportMid-level Full TimeRemote, United States R9h ago
-
Alerting | Ansible | Bash | CI/CD | CephRemote workSenior-level Full TimeUnited States, United States R10h ago
-
Ansible | Bash | CI/CD | CentOS | CephContract-to-hire | No sponsorship | Remote workSenior-level Full TimeUnited States, United States R10h ago
-
HPG Big Data Engineer / Senior-Level USD 119K-164KAgile | Azure Data | Azure Data Lake | Azure Data Lake Storage | Azure FunctionsSenior-level Full TimeNashville, TN, United States11h ago
-
Platform and Integrations Engineer USD 100K-200KAnthropic | GraphQL | Next.js | Node.js | OpenAIIn-person work | Medical/Dental/Vision | Significant equity upsideEntry-level Full TimeSan Francisco, CA, US12h ago
-
Machine Learning Engineer USD 131K-178KAWS | Cassandra | Convolutional Neural Networks | Data Lakes | Data PipelinesMid-level Full TimeRemote, NY, US R12h ago
-
Amazon S3 | Data Engineering | Data Modeling | Data Pipelines | Data QualitySenior-level Full TimeNew York13h ago
-
Amazon S3 | Automation | Data Engineering | Data Modeling | Data Pipelines401k match | Dental insurance | Life insurance | Long-term disability | Medical insuranceSenior-level Full TimePrinceton13h ago
-
Senior Databricks Forward Deployed Engineer - GPS USD 119K-198KAPI Integration | AWS | Airflow | Azure | CI/CDTravelSenior-level Full TimeArlington/Rosslyn, Virginia, United States; Atlanta, Georgia, …14h ago
-
Lead Databricks Forward Deployed Engineer - GPS USD 189K-372KAPI Integration | AWS | Airflow | Apache Spark | AzureSenior-level Full TimeArlington/Rosslyn, Virginia, United States; Atlanta, Georgia, …14h ago
-
Lead AI and Data Solutions Engineer II USD 137K-229KAmazon Web Services | Apache Spark | Application Programming | Application Programming Interfaces | Cloud ComputingSenior-level Full TimeSacramento, California, United States; Tempe, Arizona, …14h ago
-
TikTok Shop - E-commerce Anti-Fraud Data Scientist USD 156K-296KA/B | A/B Testing | Analytics | B testing | Big DataMid-level Full TimeSeattle, Washington, United States14h ago
-
Software Engineer, Systems ML - SW/HW Co-design USD 117K-173KAI infrastructure | Bias Mitigation | C# | C++ | Co-designSenior-level Full TimeSunnyvale, CA | Redmond, WA15h ago
-
Software Engineer, Machine Learning USD 213K-293KAPI Design | Agent Orchestration | Artificial Intelligence | Bias Mitigation | C++Senior-level Full TimeSunnyvale, CA | Remote, US | … R15h ago
-
Acoustics | Algorithm Integration | Audio Software | Bring-up | C++Senior-level Full TimeMountain View, CA, USA15h ago
-
Senior Staff Software Engineer, AI Innovation USD 262K-365KC++ | Cross-Functional Collaboration | Cross-functional | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeMountain View, CA, USA15h ago
-
Staff Software Engineer, AI/ML Performance USD 207K-300KAlgorithms | Auto sharding | C++ | Code debugging | Code generationSenior-level Full TimeSunnyvale, CA, USA15h ago
-
C++ | Data Processing | Debugging | Deep learning | Few-Shot LearningSenior-level Full TimeMountain View, CA, USA15h ago
-
GTM Applied AI Architect, Google Cloud USD 153K-222KAgent Development | Agent Development Kit | Cloud platform | Function Calling | GeminiSenior-level Full TimeAustin, TX, USA; Boulder, CO, USA15h ago
-
Software Engineer III, Generative AI, Payments Risk USD 147K-211KAgent systems | Algorithms | Analytics | Big Data | Computer VisionSenior-level Full TimeMountain View, CA, USA15h ago
-
Software Engineer III, Infrastructure, Infra Spanner USD 147K-211KC++ | Concurrency | Consensus Algorithms | Data Corruption | Data corruption diagnosisSenior-level Full TimeSunnyvale, CA, USA15h ago
-
C++ | Data Analysis | Data Processing | Deep learning | EmbeddingsSenior-level Full TimeMountain View, CA, USA15h ago