Senior Software Engineer - AI Inference
Tasks
- Deploy and operate machine learning systems at scale
- Design and build scalable inference infrastructure
- Design and operate production distributed systems
- Drive architecture and technical decisions for inference platform
- Improve model deployment observability and production performance
- Lead integration of inference runtimes and serving frameworks
- Mentor junior engineers on system design and performance optimization
Perks/Benefits
- 401k match
- Dental insurance
- Life insurance
- Medical insurance
- Paid Holidays
- Paid time off
- Vision insurance
- Wellness programs
Skills/Tech-stack
Batching | CUDA | Caching | Distributed Systems | High Performance | High-Performance Computing | Inference Optimization | KServe | Kubernetes | Load Balancing | Machine Learning | Memory Aware Serving | NCCL | NVIDIA GPU | ONNX | Observability | Performance Computing | Prompt Caching | PyTorch | Request Routing | Request Scheduling | Structured Sampling | TensorRT | Traffic Management | Triton | VLLM
Education
N/A
Regions
Countries
States
Cities
Related jobs
-
Lead Researcher, Large Language Models/LLM, TikTok USD 224K-410KData Processing | Deep learning | Inference Optimization | Language Models | Large Language ModelsSenior-level Full TimeSan Jose, California, United States5h ago
-
Partner Engineer, Generative AI USD 159K-223KAWS | Azure | Bias Mitigation | C plus plus | Cloud PlatformsEntry-level Full TimeMenlo Park, CA6h ago
-
Adversarial ML | Benchmarking | Data Mining | Environment Design | Function CallingMid-level Full TimeMountain View, CA, USA; New York, …6h ago
-
Software Engineer III, AI/ML, gUP Customer Support USD 147K-211KAgile | C++ | Code review | Data Processing | DebuggingSenior-level Full TimeSunnyvale, CA, USA6h ago
-
Computer Vision | Data Processing | Debugging | Deep learning | GenAISenior-level Full TimeMountain View, CA, USA6h ago
-
Staff Software Engineer, Deep Data Research, Applied AI USD 207K-300KComputer Vision | Data Processing | Debugging | Fine Tuning | Language ModelingSenior-level Full TimeSunnyvale, CA, USA6h ago
-
Senior Software Engineer, Database Internals AlloyDB USD 174K-252KC plus plus | C# | Code optimization | Compute Technologies | Concurrency ControlEntry-level Full TimeSunnyvale, CA, USA6h ago
-
Senior Software Engineer, AI/ML GenAI, Google Cloud AI USD 174K-252KC++ | Cloud AI | Computer Vision | Data Processing | Google CloudSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA6h ago
-
Computer Vision | Data Processing | Debugging | Distributed Computing | Generative AIMid-level Full TimeMountain View, CA, USA6h ago
-
Staff Software Engineer, AI-Powered GRC Automation USD 207K-300KCloud Platforms | Cloud platform | Continuous controls monitoring | Controls monitoring | Data PipelinesSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA6h ago
-
Senior Staff Software Engineer, BigQuery Core Analytics USD 262K-365KBigQuery | C plus plus | C++ | CI/CD | Distributed SystemsSenior-level Full TimeKirkland, WA, USA; Sunnyvale, CA, USA6h ago
-
Staff Machine Learning Engineer, Embeddings USD 253K-354KA/B | A/B Testing | B testing | C++ | Cloud ComputingCaregiving support | Comprehensive healthcare benefits | Employer 401k match | Family planning support | Flexible vacationSenior-level Full TimeRemote - United States R13h ago
-
ML Engineer, Surrogate Modeling (Vehicle Engineering) USD 125K-175KActive Learning | Adaptive Sampling | CFD | Continuous integration | Data Pipelines401k retirement plan | Employee stock purchase plan | Life insurance | Long-term disability insurance | Long-term incentivesEntry-level Full TimeHawthorne, CA13h ago
-
Lead ML Inference Engineer, Advertising USD 246K-486KCo-design | Distributed Systems | GPU Acceleration | Hardware-Software Co-design | Hardware/softwareDisability benefits | Equity awards | Health insurance | Life insurance | Paid time offSenior-level Full TimeSan Jose, California13h ago
-
Senior Software Engineer (Search / Retrieval) USD 180K-240KBM25 | Distributed Systems | Elasticsearch | Entity recognition | Language ProcessingFlexible work environment | Remote work opportunitySenior-level Full TimePalo Alto, California16h ago
-
Software Engineer - Developer Products (AI) USD 170K-240KAPI Design | APIs | CLIs | Data Structures | Data Structures and AlgorithmsEmployee benefits package | Remote-friendly work environmentSenior-level Full TimeSan Francisco, California16h ago
-
Senior-level Full TimePalo Alto, California16h ago
-
Staff AI Engineer USD 215K-285KA/B | A/B Testing | B testing | Behavioral signals | Distributed SystemsSenior-level Full TimePalo Alto, California; San Francisco, California16h ago
-
AI Software Engineer (Vehicle Engineering) USD 125K-175K3D Reconstruction | Agent systems | Agentic AI | Anomaly Detection | CI/CD401k retirement plan | Dental insurance | Employee stock purchase plan | Life insurance | Life insurance coverageSenior-level Full TimeHawthorne, CA17h ago
-
Lead AI Engineer (ML Ops) USD 116K-170KAPIs | AWS | Agile Scrum | Azure | CI/CD401k match | Employee assistance program | Employee charity match | Employee stock purchase plan | Health and wellness allowanceSenior-level Full TimeIrving - 6011 Connection, United States17h ago
-
Platform and Databricks DevOps Engineer USD 77K-176KAWS | Azure | Bash | CI/CD | DatabricksDependent care | Paid leave | Professional development | Tuition assistance | Work-life programsMid-level Full TimeUSA, VA, McLean (8283 Greensboro Dr, …17h ago
-
Adaptive Systems | Apache Spark | C++ | CUDA | Data ProcessingBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States17h ago
-
Developer Advocate – Robotics and Physical AI USD 184K-287KAgentic AI | Embedded Systems | GPUs | Generative AI | Isaac LabSenior-level Full TimeUS, CA, Santa Clara, United States17h ago
-
Data AI Solutions Sr Analyst - C12 - TAMPA USD 87K-130KAgentic AI | Artificial Intelligence | Data Catalog | Data Governance | Data LineageSenior-level Full Time3800 CITIGROUP CENTER DRIVE BUILDING F …17h ago
-
Senior Bioinformatics Data Engineer (Consultant) USD 168K-200KADaM | AWS ECS | AWS ECS Fargate | AWS Glue | AWS Glue CatalogBackground check | Code review feedback loops | Compliance training | Embedded pair working model | VPN AccessSenior-level Full TimeRaleigh, United States17h ago