Senior Software Engineer - AI Inference
Tasks
- Deploy and operate machine learning systems at scale
- Design and build scalable inference infrastructure
- Design and operate production distributed systems
- Drive architecture and technical decisions for inference platform
- Improve model deployment observability and production performance
- Lead integration of inference runtimes and serving frameworks
- Mentor junior engineers on system design and performance optimization
Perks/Benefits
- 401k match
- Dental insurance
- Life insurance
- Medical insurance
- Paid Holidays
- Paid time off
- Vision insurance
- Wellness programs
Skills/Tech-stack
Batching | CUDA | Caching | Distributed Systems | High Performance | High-Performance Computing | Inference Optimization | KServe | Kubernetes | Load Balancing | Machine Learning | Memory Aware Serving | NCCL | NVIDIA GPU | ONNX | Observability | Performance Computing | Prompt Caching | PyTorch | Request Routing | Request Scheduling | Structured Sampling | TensorRT | Traffic Management | Triton | VLLM
Education
N/A
Regions
Countries
States
Cities
Related jobs
-
Featured Feat. Applied AI Engineer - Bay Area USD 211K-263KArtificial Intelligence | C plus plus | C# | Embeddings | Feature Engineering401k | Comprehensive health and wellness benefits | Learning and development opportunities | Unlimited time offMid-level Full TimeHQ (San Francisco)25d ago
-
Senior-level ContractAustin, United States9h ago
-
AI Search | Agile | Angular | Azure AI | Azure AI Search100 percent onsite | Public trust clearance requiredSenior-level ContractWoodlawn, United States9h ago
-
Senior Applied AI Engineer USD 160K-210KAPI Design | AWS | CI/CD | Circuit Breakers | DockerDynamic work environment | Flexible working hoursSenior-level Full TimeUS - Remote, Canada - Remote R9h ago
-
AI APIs | Backend Development | Data Engineering | Data Pipelines | Frontend DevelopmentMid-level ContractChandler, United States9h ago
-
Azure Databricks Developer USD 125K-198KApache Spark | Azure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake StorageSenior-level Full TimeLouisville, Kentucky, United States10h ago
-
Java Full Stack Developer-Software Engineer II USD 93K-155KAPI Design | AWS | Ansible | Artificial Intelligence | BenchmarkingMid-level Full TimeDallas, Texas, United States10h ago
-
C++ | Cloud Native | Container Orchestration | Deep learning | Distributed SystemsCareer growth | Open Source contribution | World Class CollaborationEntry-level Full TimeSan Jose, California, United States10h ago
-
Senior Algorithm Engineer, Performance Advertising USD 174K-252KA/B | A/B Testing | Ad Ranking | Auction optimization | B testingCollaborative team environment | Continuous learning | Flat organizational structure | Ownership and empowermentSenior-level Full TimeSan Francisco, United States11h ago
-
Linguistic Engineer USD 103K-156KBias Mitigation | Ethical AI | Hack | Knowledge graphs | Machine LearningMid-level Full TimeRedmond, WA11h ago
-
Research Engineer, Robotics USD 184K-356KC++ | CUDA | Computer Graphics | GPU Architectures | GPU KernelsSenior-level Full TimeRedmond, WA11h ago
-
Partner Engineer, Generative AI USD 159K-223KAWS | Agent Orchestration | Azure | Bias Mitigation | C++Senior-level Full TimeMenlo Park, CA11h ago
-
Staff Research Engineer, MRS AI USD 146K-208KA/B | A/B Testing | Alignment techniques | B testing | BenchmarkingSenior-level Full TimeBellevue, WA11h ago
-
Senior Software Developer, Computer Vision, XR USD 100K-253KAr | Augmented Reality | C++ | Computer Vision | Data ProcessingSenior-level Full TimeSan Jose, CA, USA; Waterloo, ON, …12h ago
-
Customer Engineer III, Applied AI, Google Cloud USD 174K-253KAgent tooling | C++ | Cloud Architecture | Conversational AI | Document AISenior-level Full TimeSunnyvale, CA, USA; Mountain View, CA, …12h ago
-
Research Engineer, Pretraining, DeepMind USD 174K-253KFine Tuning | Inference Optimization | JAX | Language Models | Large Language ModelsMid-level Full TimeNew York, NY, USA12h ago
-
Senior Software Engineer, Map Ads, Machine Learning USD 174K-253KC++ | Data Processing | Debugging | Differential Modeling | Language ModelsSenior-level Full TimeMountain View, CA, USA12h ago
-
Staff Datacloud Blackbelt Engineer, Data and AI USD 183K-266KAI/ML | AI/ML workflows | BigQuery | Cloud Architecture | Computer VisionSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA12h ago
-
Staff Software Engineer, AI/ML, XR USD 207K-301KAI | App Development | Dart | Flutter | Framework designSenior-level Full TimeSan Jose, CA, USA; New York, …12h ago
-
Senior Software Engineer, Eye Tracking, Core USD 174K-253KAndroid | Artificial Intelligence | Augmented Reality | C++ | Camera PipelinesBonus | Equity | Health insurance | Paid time off | Retirement planSenior-level Full TimeSan Jose, CA, USA12h ago
-
Senior Staff Software Engineer, AI/ML, Google Cloud USD 262K-365KAlgorithms | Data Processing | Data Structures | Debugging | Distributed SystemsSenior-level Full TimeSeattle, WA, USA12h ago
-
Senior Software Engineer, AI/ML, Google Cloud Platforms USD 174K-253KC++ | Code Reviews | Data Processing | Data Structures | Data structures algorithmsSenior-level Full TimeKirkland, WA, USA12h ago
-
Staff Software Engineer, Infrastructure, Google Cloud AI USD 207K-301KCompute Technologies | Cross-Functional Collaboration | Cross-functional | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeKirkland, WA, USA; Sunnyvale, CA, USA12h ago
-
Senior Software Engineer, AI/ML, Google Cloud USD 174K-253KC++ | Data Processing | Debugging | Distributed Computing | Information RetrievalSenior-level Full TimeSunnyvale, CA, USA12h ago
-
Senior Software Engineer, AI/ML GenAI, Google Cloud USD 174K-253KAlgorithms | C++ | Computer Vision | Data Processing | Data StructuresSenior-level Full TimeSunnyvale, CA, USA12h ago