Senior Software Engineer - AI Inference
Tasks
- Deploy and operate machine learning systems at scale
- Design and build scalable inference infrastructure
- Design and operate production distributed systems
- Drive architecture and technical decisions for inference platform
- Improve model deployment observability and production performance
- Lead integration of inference runtimes and serving frameworks
- Mentor junior engineers on system design and performance optimization
Perks/Benefits
- 401k match
- Dental insurance
- Life insurance
- Medical insurance
- Paid Holidays
- Paid time off
- Vision insurance
- Wellness programs
Skills/Tech-stack
Batching | CUDA | Caching | Distributed Systems | High Performance | High-Performance Computing | Inference Optimization | KServe | Kubernetes | Load Balancing | Machine Learning | Memory Aware Serving | NCCL | NVIDIA GPU | ONNX | Observability | Performance Computing | Prompt Caching | PyTorch | Request Routing | Request Scheduling | Structured Sampling | TensorRT | Traffic Management | Triton | VLLM
Education
N/A
Regions
Countries
States
Cities
Related jobs
-
Lead Data Expert USD 97K-163KData Architecture | Data Automation | Data Modeling | Data Quality | Data VisualizationOnsite 30 to 50 percentSenior-level Full TimeArlington/Rosslyn, Virginia, United States4h ago
-
CRM | Data Mining | Deep learning | Email outreach | Knowledge graphsMid-level Full TimeSan Jose, California, United States5h ago
-
Senior Software Engineer, AI/ML, Google Public Sector USD 174K-252KAlgorithms | C++ | Cloud Object Storage | Data Structures | Distributed ComputingSenior-level Full TimeReston, VA, USA6h ago
-
Staff Software Engineer, Intelligent Database Management USD 207K-300KAI | API Design | AlloyDB | Audit Logging | BigtableSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA6h ago
-
Forward Deployed Engineer I, GenAI, Google Cloud USD 102K-145KAPI Development | Agent Framework | Agent systems | Cloud Computing | CrewAISenior-level Full TimeSan Francisco, CA, USA; Atlanta, GA, …6h ago
-
C++ | Data Mining | Data Processing | Deep learning | Few-Shot LearningSenior-level Full TimeMountain View, CA, USA6h ago
-
Software Engineer III, AI/ML, Google Ads USD 147K-211KC++ | Data Processing | Debugging | Information Retrieval | Language ProcessingSenior-level Full TimeMountain View, CA, USA6h ago
-
ML Engineer USD 190K-320KCost Optimization | Data Versioning | Dataset Operations | Dataset curation | Eval Frameworks401k matching | Dental insurance | Employee assistance program | Health insurance | Stock optionsSenior-level Full TimeSan Francisco11h ago
-
Data Engineer USD 105K-115KBig Data | Cloud Computing | Data Modeling | Data Pipelines | Data StorageActive secret clearanceMid-level Full TimeSan Diego, CA, US13h ago
-
Principal Data Engineer USD 200K-240KAWS | Agentic Workflows | Anomaly Detection | Batch pipelines | CCPA401k plan | Commuter benefits | Flexible vacation | Life insurance | Long-term disabilitySenior-level Full TimeBoulder, Colorado or New York City, … R13h ago
-
Associate AI - Data Scientist / ML Engineer (Azure) USD 46K-110KAzure Container | Azure Container Instances | Azure Data | Azure Data Factory | Azure Data LakeDental insurance | Disability insurance | Employee assistance program | Life insurance | Medical insuranceMid-level Full TimeNashville, TN, US15h ago
-
Staff Machine Learning Engineer USD 159K-309KAWS | Airflow | Apache Spark | BigQuery | Cloud platform401k plan with company match | Commuter benefits | Disability coverage | Electric Car Charging Station | Employee assistance programSenior-level Full TimeMountain View, USA16h ago
-
Staff Machine Learning Engineer USD 152K-261KAWS | Airflow | Apache Spark | BigQuery | Cloud platform401k plan with company match | Commuter benefits | Disability insurance | Electric Car Charging Station | Employee assistance programSenior-level Full TimeMountain View, USA16h ago
-
Staff Machine Learning Engineer USD 142K-309KAWS | Deep learning | Information Retrieval | Machine Learning | Statistical modeling401k plan with company match | Electric Car Charging Station | Employee assistance program | Flexible spending accounts | Health savings accountSenior-level Full TimeMountain View, USA16h ago
-
Staff Software Engineer - AI Research Infrastructure USD 190K-270KBackend Services | C plus plus | CI/CD | Cloud infrastructure | Cluster managementSenior-level Full TimeNew York City, New York; San …16h ago
-
Staff Software Engineer - AI Research Infrastructure USD 190K-270KBackend Services | CI testing | Cluster scheduling | Data Pipelines | Distributed SystemsSenior-level Full TimeNew York City, New York16h ago
-
Senior AI Solutions Engineer, Enterprise Knowledge Work USD 260K-325KAgentic Systems | Dspy | Evaluation | LLM orchestration | LanggraphCollaborative culture | Flexible working hours | Supportive work environmentSenior-level Full TimeNew York, New York, United States; …16h ago
-
Quantum Software Engineer II USD 100K-215KAI Tooling | CUDA | Compilers | Complex linear algebra | DebuggingEntry-level Full TimeRedmond, WA, US17h ago
-
Senior Applied Scientist , Sponsored Products USD 183K-273KA/B | A/B Testing | B testing | Bandit Algorithms | Causal InferenceSenior-level Full TimeNew York, New York, USA17h ago
-
Principal, AI Platform Engineer USD 125K-187KAWS | Azure | CI/CD | Data leakage | Deterministic executionSenior-level Full TimeAtlanta, Georgia, US United States, 3034017h ago
-
AI Engineer USD 100K-143KAPI Development | AWS | Automation | Azure | Cloud platform401k | Dental insurance | Disability insurance | EAP | Gym reimbursementMid-level Full TimeOcala, Florida, United States17h ago
-
Lead Data Engineer, Marketing Operations and Engineering USD 139K-257KAWS | Apache Airflow | Apache Spark | Cloud Computing | Cloud platformSenior-level Full TimeSan Jose, United States R17h ago
-
Mid-level Full TimeCharlotte, United States17h ago
-
AI/ML Implementation Engineer USD 93K-155K8D | APQP | AWS | AWS Lambda | Amazon Bedrock401k matching | Dental insurance | Disability benefits | Employee assistance program | Health coachingMid-level Full TimeRemote, United States R17h ago
-
Generative AI Consultant USD 105K-105KAWS | Azure | CI/CD | Deep learning | Docker401k matching | Health insurance | Paid time off | Parental leave | Professional developmentMid-level Full TimeSan Francisco, CA, United States18h ago