ML Model Serving Engineer
Tasks
- Collaborate with infrastructure and training teams
- Extend serving frameworks
- Optimize machine learning model serving
- Reduce model initialization time
- Speed up inference
Perks/Benefits
- Dental insurance
- Employee assistance program
- Health insurance
- Healthcare Flexible Spending Account
- Retirement matching
- Stock options
- Unlimited PTO
- Vision insurance
Skills/Tech-stack
Cloud Platforms | Distributed inference | High Performance | High performance systems | Kubernetes | Model Optimization | Model Serving | Performance Engineering | Performance systems | PyTorch
Education
N/A