Senior Principal Machine Learning Engineer, vLLM
Tasks
- Benchmark and evaluate quantization approaches
- Benchmark and evaluate sparsification approaches
- Benchmark and profile model parallelization
- Create inference serving deployment pipelines
- Design inference optimization algorithms
- Develop LLM deployment pipelines
- Develop LLM training pipelines
- Develop and test inference optimization algorithms
- Implement model compression algorithms
- Implement model quantization algorithms
- Implement model sparsification algorithms
- Mentor and guide engineers
- Perform code reviews
- Provide technical solutions in design discussions
Perks/Benefits
- 401k employer match
- Employee stock purchase plan
- Flexible spending account
- Health savings account
- Paid parental leave
- Paid time off
- Tuition reimbursement
Skills/Tech-stack
CPU architecture | Code review | Computer Vision | Deep learning | GPU Architecture | Graph theory | Inference Optimization | LLM Inference | LLM Inference Optimization | Language Models | Language Processing | Large Language Models | Linear Algebra | Machine Learning | Model Compression | Model Quantization | Model sparsification | Natural Language | Natural Language Processing | NumPy | Parallel Computing | Probability | PyTorch | Reinforcement Learning | Tensor math
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Regions
Countries
States
Cities
Related jobs
-
Senior AI Operations Engineer USD 170K-180KAI infrastructure | Azure | CI/CD | Cloud infrastructure | Container Engine for Kubernetes401k match | Employee assistance program | Employee stock purchase plan | Flexible schedule | Flexible spending accountSenior-level Full TimeWork From Home, United States R16h ago
-
Software Engineer USD 149K-211KAlgorithms | C# | C++ | Cause analysis | Code reviewBonus | Equity | Health benefits | Hybrid scheduleMid-level Full TimeMountain View, CA, USA R21h ago
-
Data Engineer USD 123K-151KC++ | Cloud platform | Data Migration | Data Modeling | Data PartitioningBenefits | Hybrid scheduleMid-level Full TimeAustin, TX, USA R21h ago
-
Senior Software Engineer USD 189K-252KAlgorithm Design | Code review | Data Structures | Debugging | Machine LearningBenefits | Bonuses | Equity | Hybrid work scheduleSenior-level Full TimeNew York, NY, USA R21h ago
-
Software Engineer USD 149K-211KAlgorithms | C# | C++ | Code review | Data AnalysisBenefits | Bonuses | Equity | Hybrid work scheduleMid-level Full TimeMountain View, CA, USA R21h ago
-
Software Engineer USD 149K-211KAlgorithms | Android Development | C# | C++ | Data StructuresHybrid scheduleMid-level Full TimeMountain View, CA, USA R21h ago
-
Software Engineer USD 149K-211KAlgorithms | C# | C++ | Code review | Data AnalysisBonus | Equity | Hybrid work scheduleMid-level Full TimeMountain View, CA, USA R21h ago
-
Software Engineer USD 149K-211KC# | C++ | Cause analysis | Data Processing | Data StructuresHybrid scheduleMid-level Full TimeSunnyvale, CA, USA R21h ago
-
Senior Software Engineer USD 189K-252KAlgorithms | Audio Processing | C++ | Cause analysis | Data StructuresHybrid scheduleSenior-level Full TimeNew York, NY, USA R21h ago
-
Software Engineer USD 149K-211KAlgorithms | C# | C++ | Cause analysis | Data AnalysisHybrid scheduleMid-level Full TimeMountain View, CA, USA R21h ago
-
Research Engineer USD 147K-211KAlgorithm Design | C++ | Experimental Design | JAX | Machine LearningHybrid scheduleMid-level Full TimeMountain View, CA, USA R21h ago
-
Senior Research Engineer USD 174K-252KC plus plus | Code Reviews | Data Curation | Deep learning | JAXHybrid scheduleSenior-level Full TimeNew York, NY, USA R21h ago
-
Staff Software Engineer USD 264K-300KAdversarial Testing | Algorithm Design | C++ | Data Pipelines | Data StructuresBonus | Equity | Hybrid scheduleSenior-level Full TimeMountain View, CA, USA R21h ago
-
Staff Software Engineer USD 207K-300KAdversarial Testing | C++ | Data pipeline | Learning evaluation | Machine LearningEquity compensation | Health benefits | Hybrid scheduleSenior-level Full TimeNew York, NY, USA R21h ago
-
Senior Research Engineer USD 174K-252KC plus plus | Cause analysis | Code Reviews | Dataset curation | Deep Neural NetworksBenefits | Bonus | Equity | Hybrid work scheduleSenior-level Full TimeMountain View, CA, USA R21h ago
-
Software Engineer USD 147K-211KAlgorithms | C# | C++ | Cause analysis | Data AnalysisHybrid work scheduleMid-level Full TimeNew York, NY, USA R21h ago
-
Federal AI Solutions Engineer (Entry Level) USD 85K-105KAI Agents | AI RMF | AWS Bedrock | AWS CDK | Amazon Elastic Container Service401k employer match | Career growth and mentorship | Certification reimbursement | Dental insurance | Federal HolidaysEntry-level Full TimeHybrid - McLean, VA, United States R21h ago
-
Hugging Face | LLM orchestration | Langchain | Language Models | Large Language ModelsCareer growth potential | Early stage technical hire | Equity compensation | High ownership role | Hybrid workMid-level Full TimeSan Francisco, CA; Hybrid R1d ago
-
AI Solutions Architect USD 144K-200KAI RMF | Angular | Django | Drift Detection | FedRAMPCareer development | Employee resource groups | Flexible WFH | Generous PTO | Paid volunteer timeSenior-level Full TimeUS-Washington DC-Remote, United States R1d ago
-
Senior-level Full TimeUnited States - Remote R1d ago
-
Senior Staff Software Engineer, Data Platform USD 253K-298KAI Agents | Agent systems | Batch Processing | Change Data Capture | Compliance401k | Quarterly in person surges | Remote-firstSenior-level Full TimeRemote - USA R1d ago
-
AI/ML Security Engineer USD 102K-163KAPI Integration | AWS | Azure | Benchmarking | EvaluationCorporate holidays | Flexible time off | Group dental insurance | Group health insurance | Pet benefit optionMid-level Full TimeRemote R1d ago
-
Principal AI Engineer USD 160K-220KAI Governance | API Design | AWS | AWS Bedrock | Agent OrchestrationSenior-level Full TimeUS - Remote R1d ago
-
A/B | A/B Testing | Active Learning | Auto-labeling | B testingDental insurance | Dependent Care Account | Disability insurance | Flexible spending account | Flexible vacationMid-level Full TimeAnywhere, USA R1d ago
-
Batching | C# | C++ | CUDA | FP16Dental insurance | Disability insurance | Flexible spending account | Flexible vacation | Health insuranceMid-level Full TimeAnywhere, USA R1d ago