Senior Principal Machine Learning Engineer, vLLM
Tasks
- Benchmark and evaluate quantization approaches
- Benchmark and evaluate sparsification approaches
- Benchmark and profile model parallelization
- Create inference serving deployment pipelines
- Design inference optimization algorithms
- Develop LLM deployment pipelines
- Develop LLM training pipelines
- Develop and test inference optimization algorithms
- Implement model compression algorithms
- Implement model quantization algorithms
- Implement model sparsification algorithms
- Mentor and guide engineers
- Perform code reviews
- Provide technical solutions in design discussions
Perks/Benefits
- 401k employer match
- Employee stock purchase plan
- Flexible spending account
- Health savings account
- Paid parental leave
- Paid time off
- Tuition reimbursement
Skills/Tech-stack
CPU architecture | Code review | Computer Vision | Deep learning | GPU Architecture | Graph theory | Inference Optimization | LLM Inference | LLM Inference Optimization | Language Models | Language Processing | Large Language Models | Linear Algebra | Machine Learning | Model Compression | Model Quantization | Model sparsification | Natural Language | Natural Language Processing | NumPy | Parallel Computing | Probability | PyTorch | Reinforcement Learning | Tensor math
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Regions
Countries
States
Cities
Related jobs
-
AI Engineer (Latam, Remote) USD 70K-110KAPI Integration | Authentication | Claude | Database Management | LLM integrationCollaboration with U S based teams | Fully remote | High ownership and autonomy | Part-time flexibilitySenior-level Part TimeFlorida, Aventura, United States of America R9h ago
-
Forward Deployed Machine Learning Engineer USD 180K-300KAPI Design | Cloud Computing | Deep learning | Diffusion Models | Fine TuningIn-person collaboration days | Remote work flexibility | Travel cost coverageSenior-level Full TimeSan Francisco (USA) R22h ago
-
Senior Software Engineer - Platform & MLOps USD 152K-230KAWS | Azure | CI/CD | Datadog | DockerDiscretionary incentive plan | Flexible work policy | Learning and development access | Medical benefitsSenior-level Full TimeSeattle, Washington, United States - Remote R23h ago
-
APIs | Agent architecture | Embeddings | Inference Serving | LLMDirect technical influence | Early stage equity upside | Fast Moving Engineering Culture | High technical autonomy | Remote workMid-level Full TimeSan Francisco, CA; Onsite R23h ago
-
Principal Data Engineer - Parametric USD 115K-225KAgile | Amazon Web Services | Athena | Containerization | Data GovernanceSenior-level Full TimeSeattle WA 800, United States R23h ago
-
Embedded Engineer USD 109K-140KARM Cortex | ARM Cortex-M | Automated testing | Code review | Cortex-M401k match | Career development plan | Disability benefits | Employee assistance program | HSAMid-level Full TimeKamas , UT, USA R23h ago
-
Senior Machine Learning Engineer USD 174K-287KComputer Vision | Deep learning | Gradient optimization | Graph theory | Inference OptimizationPaid parental leave | Paid time offSenior-level Full TimeBoston, United States R23h ago
-
Senior Data Engineer IS - Remote USD 122K-208KAI/ML | APIs | AWS Glue | Apache Airflow | Azure Data401k matching | Dental insurance | Disability insurance | Health insurance | Life insuranceSenior-level Full TimePortland, OR, United States R23h ago
-
Principal AI Platform Engineer USD 167K-220KAgent Orchestration | Backend Development | Braintrust | Cost Optimization | Data PipelinesEquity | Flexible Token Limits | Health, dental, vision coverage | Unlimited paid time offSenior-level Full TimeSan Francisco, California R1d ago
-
AI / Computer Vision (IC) USD 200K-357KComputer Vision | Edge inference | GPS/IMU | Language Models | LidarComprehensive health plans | Parental leave plans | Professional development stipend | Remote work optionSenior-level Full TimeRemote - US R1d ago
-
AI System Design | Acceptance Testing | Agile | Cause analysis | Conversational AI401k matching | Health insurance | Paid Holidays | Paid time off | Remote workMid-level Full TimeRedmond, WA, United States R1d ago
-
Cloud Data | Cloud Data Platforms | Data Governance | Data Modeling | Data QualityDental insurance | Employer-matched 401k | Health insurance | Life insurance | Paid HolidaysSenior-level Full TimeRedmond, WA, United States R1d ago
-
AI Developer - Model Creation & Full Stack USD 150K-175KAWS | Angular | Azure | CI/CD | D3.jsRemote work | USPS Public Trust Clearance eligibleMid-level Full TimeWork from home, VA, United States R1d ago
-
AI Developer USD 149K-190KChain-of-Thought | Continuous Deployment | Continuous integration | Deep learning | FHIRFlexible work schedule | Remote work opportunitySenior-level Full TimeUnited States R1d ago
-
API Integration | AWS | AWS Glue | Batch Processing | Code reviewSenior-level Full TimeIndianapolis, IN, United States R1d ago
-
Staff Developer, AI Experience USD 161K-287KAgentic coding | Artificial Intelligence | CI/CD | Context engineering | Continuous DeliveryRecognition programs | Remote work | Time off | Volunteer days | Wellness programsSenior-level Full TimeUnited States R1d ago
-
Staff Developer, AI Experience USD 161K-287KAgentic coding | Architecture | Artificial Intelligence | CI/CD | LLM EvaluationCharity support | Professional growth | Recognition programs | Time off programs | Volunteer daysSenior-level Full TimeUnited States R1d ago
-
Machine Learning Engineer II GBP 124K-186KAWS | Anomaly Detection | Athena | Bedrock | C++Formal learning opportunities | Hybrid work | On-the-job learningMid-level Full TimeUSA – MN – Minneapolis, United … R1d ago
-
Edge AI Engineer USD 130K-200KBenchmarking | C++ | Core ML | Edge Computing | Embedded SystemsCareer growth | Health benefits | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 150K-222KAccelerator hardware | Agentic Systems | Data Quality | Data quality monitoring | Deep learningCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Distinguished Engineer, Applied AI USD 150K-300KAWS | Agentic AI | Algorithms | Artificial Intelligence | Auto-failover401k match | Adoption Assistance | Career mentorship | Certification assistance | Employee trainingSenior-level Full TimeCA Palo Alto Office, United States R1d ago
-
AI Data Infrastructure Engineer USD 146K-189KApache Beam | CI/CD | Code review | Data Lineage | Data ModelingBenefits package | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 146K-189KActive Learning | Apache Beam | CI/CD | Caching | Code reviewMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 150K-270KAdapter-Tuning | DPO | Dataset curation | Distributed Training | Evaluation methodologyCareer growth | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Performance Optimization Engineer USD 136K-258KC++ | Continuous batching | Deep learning | Distributed Systems | FSDPMid-level Full TimeUnited States - Remote R1d ago