Senior Machine Learning Engineer, Distributed vLLM (llm-d)
Tasks
- Build distributed inference infrastructure using Kubernetes
- Conduct code reviews
- Contribute to technical discussions
- Design and develop inference solutions
- Develop inference algorithms
- Develop system components in Go and/or Rust
- Engage in industry and open source events
- Enhance resource utilization and fault tolerance
- Mentor engineering team
- Optimize memory utilization and request distribution
- Participate in upstream communities
Perks/Benefits
- Dental
- Disability
- Family medical leave
- Flexible spending
- Health savings
- Medical
- Military Leave
- Paid time off
- Parental leave
- Reimbursement programs
- Retirement 401k
- Stock purchase
- Vision
Skills/Tech-stack
API Gateways | CNI | Cilium | Cloud Native | Computer Architecture | Distributed Systems | Envoy | GPU Profiling | GPU Programming | GRPC | Golang | HTTP/2 | High Performance | High-performance networking | Infiniband | Istio | Kubernetes | Microservices | OpenTelemetry | Parallel processing | Python | RDMA | Reverse Proxies | RoCE | Rust | SGLang | TensorRT-LLM | UCX | VLLM | WASM
Education
N/A
Regions
Countries
States
Cities
Related jobs
-
Automation Testing | CI/CD | CSS | Cypress | Feature DevelopmentMedical, dental & vision coverage | Paid time off | Parental leave | Reimbursement programs | Retirement planMid-levelRaleigh, United States R11d ago
-
Manager, Data Engineering USD 130K-166KAWS | Access Controls | Apache Airflow | Audit Logging | AzureCollaborative team culture | Remote work | Work-life balanceSenior-level Full TimeRemote, United States R13h ago
-
Staff AI Engineer USD 210K-235KAgent systems | Agentic AI | Anthropic API | Anthropic Claude | Automated Evaluation401k | Career growth | Disability and life insurance | Equipment provided | Flexible vacation policySenior-level Full TimeRemote (United States) R1d ago
-
Data Solutions Engineer II USD 72K-115KApache Spark | Azure Data | Azure Data Factory | Azure Data Lake | Azure SynapseMid-level Full TimeOhio WFH, United States R1d ago
-
Cloud Data Engineer USD 125K-205KApache Airflow | Apache Spark | Azure | CI/CD | ContainersFlexible work hours | Fully remote option | Occasional Evening CoverageMid-level Full TimeComerica Park, United States R1d ago
-
Sr AI Platform Specialist Solution Architect USD 206K-330KAnsible | CI/CD | Computer Vision | Data pipeline | Deep learningSenior-level Full TimeRemote US CA, United States R1d ago
-
Senior Software Engineer, Data Platform USD 164K-227KAccess Control | Airflow | Amazon Kinesis | Amazon Redshift | Apache Flink401k match | Community volunteer time | Commuter benefit | Company-paid days off | Dental insuranceSenior-level Full TimeSan Francisco, CA, USA R1d ago
-
AI Research Engineer, Computer Vision USD 170K-210KAutoregressive models | CUDA | DDP | Data Pipelines | DeepSpeed401k retirement plan | Company equity | Dental insurance | Fertility support | Human Annotation SupportMid-level Full TimeRemote (U.S. or Canada) R1d ago
-
Data Engineer USD 100KAPIs | Apache Kafka | Apache Spark | Azure | Azure Data401k match | Dental insurance | Life insurance | Medical insurance | Paid sick leaveEntry-level Full TimeRemote, US R1d ago
-
Senior Data Engineer USD 165K-175KAWS Glue | AWS Step Functions | Amazon Athena | Amazon EMR | Amazon KinesisEmployee discounts | Employee equity | Medical, dental & vision coverage | Unlimited PTOSenior-level Full TimeRemote - United States R1d ago
-
Data Automation Engineer USD 110K-125KAPI Integration | AWS | Airflow | Azure | C SharpDental insurance | Employee discounts | Employee equity | Health insurance | Pet insuranceMid-level Full TimeRemote - United States R1d ago
-
Principal AI/ML Engineer - AdTech USD 300K-400KAWS | Ad Exchanges | Apache Kafka | Apache Spark | CassandraEmployee discounts | Employee equity | Medical, dental & vision coverage | Pet insurance | Unlimited PTOSenior-level Full TimeRemote - United States R1d ago
-
Lead AI Engineer USD 200K-215KA/B | A/B Testing | AWS Bedrock | Agentic LLM | Agentic LLM systemsDental insurance | Employee discounts | Employee equity | Health insurance | Pet insuranceSenior-level Full TimeRemote - United States R1d ago
-
Senior Machine Learning Engineer USD 198K-287KArtificial Intelligence | Data Engineering | Fine Tuning | Foundation Models | GenAISenior-level Full TimeRemote - US R1d ago
-
Data Engineer USD 135K-200KAPI Integration | AWS Firehose | AWS Kinesis | AWS Lambda | Amazon ECS401k | Dental insurance | Disability insurance | EAP | Employee assistance programSenior-level Full TimeNew York, NY (remote) R1d ago
-
GenAI Principal Engineer - Remote USD 145K-215KAI orchestration | API first | API-first design | AWS | Android401k matching | DEI focus | Development opportunities | Flexible schedule | Flexible time offSenior-level Full TimeUnited States, UNITED STATES, United States R1d ago
-
HPC - AI/ML Platform Engineer USD 113K-190KAnsible | Bash | CI/CD | GPU scheduling | GrafanaDental insurance | Employee resource groups | Flexible family care | Health insurance | Paid HolidaysMid-level Full TimeUnited States R1d ago
-
Data Engineer - Governance and QA USD 120K-150KCI/CD | DBT | Data Architecture | Data Contracts | Data Modeling401k with company match | Dental insurance | Life insurance | Long-term disability | Medical insuranceMid-level Full TimeDallas, TX - Hybrid (3x in … R1d ago
-
Senior Sales Engineer - Key Accounts Northcentral USD 149K-198K.NET | CRM | Go | Java | Node.jsCommunity guilds | Employee stock purchase plan | Hybrid work | Inclusion talks | Mentor/Buddy programSenior-level Full TimeChicago, Illinois, USA; Michigan, USA, Remote; … R1d ago
-
Senior Data Engineer USD 143K-229KAnalysis Services | Azure Data | Azure Data Factory | Azure DevOps | Azure Gen2Mentorship | Remote work opportunity | Travel as requiredSenior-level Full TimeDenver, CO, United States R1d ago
-
Senior Data Engineer USD 143K-229KAnalysis Services Tabular | Azure Analysis | Azure Analysis Services | Azure Analysis Services Tabular | Azure DataMentorship | Remote work optionSenior-level Full TimeKansas City, MO, United States R1d ago
-
Databricks Pipeline Architect USD 150K-180KAWS Glue | AWS Lambda | AWS S3 | Agile | Amazon Web ServicesPublic trust clearance support | Remote workSenior-level Full TimeWork from home, VA, United States R1d ago
-
Tier 3 Network Systems Engineer (Remote) USD 80K-101KActive Directory | Ansible | Ansible Playbook | Apache HTTP | Apache HTTP ServerAfter hours availability | Customer support support | On-call rotation | Remote workMid-level Full TimeDallas, TX, US R1d ago
-
ML Engineer, II - Road & Lane USD 139K-183KBEV | CUDA | CUDA kernels | Camera Calibration | Computer VisionMid-level Full TimeRemote - US, Ann Arbor, MI, … R1d ago
-
ML Engineer, II - Learned Behaviors USD 153K-222KBehavior Cloning | Data Pipelines | Distributed Training | Graph Neural Networks | Imitation LearningMid-level Full TimeRemote - US, Ann Arbor, MI, … R1d ago