Engineering Manager, Model Inference
Tasks
- Architect and scale inference infrastructure
- Benchmark and eliminate inference bottlenecks
- Collaborate with ML research on model optimization quantization deployment
- Define inference system technical direction
- Develop AI inference APIs
- Ensure reliability efficiency observability
- Establish engineering standards and operational processes
- Lead and grow AI inference engineering team
- Lead incident response
- Optimize batching throughput latency GPU utilization
- Plan and execute cross-functional projects
- Recruit, mentor, and develop engineering talent
Perks/Benefits
- 401k matching
- Commuter benefits
- Flexible PTO
- Flexible spending accounts
- Generous time off
- HSA contribution
- Lifestyle Wallet
- Mental health support
- Paid parental leave
- Personal device allowance
- Sabbatical leave
- Therapy and coaching
Skills/Tech-stack
APIs | Attention Mechanism | Batching | Distributed Systems | Docker | Expert parallelism | FlashAttention | GPU Performance | GPU performance analysis | Grouped Query Attention | Incident Response | Kernel Fusion | Kubernetes | Multi-head attention | Observability | Performance Analysis | Pipeline parallelism | PyTorch | Quantization | Real Time | Real-time Systems | Tensor Parallelism | TensorFlow | TensorRT | Time Systems | Transformer | VLLM
Education
Regions
Countries
States
Related jobs
-
Quantitative Systems Manager USD 223K-225KAt risk | Data Feeds | Data Warehousing | Machine Learning | Margin AnalysisAD and D insurance | Dental insurance | Discretionary bonus eligibility | FSA | Flexible vacation policyMid-level Full TimeNew York22h ago
-
CTO, Data Platforms USD 175K-245KAI Ready | AI-Ready Data | Apache Airflow | Apache Hudi | Apache IcebergClient-facing leadership | Hybrid workSenior-level Full TimeUSA (Remote) R23h ago
-
Director, Analytics Engineering USD 240K-270KAlerting | Automated testing | CI/CD | DBT | Data GovernanceChildcare subsidy | Dell discount | Education discount | Employee assistance program | GympassExecutive-level Full TimeUnited States - Remote R23h ago
-
Principal Product Manager, Enterprise AI and Analytics USD 198K-297KAI Machine Learning Pipelines | AI machine learning | Cloud Integration | Competitive strategy | Data GovernanceCompany-sponsored team events | Flexible time off | Wellness resourcesSenior-level Full TimeSanta Clara, California1d ago
-
Forward Deployed Engineer - AI Solutions Engineering USD 140K-170KAI Evaluation | APIs | GraphQL | HubSpot | HubSpot workflows401k plan with company matching | Childcare reimbursement | Commuter reimbursement | Generous parental leave policy | Medical, dental, and vision insuranceMid-level Full TimeSan Francisco Office1d ago
-
Manager, Software Development- SAS Developer Experience USD 140K-198KAgile | Ansible | Argo CD | ArgoCD | CI/CD401k plan | Childcare benefits | Comprehensive medical/dental/vision plans | Onsite Health Care Center | Paid HolidaysMid-level Full TimeCary HQ, NC, United States1d ago
-
Account Management | Agile | Atlassian Tool Suite | Bitbucket | Budget Management10 percent travel | 9/80 schedule | Onsite work | Relocation assistanceMid-level Full TimeMDLI02, United States1d ago
-
Director, Data & Analytics Engineering USD 125K-135KAPI Development | Calculus | Decision Tree | Deep learning | Distributed TrainingTelecommuting up to 2 days per weekExecutive-level Full TimeAlpharetta GA 6655, United States R1d ago
-
Senior Manager, Global Data and Analytics USD 85K-150KAPIs | Alteryx | BigQuery | Data Pipelines | Data Quality401k | Company holidays | Fertility coverage | Fitness reimbursements | Flexible paid time offSenior-level Full TimeUSA-NY 1755 Broadway, United States1d ago
-
Senior-level Full TimeNew York, NY, 10010, USA2d ago
-
VP Marketing Data Science USD 129K-194KA/B | A/B Testing | B testing | Data Visualization | Dataiku401k | Accident insurance | Disability insurance | Life insurance | Medical, dental, and vision coverageExecutive-level Full TimeLocation(s): New York, New York, United …2d ago
-
Technical Program Manager, MTIA Software USD 167K-230KAI Algorithms | AI Inference | AI Training | AI accelerator | BenchmarkingMid-level Full TimeMenlo Park, CA3d ago
-
Mgr, Data Engineering USD 130K-160KData Pipeline Monitoring | Data Platform Operations | Data pipeline | Data platform | Incident Response401k company match | Family-forming benefits | Financial Planning Sessions | Flexible time off | Free snacks and refreshmentsMid-level Full TimeAustin, Texas, United States3d ago
-
AWS | Alerting | Black box monitoring | Black-box | CI/CDBackup childcare | Financial coaching | Mental health support | Mentoring | Onsite health and wellness centersSenior-level Full TimeJersey City, NJ, United States3d ago
-
Senior Product Manager, Data Platform (Remote) USD 162K-215KAI | Access Control | Business Intelligence | Cost Optimization | Data CatalogBusiness events | Remote work | Team gatherings | Together Weeks | Travel up to 5 days per quarterSenior-level Full TimeBoston, MA R3d ago
-
Manager I, Engineering - Change Experience Platform USD 187K-240KAPI Design | CLI tooling | Cassandra | Data Modeling | Developer experienceCareer pathing | Community guilds | Continuous professional development | Inclusion talks | Inclusive company cultureMid-level Full TimeNew York, New York, USA4d ago
-
Principal Product Manager, Inference Engine USD 218K-273KAnalytics | Autoscaling | Batching | Capacity Planning | ComplianceConference reimbursement | Employee assistance program | Employee stock purchase program | Equity compensation | Flexible time offSenior-level Full TimeSeattle4d ago
-
Senior Technical Program Manager, Productivity Engineering (Data Platform · AI Transformation) USD 164K-222KAPI Design | AWS | Airflow | DBT | Data FlowsAnnual bonus | Equity grants | Health benefits | Retirement benefitsSenior-level Full TimePleasanton, California, USA HQ4d ago
-
Senior Product Manager - Agent Integrations USD 192K-240KAI Agents | Backlog Management | BentoML | Customer discovery | Data AnalysisContinuous professional development | Inclusive community culture | Mental health benefits | Mentor/Buddy program | Stock equitySenior-level Full TimeNew York, New York, USA4d ago
-
APIs | Access Management | C++ | Cloud infrastructure | Code reviewSenior-level Full TimeNew York, NY, USA; Chicago, IL, …4d ago
-
AI Transformation Group Manager USD 176K-265KAkka | Apache Spark | Artificial Intelligence | Batch Processing | CI/CDSenior-level Full TimeLocation(s): Jersey City, New Jersey, United …4d ago
-
Manager, Machine Learning Engineering USD 187K-348KComputer Vision | Data Engineering | Deep learning | Distributed Systems | Generative AIDisability insurance | Employee wellness program | Health insurance | Life insurance | Paid HolidaysMid-level Full TimeWA Bellevue 205 108th Avenue NE, …4d ago
-
Mid-Level or Senior Engineering Data Scientist USD 137K-234KAPI Development | Anomaly Detection | C# | Data Analysis | Data PipelinesDisability insurance | Flexible spending account | Health insurance | Health savings account | Life insuranceMid-level Full TimeEverett, Washington; Seattle, Washington4d ago
-
API Integration | Accounting | Agile | Anomaly Detection | Artificial IntelligenceDental insurance | Disability insurance | Flexible spending account | Health insurance | Health savings accountMid-level Full TimeGlobal Headquarters, United States4d ago
-
Delivery Solutions Architect USD 180K-247KBusiness Analysis | Data Architecture | Discovery Workshops | Distributed Systems | Executive CommunicationHybrid work | Travel 30 percentSenior-level Full TimeCentral - United States4d ago