Engineering Manager, Model Inference
Tasks
- Architect and scale inference infrastructure
- Benchmark and eliminate inference bottlenecks
- Collaborate with ML research on model optimization quantization deployment
- Define inference system technical direction
- Develop AI inference APIs
- Ensure reliability efficiency observability
- Establish engineering standards and operational processes
- Lead and grow AI inference engineering team
- Lead incident response
- Optimize batching throughput latency GPU utilization
- Plan and execute cross-functional projects
- Recruit, mentor, and develop engineering talent
Perks/Benefits
- 401k matching
- Commuter benefits
- Flexible PTO
- Flexible spending accounts
- Generous time off
- HSA contribution
- Lifestyle Wallet
- Mental health support
- Paid parental leave
- Personal device allowance
- Sabbatical leave
- Therapy and coaching
Skills/Tech-stack
APIs | Attention Mechanism | Batching | Distributed Systems | Docker | Expert parallelism | FlashAttention | GPU Performance | GPU performance analysis | Grouped Query Attention | Incident Response | Kernel Fusion | Kubernetes | Multi-head attention | Observability | Performance Analysis | Pipeline parallelism | PyTorch | Quantization | Real Time | Real-time Systems | Tensor Parallelism | TensorFlow | TensorRT | Time Systems | Transformer | VLLM
Education
Regions
Countries
States
Related jobs
-
Manager I, Engineering - Data Visualization Explorations USD 187K-240KCode review | Data Visualization | Frontend Development | Incident Response | JavaScriptBuddy program | Career pathing | Community guilds | Employee stock purchase plan | Hybrid workplaceMid-level Full TimeNew York, New York, USA13h ago
-
Manager, Engineering (Data Platform) USD 239K-275KAWS | AWS Glue | Amazon EMR | Amazon Web Services | Apache Airflow401k | Flexible PTO | Medical/Dental/Vision | Teladoc HealthMid-level Full TimeNew York City, New York13h ago
-
Product Manager - AI Inference & Model Serving USD 160K-275KAI Inference | Autoscaling | Cache Management | Cold Start | Cold Start OptimizationConference attendance | Professional development and training | Stock options | Workstation providedMid-level Full TimeAustin, TX, United States13h ago
-
Technical Product Manager, AI Storage USD 168K-240KBacklog Management | BeeGFS | Block Storage | CSI | DAOSConference attendance | Customized workstation | Professional development and trainingMid-level Full TimeAustin, TX, United States13h ago
-
Senior Solution Owner | AI & Data Solutions USD 160K-195KAPIs | Agentic AI | Agile | Agile Ceremonies | Amazon Web ServicesSenior-level Full TimeNew York19h ago
-
AI | AI Agents | Agent systems | Cloud Computing | Context engineeringSenior-level Full TimeSan Francisco, CA, USA; New York, …22h ago
-
ML Infrastructure Engineer USD 151K-230KAirflow | Amazon SageMaker | Apache Spark | Argo Workflows | C++Entry-level Full TimeOakland, CA1d ago
-
Mid-level Full TimeSTORE SUPPORT CENTER, ATLANTA - 9090, …1d ago
-
Manager of Data Science USD 104K-170KAPIs | Agentic Workflows | Artificial Intelligence | Automation | CloudSenior-level Full TimeHudson, WI, United States1d ago
-
Senior Manager, Software Engineering - Remote USD 125K-200KAPI | API Gateway | Agentic Workflows | Amazon Web Services | CI/CDComprehensive benefits package | Remote work | Variable pay opportunitySenior-level Full TimeUnited States, UNITED STATES, United States R1d ago
-
Engineering Manager, Data Engineering (Remote, US) USD 176K-264KAgile | Airflow | Aiven Debezium | BigQuery | CI/CD401k company contribution | Life and disability coverage | Medical, dental, and vision plans | Parental leave | Remote-first cultureMid-level Full TimeRemote, United States R1d ago
-
AIPS | API Standards | Apigee | Authentication | Best practicesSenior-level Full TimeSeattle, WA, USA; Goleta, CA, USA1d ago
-
Embedded Event Security Manager USD 130K-150KAccess Control | Contingency Planning | Credentialing | Crowd Management | Event planning401k match | Defensive driving training | Dental insurance | Employee assistance program | Executive Protection TrainingMid-level Full TimeUnited States1d ago
-
Delivery Solutions Architect - Public Sector USD 180K-247KBusiness Analysis | Discovery Workshops | Distributed Systems | Executive Communication | Program ManagementComprehensive benefits | Equity | Hybrid work | Performance bonus | Travel up to 30 percentSenior-level Full TimeRemote - Washington R2d ago
-
Senior Machine Learning Ops Engineer USD 150K-173KAWS | Airflow | Bash | Batch inference | CI/CDEmployee mentorship program | Leadership programsSenior-level Full TimeUnited States R2d ago
-
MLOps Engineer USD 113K-188KAWS GovCloud | Apache Spark | Artifact management | Auditability | Azure401k | Adoption Assistance | Disability insurance | Emergency back-up childcare program | Employee referral programMid-level Full TimeGH Office: Tysons Corner, VA (Headquarters), …2d ago
-
ML Engineering Director, AI for Drug Discovery USD 192K-373KAWS | Artificial Intelligence | Batch Processing | CI/CD | Cloud ComputingRelocation benefitsExecutive-level Full TimeNew York, United States2d ago
-
AI Services | AI orchestration | API Integration | Anthropic Claude | Azure AI401k | Dental insurance | Disability insurance | Health insurance | Life insuranceSenior-level Full TimeUS - Remote, United States R2d ago
-
Data/Linux Engineer III USD 104K-157KAnsible | Apache Superset | Automation | Bash | BitbucketPaid time offSenior-level Full TimeJacksonville, United States2d ago
-
Manager of Software Engineering: Data Analytics USD 170K-201KAWS | AWS Glue | AWS Lambda | AWS S3 | AWS Step FunctionsMid-level Full TimeNew York, NY, United States2d ago
-
Senior Data Platform Manager USD 144K-168KAWS | Agile | Amazon Redshift | Apache Airflow | Apache SparkHybrid work flexibility | Occasional travel for meetings and conferences | Training and conference opportunitiesSenior-level Full TimeUS-VA-Arlington2d ago
-
Customer Data Platform Manager USD 120K-147KAPIs | App analytics | Blueconic | Customer Segmentation | Data GovernanceMid-level Full TimeExton, Pennsylvania, United States2d ago
-
Director, AI & Data Platform Engineer USD 134K-179KAI Agents | Analytics | Azure | Change Management | Cloud ArchitectureContinuous learning and development | Inclusive collaborative work environment | Supportive growth cultureExecutive-level Full TimeUnited States3d ago
-
AI Developer/Project Manager USD 98K-140KAutomation | Data Analysis | Data Pipelines | Data Visualization | Machine LearningMid-level Full TimeGrand Rapids, MI, United States3d ago
-
VP, Data and Analytics (Remote US) USD 240K-260KAI integration | BI tools | Data Architecture | Data Engineering | Data GovernanceExecutive-level Full TimeUnited States R3d ago