Senior Engineering Manager, AI Inference Platform, Distributed Cloud
Tasks
- Define technical vision and strategy for LLM serving stack
- Design, implement, and optimize LLM serving architectures
- Lead, mentor, and grow engineering teams
- Optimize GPU accelerator performance and minimize latency
- Oversee performance analysis, profiling, and benchmarking
- Partner with research, SRE, product, and core library teams
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | C++ | Continuous batching | Deep learning | Disaggregated serving | Distributed Computing | GPU | JAX | LLM serving | Latency optimization | Machine Learning | Performance Profiling | PyTorch | Python | Resource Optimization | Scalability | System Optimization | TensorFlow | XLA
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Roles
AI | Engineering | Engineering Manager | Engineering Manager, AI | Manager