Inference Optimization Manager
Tasks
- Build continuous optimization loop for customer workloads
- Drive full stack performance optimizations
- Lead optimization platform for LLM inference
- Manage and develop engineering team
- Own technical direction for inference performance
- Partner with GTM for workload specific tuning
- Publish technical content on inference optimization best practices
- Set engineering priorities and execution plan
- Translate customer insights into technical roadmap
Perks/Benefits
Skills/Tech-stack
Cloud infrastructure | Distributed Systems | GPU Kernel | GPU kernel programming | Inference engine | Kernel programming | Kubernetes | Language Models | Large Language Models | Machine Learning | Performance Engineering | Workload Analysis
Education
N/A
Related jobs
-
Access Control | Artificial Intelligence | B2B SaaS | Data Governance | Data SecurityRemote workMid-level Full Timeremote, United States R12h ago
-
Cloud infrastructure | Data Processing | Debugging | Distributed Computing | Fine TuningSenior-level Full TimeSeattle, WA, USA13h ago
-
Digital Execution Lead, Digital Delivery for Construction Execution, Google Data Centers USD 163K-237KApache Airflow | Automated Monitoring | BigQuery | CI/CD | DBTSenior-level Full TimeSunnyvale, CA, USA; Atlanta, GA, USA13h ago
-
C plus plus | C# | Computer Vision | Data Compression | Data ProcessingSenior-level Full TimeMountain View, CA, USA14h ago
-
Manager, Data and Technology - Data Engineer CAD 101K-169KArtificial Intelligence | Azure | Data Architecture | Data Engineering | Data GovernanceFirm-wide closures | Flexible benefit spending account | Flexible work arrangements | Hybrid work | Learning and development daysMid-level Full TimeToronto, ON, CA, M5H 0A918h ago
-
Access Control | Anomaly Detection | Artificial Intelligence | Audit evidence | CSPMFlexible benefits | Flexible work arrangements | Hybrid work | Learning days | MentoringSenior-level Full TimeToronto, ON, CA, M5H 0A919h ago
-
Data Science Manager CAD 84K-175KAWS | Azure | Computer Vision | Data ETL | Data MiningDeloitte Days | Flexible benefits spending account | Flexible work arrangements | Hybrid work arrangement | Learning & Development DaysMid-level Full TimeToronto, ON, CA, M5H 0A919h ago
-
Data Quality Management Lead CAD 85K-156KAnomaly Detection | Ataccama | Automated Issue Remediation | Azure | Cloud PlatformsDeloitte Days | Flexible benefits spending account | Flexible work arrangements | Hybrid work structure | Learning daysSenior-level Full TimeToronto, ON, CA, M5H 0A919h ago
-
Backlog Management | Change Management | Cloud Computing | Data Modeling | Data ScienceFlexible benefits | Hybrid work | Mentoring | Paid vacationSenior-level Full TimeToronto, ON, CA, M5H 0A919h ago
-
AI Governance | Benchmarking | Fine Tuning | Generative AI | Hugging FaceDeloitte Days | Flexible benefits account | Hybrid work | Learning & Development Days | Mental health support benefitsMid-level ContractToronto, ON, CA, M5H 0A919h ago
-
Executive Director Product Solutions Manager - Chief Data Analytics Office Fusion Platform USD 236K-285KAPI Design | API documentation | Agentic AI | Artificial Intelligence | Audit trailsBackup childcare | Equal opportunity employment | Financial coaching | Health care coverage | Mental health supportExecutive-level Full TimeNew York, NY, United States23h ago
-
Senior Analytics Manager - AI Model & Prompt Engineering USD 172K-258KA/B | A/B Testing | AI Evaluation | APIs | AWS401k | Career development | Employee assistance program | Flexible spending accounts | Health savings accountSenior-level Full TimeChicago, Illinois, United States1d ago
-
Financial Crimes Model Analytics Manager USD 92K-120KAnti-Money Laundering | Counter Terrorism Financing | Counter-terrorism | Data Analysis | Data Management401k plan | Life insurance | Paid Holidays | Paid sick leave | Paid vacationMid-level Full TimeCharlotte NC - 214 North Tryon …1d ago
-
Artificial Intelligence | Claims data | Claims data analysis | Competitive benchmarking | Data AnalysisMid-level Full TimeDurham Blackwell Street, United States1d ago
-
Analytics | Customer Segmentation | Data Infrastructure | Decisioning | Digital ProductMid-level Full TimeMcLean, VA, United States1d ago
-
Manager, Data Scientist - Credit Review USD 179K-225KAWS | Apache Spark | Classification | Clustering | CondaMid-level Full TimeMcLean, VA, United States1d ago
-
Java | Machine Learning | Open Source | Python | Relational databasesSenior-level Full TimeRichmond, VA, United States1d ago
-
Generative AI - Group Manager - Senior Vice President USD 176K-265KAI compliance | AI guardrails | AWQ | AWS | Autogen401k | Accident and disability insurance | Life insurance | Medical, dental & vision coverage | Paid HolidaysSenior-level Full Time480 WASHINGTON BOULEVARD JERSEY CITY, United …1d ago
-
Senior Manager of Data, AI Engineering USD 180K-200KAPIs | Cloud Platforms | Data Architecture | Data Pipelines | Databricks401k | Adoption leave | Employer Paid Long-term Disability | Employer Paid Short-term Disability | Health insuranceSenior-level Full TimeChicago, Illinois, United States1d ago
-
Staff Technical Product Manager, Embedding & Search USD 200K-215KAWS Bedrock | Embeddings | Language Models | Machine Learning | Model EvaluationCollaborative culture | Flexible PTO | Health, dental, and vision benefits | Hybrid work | Parental leaveSenior-level Full TimeSan Francisco1d ago
-
A/B | A/B Testing | Agile | Analytics | Applied statisticsSenior-level Full TimeNew York, NY, United States1d ago
-
Amplitude | Data Governance | Data Pipelines | Data Reconciliation | Data WarehousingCo-working space access | Health insurance coverage | Health savings account | Life insurance coverage | Long-term disabilitySenior-level Full TimeMontréal - Remote R1d ago
-
Lead, Finance Analytics & Enablement AI/ML USD 400K-500KAmplitude | Automation | Business Intelligence | Cloud platform | Data Engineering401k plan | Co-working space access | Disability insurance | Flex Spending Account | Health reimbursement accountSenior-level Full TimeNew York - Remote R1d ago
-
Director of AI Engineering, AI Labs USD 275K-325KAPI Design | Agentic Workflows | Artificial Intelligence | Backend Development | Data Infrastructure401k matching | Catered lunch on Wednesdays | Free medical plans | Travel opportunitiesExecutive-level Full TimeNew York, New York1d ago
-
Senior Financial Analytics Advisor-Remote USD 105K-143KAlteryx | Analytics reporting | Artificial Intelligence | Automl | BigQuery100 percent remote work | Comprehensive benefit plans | Continuing education | Dental insurance | FSASenior-level Full TimeRochester, MN, United States R1d ago