Inference Optimization Manager
Tasks
- Build continuous optimization loop for customer workloads
- Drive full-stack performance optimizations
- Lead optimization platform for LLM inference
- Manage and develop engineering team
- Own technical direction for inference performance
- Partner with GTM for workload-specific tuning
- Publish technical content on inference optimization best practices
- Set engineering priorities and execution plan
- Translate customer insights into technical roadmap
Perks/Benefits
Skills/Tech-stack
Cloud Infrastructure | Distributed Systems | GPU Kernel | GPU Kernel Programming | Inference Engine | Kernel Programming | Kubernetes | Language Models | Large Language Models | Machine Learning | Performance Engineering | Workload Analysis
Education
N/A