Principal Model Optimization Engineer
Tasks
- Debug GPU performance issues
- Develop model optimization best practices and tooling
- Develop tooling for model optimization interfaces and visualizations
- Integrate and deploy optimized models to production
- Optimize machine learning models for GPU training and inference
- Profile machine learning pipelines to identify bottlenecks
Perks/Benefits
- N/A
Skills/Tech-stack
CUDA | Continuous batching | GPU | LLM Inference | Machine Learning | Model Deployment | Model Optimization | Performance Profiling | Quantization | Speculative decoding | TensorRT | Triton
Education
Regions
Countries
States
Cities
Related jobs
-
Featured Feat. Associate Director, Data Labs USD 167K-167KAWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM GovernanceConference speaking opportunities | Hybrid work schedule | Media appearancesSenior-level Full TimeWashington, District of Columbia, 20004, United … R3d ago
-
Data Manipulation | Distributed Systems | Embeddings | Java | KubernetesCollaborative flat culture | Direct access to technical leadership | Exposure to cutting edge generative AI | Flexible schedule | High autonomyEntry-level Full TimeCanada R23h ago
-
Azure Data | Azure Data Lakehouse | Azure SQL | DBT | Data GovernanceCareer growth | Continuous learning | Dental insurance | Flexible work arrangements | Global collaborationSenior-level Full TimeCanada R1d ago
-
Senior-level Full TimeUnited States - Remote R1d ago
-
Edge AI Engineer USD 100K-150KC++ | Core ML | Cross Platform Inference | Cross-platform | DSPCareer growth potential | Full-time remote work | H1B transfer supportSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer USD 100K-150KAblation Studies | Accelerator hardware | Data Quality | Data labeling | Data quality monitoring100 percent remote | Career growth | Full-time employment | W2 employmentMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer USD 100K-150KAblation Studies | Accelerator hardware | Agentic Systems | Computer Vision | Data QualityMid-level Full TimeUnited States - Remote R1d ago
-
LLM Engineer USD 100K-150KDPO | Deep learning | Distributed Training | Efficient Attention | Efficient Fine TuningRemote workMid-level Full TimeUnited States - Remote R1d ago
-
Principal Applied AI Engineer, Finance USD 193K-340KAPI Development | AWS | Bias Mitigation | CI/CD | Churn modeling401k matching | Adoption Assistance | Development and career growth opportunities | Fertility treatments | Flexible work schedulesSenior-level Full TimeVirtual Office (Massachusetts), United States R1d ago
-
Data Engineer USD 72K-130KAI/ML | Analytics engineering | Azure DevOps | Bronze Silver Gold | CI/CD401k contribution | Career development opportunities | Comprehensive benefits package | Equity stock purchase | Incentive and recognition programsMid-level Full TimePrimary location: Eden Prairie, MN R1d ago
-
NLP Engineer USD 72K-130KArtifact Repositories | Artifactory | C# | CI/CD | Containerization401k contribution | Career development opportunities | Comprehensive benefits package | Equity stock purchase | Incentive and recognition programsMid-level Full TimePrimary location: San Diego, CA R1d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | C++100% remote | Full-time W2 employment | H1B transfer supportSenior-level Full TimeUnited States - Remote R1d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | C++Senior-level Full TimeWest Windsor / Princeton Jct., NJ R1d ago
-
Applied AI Engineer USD 120K-158KA/B | A/B Testing | API Integration | Anthropic API | B testingCareer growth | Fully remote | Global Engineering Organization | High ownership culture | Learning and development budgetMid-level Full TimeUnited States R2d ago
-
AWS | Application Security | Artificial Intelligence | Azure | Cloud SecurityConference speaking opportunities | Flexible schedule | Health Premium Plan Option | Mentorship | Paid trainingSenior-level Full TimeLos Angeles, California, United States R2d ago
-
Machine Learning Engineer V USD 231K-382KAWS | Agent Orchestration | Automated testing | Azure | CI/CDBonus eligibility | Disability insurance | Life insurance | Paid parental leave | Paid time offSenior-level Full TimeRemote, United States R2d ago
-
Senior AI Engineer USD 145K-181KAWS | Alerting | Azure | Docker | Embeddings401k match | Commuter benefits | Dental | Healthcare | Remote friendly workplaceSenior-level Full Time3750 Market Street, Philadelphia, PA, United … R3d ago
-
Senior Machine Learning Engineer USD 180K-250KComputer Vision | Data Pipelines | Data labeling | Deep learning | Embedding Models100 percent remote | 13 paid holidays | 401k plan | Dental insurance | Medical insuranceSenior-level Full TimeRemote USA R3d ago
-
Senior AI Engineer USD 250K-300KAPI Development | Artificial Intelligence | Cost Optimization | GitHub | Inference Optimization401k match | Co working sessions | Flexible PTO | Health and wellness allowance | Health insuranceSenior-level Full TimeSan Francisco (Hybrid) R3d ago
-
AWS | AWS CDK | Access Control | Airflow | Athena401k plan | Health insurance | Paid Holidays | Paid time off | Phone stipendSenior-level Full TimeSan Carlos - Hybrid R3d ago
-
Sr AI Engineer - Agentic Systems USD 166K-205KAI Safety | API Integration | Agent Orchestration | Artificial Intelligence | Distributed SystemsSenior-level Full TimeAnywhere, US R3d ago
-
Airflow | Auction design | BigQuery | Budget Optimization | Experimentation401k employer match | Coaching support | Family planning support | Flexible vacation | Gender-affirming careSenior-level Full TimeRemote - United States R3d ago
-
Machine Learning Engineer USD 140K-180KAWS | Alerting | Apache Spark | Azure | CI/CD401k | Dental insurance | Life insurance | Light travel | Medical insuranceSenior-level Full TimeSt. Louis, MO; Boston, MA; New … R3d ago
-
Principal Machine Learning Engineer USD 285K-457KArtificial Intelligence | Classification | Deep learning | Embeddings | ExperimentationIn-person onboarding | Remote work optionsSenior-level Full TimeRemote - USA R3d ago
-
AI Engineer USD 100K-197KARIMA | Amazon SageMaker | Bias Mitigation | Computer Vision | Deep learningMid-level Full TimeUSA - Remote R3d ago