Staff Technical Lead for Inference & ML Performance
Tasks
- Apply compiler strategies for inference
- Collaborate with research and applied machine learning teams
- Contribute to critical inference performance optimizations
- Develop and optimize kernels
- Guide team to build high performance inference solutions
- Identify and eliminate inference performance bottlenecks
- Implement model parallelism
- Implement performance optimizations
- Improve model serving performance
- Influence inference strategies and deployment techniques
- Mentor and scale performance focused engineers
- Set technical direction for inference performance
Perks/Benefits
- N/A
Skills/Tech-stack
CUDA | Compilation | Cutlass | Distributed Serving | Kernel optimization | Machine Learning | Model Inference | Model Parallelism | NVIDIA Triton | Profiling | PyTorch | Quantization | TensorRT | Transformer Models | TransformerEngine
Education
N/A
Regions
Countries
States
Related jobs
-
Sr. Machine Learning Engineer USD 91K-177KAlgorithms | Anomaly Detection | Apache Airflow | Data Analysis | Deep learning401k plan | Employee recognition | Employee stock purchase plan | Health insurance | Paid time offSenior-level Full TimeIrvine, CA, US3h ago
-
Staff Engineer, Machine Learning USD 196K-269KCamera | Computer Vision | Convolutional Neural Networks | DETR | Deep Neural Networks401k employer match | Dental insurance | Life insurance | Long-term disability | Medical insuranceSenior-level Full TimeMountain View, CA11h ago
-
Senior Manufacturing Analytics Engineer USD 115K-140KChemometrics | Data Preparation | Descriptive Analytics | Feature Engineering | Machine LearningComprehensive benefits | Medical benefits | Sick leave | Travel up to 15 percentSenior-level Full TimeWayzata, Minnesota, US United States, 5539112h ago
-
Sr. Applied AI Engineer USD 160K-200KAPIs | Cloud Computing | Generative AI | Machine Learning | PythonHome-office equipment | Hybrid work model | Self-development budget | Top of market equity and cash compensation packageSenior-level Full TimeNew York Office12h ago
-
Applied AI Engineer USD 110K-160KGenerative AI | Machine Learning | Python | REST APIs | SQLEquipment provided | Hybrid work | Mentorship | Self-development budgetMid-level Full TimeNew York Office12h ago
-
Sr. Staff Machine Learning Engineer, Content Quality USD 268K-469KApache Flink | Apache Hadoop | Apache Kafka | Apache Spark | Big DataSenior-level Full TimeSan Francisco, CA, US12h ago
-
Machine Learning Engineer, Foundation Model USD 129K-247KAuto-regressive models | C++ | Data Pipelines | Deep learning | Diffusion ModelsSenior-level Full TimeSan Jose13h ago
-
AWS | Azure | CI/CD | Cloud platform | Data PipelinesLong-term contractSenior-level Contract Full TimeDallas, TX, United States14h ago
-
Sr. ML Engineer USD 183K-246KAgent systems | Data extraction | Document processing | Entity Relationship Modeling | Entity relationshipDental insurance | Hybrid work option | In-person retreats | Learning stipend | Meal benefitsSenior-level Full TimeNew York City HQ14h ago
-
Special Missions – AI Lead USD 142K-312KAI Platform | AI Platform Architecture | Artificial Intelligence | Business Development | Cloud ComputingSenior-level Full TimeWashington, DC15h ago
-
Senior AI ML Engineer USD 137K-188KAPI Integration | AWS | AWS Bedrock | AWS Fargate | AWS GlueCareer growth opportunities | Flexible work schedule | Remote-first work environmentSenior-level Contract Full TimeUnited States15h ago
-
Senior Software Engineer - Data Infrastructure, Safety USD 196K-243KA/B | A/B Testing | AI | Automation | B testingSenior-level Full TimeSan Mateo, CA, United States R15h ago
-
Staff AI Research Engineer USD 220K-331KArtificial Intelligence | Computer Vision | Feature Engineering | Fine Tuning | Language ModelsCollaboration | Learning opportunities | Mentorship | Professional developmentSenior-level Full TimePittsburgh, PA15h ago
-
Junior Software Engineer (Backend + AI) USD 90K-110KAWS ECR | AWS S3 | Anthropic API | CI/CD | DjangoCodebase access to production projects | Hybrid work | Internship to production code | Training with AI engineering practicesEntry-level Full TimeGreater Boston Area15h ago
-
Perception Engineer USD 125K-220KC++ | CI/CD | CUDA | Computer Vision | Convolutional Neural NetworkHealth insurance | Professional development | Retirement plansSenior-level Full TimeHuntington Beach15h ago
-
Senior ML Engineer, LLM / VLM Distillation USD 213K-263KBayesian Inference | Deep learning | Generative Modeling | Language Models | Large Language ModelsDiscretionary annual bonus program | Equity incentive plan | Generous company benefits program | Hybrid work scheduleSenior-level Full TimeMountain View, California, United States, Mountain …15h ago
-
AWS | C# | Generative Models | Google Cloud | Langchain401k matching | Dental insurance | Health insurance | Paid Holidays | Paid time offMid-level Full TimeRedlands, CA16h ago
-
Senior Machine Learning (ML) Engineer USD 165K-218KAir-gapped | Air-gapped systems | Airflow | Anomaly Detection | Apache Spark401k match | Caregiver leave | Commuter benefits | Dental benefits | Generous time offSenior-level Full TimeFort Collins, Colorado, United States16h ago
-
Senior Consumer Analytics Engineer I USD 130K-180KAirflow | CI/CD | DBT | Dagster | Data Governance401k program | Employee resource groups | Fitness and wellness memberships | Flexible work environment | Learning and development stipendSenior-level Full TimeAustin17h ago
-
Evaluation metrics | Generative AI | Golang | Information Retrieval | Language ModelsSenior-level Full TimeMountain View, CALIFORNIA, United States17h ago
-
Research Infrastructure Engineer, Training Systems USD 295K-380KAPI Design | Benchmarking | Debugging | Distributed Systems | GPU ComputingMid-level Full TimeSan Francisco18h ago
-
Software Engineer in Data Science USD 180K-253KAPIs | AWS | Airflow | Code review | Continuous DeliveryCompetitive benefits package | In office five days per weekSenior-level Full TimeHouston, TX, United States19h ago
-
A/B | A/B Testing | API Design | B testing | Causal InferenceSenior-level Full TimeDenver, CO; New York, NY; San …19h ago
-
Machine Learning Engineer USD 180K-250KAWS | Azure | CUDA | DDP | Distributed Training401k employer match | Health, dental, vision insurance | Paid time off | Professional development | Work-life balanceMid-level Full TimeEmeryville, California, United States; Hybrid (2-3 … R19h ago
-
Strategic Project Lead - Code USD 200K-250KArtificial Intelligence | Data Pipelines | Fine Tuning | Human Feedback | LLM EvaluationFive-day workweek | Flexible working hours | Supportive work cultureSenior-level Full TimeSan Francisco, California, United States19h ago