Staff Technical Lead for Inference & ML Performance
Tasks
- Apply compiler strategies for inference
- Collaborate with research and applied machine learning teams
- Contribute to critical inference performance optimizations
- Develop and optimize kernels
- Guide team to build high performance inference solutions
- Identify and eliminate inference performance bottlenecks
- Implement model parallelism
- Implement performance optimizations
- Improve model serving performance
- Influence inference strategies and deployment techniques
- Mentor and scale performance focused engineers
- Set technical direction for inference performance
Perks/Benefits
- N/A
Skills/Tech-stack
CUDA | Compilation | Cutlass | Distributed Serving | Kernel optimization | Machine Learning | Model Inference | Model Parallelism | NVIDIA Triton | Profiling | PyTorch | Quantization | TensorRT | Transformer Models | TransformerEngine
Education
N/A
Regions
Countries
States
Related jobs
-
Data Synthesis | Deep learning | Language Models | Language Processing | Large Language ModelsEntry-level InternshipSan Jose, California, United States14h ago
-
AWS | Alteryx | Amazon SageMaker | Azure | Azure DataMid-level Full TimeNew York, NY, United States14h ago
-
Strategic Intelligence & Advanced Analytics Engineer USD 108K-136KAnomaly Detection | Artificial Intelligence | Azure | Data Pipelines | Data QualityPaid parental leave | Paid time off | Public service loan forgiveness | Tuition reimbursement | Wellness programsMid-level Full TimeTexas-Dallas-5323 Harry Hines Blvd14h ago
-
Fine Tuning | GPU resource management | Intelligent agents | Language Models | Large Language ModelsEntry-level Full TimeSan Jose, California, United States14h ago
-
Software Engineer, Video AI/ML Specialist USD 141K-211KAI | AV1 | AV2 | Audio Processing | Audio/VideoMid-level Full TimeBellevue, WA | Menlo Park, CA …15h ago
-
Tech Lead, AI Research Scientist (Robotics) USD 170K-251KAction Conditioned World Models | Artificial Intelligence | Computer Vision | Deep learning | Dexterous ManipulationMentorship opportunities | Open science contributions | Work authorization supportSenior-level Full TimeMenlo Park, CA15h ago
-
Network Engineer, Deployment & Support USD 101K-156K400G | 800G | AI | Automation | Coherent opticsMid-level Full TimeMenlo Park, CA | Eagle Mountain, …15h ago
-
Senior Software Engineer, Database Internals, AlloyDB USD 174K-252KC# | C++ | Code optimization | Concurrency Control | Database InternalsEntry-level Full TimeSunnyvale, CA, USA15h ago
-
Artificial Intelligence | Data Analysis | Data Structures | Data structures algorithms | Human-in-the-loopSenior-level Full TimeMountain View, CA, USA15h ago
-
Agent tooling | Artificial Intelligence | C++ | Cloud Architecture | Conversational AISecret clearance | TravelSenior-level Full TimeAtlanta, GA, USA; Austin, TX, USA15h ago
-
Software Engineer III, AI/ML GenAI, Google Cloud Compute USD 147K-211KAudio generation | C++ | Computer Vision | Data Processing | DebuggingSenior-level Full TimeSunnyvale, CA, USA15h ago
-
Senior Photonic Engineer, Machine Learning USD 159K-231KCircuit simulation | Data center | Data center network | Data center network architecture | Digital SignalSenior-level Full TimeSunnyvale, CA, USA15h ago
-
Data Processing | Data Storage | Data Structures | Data Structures and Algorithms | Distributed SystemsSenior-level Full TimeMountain View, CA, USA15h ago
-
Senior Data Scientist - Clinical AI development USD 100K-155KAPI Design | CI/CD | Cloud Computing | Containerization | Data Pipelines401k | Disability insurance | Employee assistance program | Flexible vacation | Life insuranceSenior-level Full TimeLexington, MA, US18h ago
-
Applied AI ML Lead - LLM SUITE ENGINEERING USD 176K-215KAPI Design | AWS | Agentic AI | Caching | Cloud NativeBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeWilmington, DE, United States23h ago
-
Senior-level Full TimeRaleigh, NC, US1d ago
-
Senior AI Engineer USD 107K-199KAKS | API Design | Alerts | Anomaly Detection | Apache SparkHybrid work environment | Inclusion support | Learning opportunities | Well-being supportSenior-level Full TimeUSA, Massachusetts, Boston, 200 Berkeley Street, …1d ago
-
Associate AI Engineer USD 80K-134KAPI Development | Azure | Cloud Platforms | Data Preparation | DocumentationFlexible work environment | Hybrid work arrangement | Inclusion programs | Paid time off | Wellness benefitsMid-level Full TimeUSA, Massachusetts, Boston, 200 Berkeley Street, …1d ago
-
Entry-level Full TimeUnited States - Remote R1d ago
-
CI/CD | Docker | Drift Detection | Embeddings | Experiment trackingMentorship | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Marketing Intelligence Engineer USD 150K-175KAPIs | Analytics | Automation | Azure | Dashboarding401k matching | Dental insurance | Health insurance | Hybrid work flexibility | Paid parental leaveSenior-level Full TimeMadison, WI1d ago
-
Senior Data Engineer USD 82K-172KAWS | Apache Spark | Artificial Intelligence | BERT | BitbucketContinuing education | Family support benefits | Flexible time off | Healthcare benefits | Learning resourcesSenior-level Full Time606 KING OF PRUSSIA PA, United …1d ago
-
Staff AI/ML Engineer USD 108K-227KAWS | Adversarial Networks | Bitbucket | CUDA | CupyFlexible time off | Learning resources | MentoringSenior-level Full Time606 KING OF PRUSSIA PA, United …1d ago
-
Staff AI/ML Engineer (LLMs) USD 108K-227KAWS Bedrock | Agentic AI | Arize Phoenix | Bitbucket | CUDAFlexible time off | Learning and development resourcesSenior-level Full Time606 KING OF PRUSSIA PA, United …1d ago
-
Machine Learning Engineer II USD 131K-184KAzure | Batch inference | Data Pipelines | Databricks | Distributed SystemsContinuous learning | Flexible ways of working | Growth mindset cultureMid-level Full TimeUSA TX Houston Hybrid, United States R1d ago