AI Performance Optimization Engineer
Tasks
- Build benchmark suites and regression frameworks
- Collaborate on ML and platform best practices
- Document performance tuning playbooks
- Evaluate new hardware and software offerings
- Identify and eliminate bottlenecks
- Implement and tune quantization sparsity and pruning
- Implement compiler-level optimizations
- Improve cost efficiency through scheduling and hardware selection
- Optimize KV cache and batching for LLM serving
- Optimize data pipelines sharding and storage access
- Optimize distributed training using parallelism and sharding
- Profile and optimize AI training and inference pipelines
- Translate AI research into production improvements
- Tune attention implementations for performance
Perks/Benefits
Skills/Tech-stack
C++ | Continuous batching | Custom Kernel | Custom kernel development | Cutlass | DeepSpeed | Distributed Training | FSDP | FlashAttention | GPU Architecture | KV cache | Kernel development | Memory Management | Model Parallelism | Paged Attention | Pipeline parallelism | Profiling | Pruning | Python | Quantization | Sparsity | Speculative decoding | TVM | TensorRT-LLM | TorchInductor | Triton | VLLM | XLA | Zero
Education
Related jobs
-
Staff Machine Learning Engineer, Embeddings USD 253K-354KA/B | A/B Testing | B testing | C++ | Cloud ComputingCaregiving support | Comprehensive healthcare benefits | Employer 401k match | Family planning support | Flexible vacationSenior-level Full TimeRemote - United States R21h ago
-
AI systems | APIs | Agent Frameworks | Architecture Design | DebuggingCompetitive equity | Relocation support | Remote work | Travel occasionallySenior-level Full TimePalo Alto, CA; Onsite R1d ago
-
Software Engineer AI/ML USD 112K-150KA/B | A/B Testing | AWS | Anomaly Detection | Artificial IntelligenceDental insurance | Employee assistance program | Health coaching program | Health insurance | Retirement benefitsMid-level Full TimeEvendale, United States R1d ago
-
AI Services | AWS Glue | AWS Lambda | AWS Step Functions | Amazon AICareer advancement | Certification opportunities | Exposure to cutting-edge technologies | Mentorship programs | Ongoing trainingMid-level Full TimeUnited States - Remote R1d ago
-
AI/ML Engineer - Higher Ed USD 101K-163KAWS Bedrock | AWS Lambda | Amazon ECS | Amazon SageMaker | Anthropic APIMid-level Full TimeVirtual US IL, United States R1d ago
-
AI/ML Engineer - School USD 101K-163KAWS Bedrock | AWS Lambda | Amazon ECS | Amazon SageMaker | Anthropic APIMid-level Full TimeVirtual US IL, United States R1d ago
-
Automatic Clustering | CI/CD | DBT | Data Modeling | Data WarehousingHybrid work schedule | Onsite 3 days per weekSenior-level ContractTrenton, NJ R1d ago
-
Senior Staff Data Engineer USD 225K-290KAI | Alerting | Astronomer Airflow | BigQuery | Blameless postmortemsFlexible work environment | Inclusive culture | Stakeholder mindsetSenior-level Full TimeU.S. - California, United States R1d ago
-
Senior Manager - AI Engineering USD 159K-207KAI Foundry | Agentic AI | Artificial Intelligence | Azure | Azure AISenior-level Full TimeRemote - PA, United States R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAccelerator hardware | Agentic Systems | Computer Vision | Data Quality | Data labelingRemote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KActive Learning | Apache Beam | CI/CD | Caching | Code reviewCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Senior Data Engineer, Knowledge & Information USD 153K-238KAWS | Alerting | Apache Airflow | Apache Spark | CI/CD401k company match | Dental insurance | Disability insurance | Flexible time off | Health insuranceSenior-level Full TimeUnited States R1d ago
-
Quantitative Engineer USD 140K-155KAI Assistant | API Design | AWS | CI/CD | Credit facility401k | Dental insurance | Fitness fund | Health insurance | Learning and development fundSenior-level Full TimeRemote - USA R1d ago
-
Senior, ML Engineer - Auto Tagger USD 177K-212KAWS | Apache Arrow | Apache Beam | Apache Spark | Cloud platform401k match | Company holiday office closures | Company-paid medical, dental & vision | Disability insurance | Flexible scheduleSenior-level Full TimeAnn Arbor, MI, Remote - US R1d ago
-
Enterprise Sales Engineer - Southern California USD 118K-157K.NET | CRM | Csharp | Go | JavaCareer pathing | Community guilds | Continuous professional development | Hybrid workplace | Inclusion talksSenior-level Full TimeCalifornia, USA, Remote R1d ago
-
Senior AI Engineer USD 160K-250KAPI Design | Agent Orchestration | Agent systems | Audit Logging | Authentication401k eligibility | Flexible work environment | Hybrid work option | Paid time off | Parental leave eligibilitySenior-level Full TimeUnited States (Remote) R1d ago
-
Senior Analytics Engineer, GTM USD 175K-244KBigQuery | ClickHouse | DBT | LLMs | PythonFlexible time off | Flexible work environment | Global gatherings | Healthcare employer contributions | Home office setupSenior-level Full TimeSan Francisco, USA (Hybrid) R1d ago
-
Senior Analytics Engineer, Product USD 175K-244KAI Automation | AI automation frameworks) | Analytics engineering | Automation frameworks | BigQueryEquity stock options | Flexible time off | Flexible work environment | Global gatherings | Healthcare employer contributionsSenior-level Full TimeSan Francisco, USA (Hybrid) R1d ago
-
Software Engineer II, Computational Platform USD 124K-154KAPIs | AWS | Cloud Networking | Data Modeling | Docker401k plan | Commuter support | Company-provided laptop | Flexible paid time off | Holiday payMid-level Full TimeRemote; Watertown, Massachusetts, United States R1d ago
-
Senior-level Full TimeRemote - United States R1d ago
-
Staff SW Engineer, Machine Learning USD 150K-180KAWS | ClearML | Computer Vision | Deep learning | Docker401k match | Dental insurance | Disability insurance | Employee assistance program | Employee stock purchase programSenior-level Full TimeRemote, USA R1d ago
-
Senior AI Engineer USD 90K-150KAWS | Agentic Workflows | Amazon Bedrock | BigQuery | Call Support401k match | Insurance | PTO | Stock option grants | Stock purchase planSenior-level Full TimeRemote - United States R1d ago
-
Senior Data Engineer IS - Remote USD 122K-208KAPI Integration | Access Control | Alerting | Bash | CDCRotational on-callSenior-level Full TimeRenton, WA, United States R1d ago
-
Senior-level Full TimeRemote - US R1d ago
-
Machine Learning Engineer USD 128K-214KAWS | Agile | Azure | Cloud platform | GitHealth insurance | Holiday pay | Learning and development | Life insurance | Long-term disabilityMid-level Full TimeUSA-Remote Work R1d ago