AI Performance Optimization Engineer
USD 136K-258K (estimate) | Mid-level | Full Time
Tasks
- Build benchmarking and regression frameworks
- Collaborate on ML systems best practices
- Document performance tuning playbooks
- Drive compiler-level optimizations
- Evaluate hardware and software for adoption
- Identify and eliminate performance bottlenecks
- Implement quantization, sparsity, and pruning
- Improve AI cost efficiency
- Optimize KV cache and batching for LLM serving
- Optimize data pipelines and storage access
- Optimize distributed training with parallelism
- Profile and optimize AI training and inference pipelines
- Stay current with AI performance research
- Tune attention implementations
Skills/Tech-stack
Access patterns | Benchmarking | C++ | Cache optimization | Compiler optimization | Continuous batching | Data Sharding | Deep learning | Distributed Training | Distributed inference | FSDP | FlashAttention | GPU Architecture | KV cache | KV cache optimization | Memory Management | Model Parallelism | Paged Attention | Pipeline parallelism | Profiling tools | Pruning | Python | Quantization | Regression testing | Sparsity | Speculative decoding | Storage Access | Storage Access Patterns | TVM | Tensor Parallelism | TorchInductor | Triton | XLA | ZeRO
Education
Bachelor of Engineering | Bachelor of Science | Master of Science