AI Performance Optimization Engineer
USD 136K-258K (estimate) Mid-level Full Time
Tasks
- Build benchmark suites
- Collaborate on ML best practices
- Document performance tuning playbooks
- Drive compiler level optimizations
- Evaluate new hardware and software
- Identify and eliminate bottlenecks
- Implement continuous batching
- Implement quantization sparsity pruning
- Implement speculative decoding
- Improve cost efficiency
- Optimize attention implementations
- Optimize data pipelines
- Optimize kv cache
- Profile and optimize AI inference pipelines
- Profile and optimize AI training pipelines
- Research and translate AI advances
- Tune distributed training
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | C++ | Continuous batching | Custom Kernel | Custom Kernel Authoring | Cutlass | Deep learning | DeepSpeed | Distributed Training | Distributed inference | FSDP | FinOps | Flash Attention | GPU Architecture | KV cache | Kernel authoring | Memory Management | Model Parallelism | Paged Attention | Pipeline parallelism | Profiling | Pruning | Python | Quantization | Sparsity | Speculative decoding | TVM | Tensor Parallelism | TensorRT-LLM | Torch Inductor | Triton | VLLM | XLA | Zero
Education
Related jobs
-
Forward Deployed Machine Learning Engineer USD 180K-300KAPI Design | Cloud Computing | Deep learning | Diffusion Models | Fine TuningIn-person collaboration days | Remote work flexibility | Travel cost coverageSenior-level Full TimeSan Francisco (USA) R10h ago
-
Principal AI Platform Engineer USD 167K-220KAgent Orchestration | Backend Development | Braintrust | Cost Optimization | Data PipelinesEquity | Flexible Token Limits | Health, dental, vision coverage | Unlimited paid time offSenior-level Full TimeSan Francisco, California R13h ago
-
Full Stack Software Engineer - Robotics USD 125K-200KAWS | Datadog | Distributed Systems | Edge Computing | Grafana401k | Cell phone reimbursement | DC FSA | Employee assistance program | EquityMid-level Full TimeSan Francisco || Oakland, CA R18h ago
-
AI Developer - Model Creation & Full Stack USD 150K-175KAWS | Angular | Azure | CI/CD | D3.jsRemote work | USPS Public Trust Clearance eligibleMid-level Full TimeWork from home, VA, United States R18h ago
-
Graduate AI Solutions Analyst USD 65K-65KAI Prototyping | Artificial Intelligence | Automation | ChatGPT | Generative AI401k | Dental insurance | Disability insurance | Employee assistance program | Flexible time offEntry-level Full TimeDenver, Colorado, United States R19h ago
-
Sales Engineer USD 125K-160KAPI | AWS EKS | Amazon Web Services | Apache Spark | Cloud platformWork from anywhereMid-level Full TimeUnited States R20h ago
-
API Integration | AWS | AWS Glue | Batch Processing | Code reviewSenior-level Full TimeIndianapolis, IN, United States R20h ago
-
Applied AI Engineer, Agentic Systems USD 115K-192K.NET | APIs | Anthropic | CrewAI | Evaluation FrameworksAI and productivity tools access | Remote work accessSenior-level Full TimeRemote - United States R1d ago
-
Senior-level Full TimeSan Francisco - Remote, CA, United … R1d ago
-
Senior Industrial Engineer, Process Optimization USD 100K-120K5S | AutoCAD | Cause analysis | Cost modeling | Excel401k | Dental insurance | Disability insurance | Flexible spending account | Health savings accountSenior-level Full TimeBethlehem, PA, United States R1d ago
-
Machine Learning Engineer II GBP 124K-186KAWS | Anomaly Detection | Athena | Bedrock | C++Formal learning opportunities | Hybrid work | On-the-job learningMid-level Full TimeUSA – MN – Minneapolis, United … R1d ago
-
Edge AI Engineer USD 130K-200KBenchmarking | C++ | Core ML | Edge Computing | Embedded SystemsCareer growth | Health benefits | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 150K-222KAccelerator hardware | Agentic Systems | Data Quality | Data quality monitoring | Deep learningCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Distinguished Engineer, Applied AI USD 150K-300KAWS | Agentic AI | Algorithms | Artificial Intelligence | Auto-failover401k match | Adoption Assistance | Career mentorship | Certification assistance | Employee trainingSenior-level Full TimeCA Palo Alto Office, United States R1d ago
-
AI Data Infrastructure Engineer USD 146K-189KApache Beam | CI/CD | Code review | Data Lineage | Data ModelingBenefits package | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 146K-189KActive Learning | Apache Beam | CI/CD | Caching | Code reviewMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 150K-270KAdapter-Tuning | DPO | Dataset curation | Distributed Training | Evaluation methodologyCareer growth | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
LLM Platform Engineer (Windchill / Teamcenter) USD 116K-177KAWS | Ansible | Azure | CAD Integration | CI/CDCareer growth opportunities | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Performance Optimization Engineer USD 136K-258KC++ | Continuous batching | Deep learning | Distributed Systems | FSDPMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering Architect USD 119K-228KAgent systems | Agentic Systems | Embeddings | Evaluation Frameworks | LLM APIsCareer growth | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Quantitative Developer (Fintech) USD 121K-213KAudit trails | Backtesting | C++ | Cloud Native | Cloud Native ArchitectureMid-level Full TimeUnited States - Remote R1d ago
-
Robotics Software Engineer USD 125K-169KBehavior Trees | C++ | Cameras | Concurrent Systems | Control SystemsCareer growth | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Mid-level Full Time6314 Remote/Teleworker US, United States R1d ago
-
Senior AI Engineer USD 107K-195KAI Evaluation | AI Safety | API Integration | Agent systems | AutogenSenior-level Full Time6314 Remote/Teleworker US, United States R1d ago
-
Senior Engineer, Data Science USD 111K-178KCloud Computing | Data Engineering | Data Governance | Data Pipelines | DatabricksSenior-level Full TimeOklahoma City, OK, United States R1d ago