AI Performance Optimization Engineer
Tasks
- Build benchmark suites and regression frameworks
- Collaborate with ML and platform teams to apply best practices
- Document performance tuning playbooks and share results
- Drive compiler level optimizations with Triton XLA TorchInductor or TVM
- Evaluate new hardware and software offerings and advise on adoption
- Identify and eliminate bottlenecks in data loading compute communication and memory
- Implement and tune quantization sparsity and pruning
- Optimize AI training and inference pipelines for throughput latency and cost
- Optimize KV cache continuous batching and speculative decoding for LLM serving
- Optimize data pipeline sharding strategies and storage access patterns
- Optimize distributed training using tensor parallelism pipeline parallelism FSDP and ZeRO style sharding
- Tune attention implementations with FlashAttention and paged attention
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | C++ | CUDA | Compiler optimization | Continuous batching | Cutlass | DeepSpeed | Distributed Training | Distributed inference | FSDP | FlashAttention | GPU Architecture | KV cache | Memory Management | Paged Attention | Pipeline parallelism | Profiling | Pruning | Python | Quantization | Sparsity | Speculative decoding | TVM | Tensor Parallelism | TensorRT-LLM | TorchInductor | Triton | VLLM | XLA | Zero
Education
Related jobs
-
Databricks Solution Architect USD 180K-247KAWS S3 | Apache Spark | Autoscaling | Azure Data | Azure Data LakeSenior-level Full TimeUnited States R23h ago
-
C++ | Cloud Computing | Code Reviews | Deployment Automation | Distributed Systems401k match | Caregiving support | Family planning support | Flexible vacation | Gender-affirming careSenior-level Full TimeRemote - United States R1d ago
-
APIs | Agentic Workflows | CI/CD | Cost Management | GeminiSenior-level Full TimeRemote - USA, United States R1d ago
-
Edge AI Engineer USD 100K-150KBias Evaluation | C++ | Core ML | DSP | Edge ComputingCareer growth opportunities | Health benefits | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Edge AI Engineer USD 100K-150KBias Evaluation | C++ | Core ML | Edge Computing | Edge inferenceCareer growth potential | H1B transfer support for eligible candidates | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAblation Studies | Agentic Systems | Computer Vision | Data Quality | Data quality monitoringMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAblation Studies | Accelerator hardware | Computer Vision | Data Quality | Data quality monitoringMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KActive Learning | Apache Beam | Apache Spark | CI/CD | CachingCareer growth | Inclusion and diversity | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KActive Learning | Apache Beam | CI/CD | Caching | Code reviewMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapter-Tuning | Attention Optimization | DPO | Distributed Training | Evaluation methodologyCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Mid-level Full TimeUnited States - Remote R1d ago
-
AI Performance Optimization Engineer USD 100K-150KAttention Mechanisms | Benchmarking | C++ | Compiler optimization | Continuous batchingBenefits | Career growth | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering Architect USD 100K-150KAgent systems | Agentic Workflows | Cost Optimization | Embeddings | Evaluation FrameworksCareer growth | Employee mentoring | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering Architect USD 100K-150KAgentic Workflows | Chunking | Embeddings | Evaluation Frameworks | IndexingSenior-level Full TimeUnited States - Remote R1d ago
-
Quantitative Developer (Fintech) USD 100K-150KBacktesting | C++ | Cloud Computing | Concurrency | DebuggingMid-level Full TimeUnited States - Remote R1d ago
-
Robotics Software Engineer USD 100K-150KBehavior Trees | C++ | Concurrent Systems | Control Systems | Embedded Systems100 percent remote | Benefits | Full time direct W2 employment | H1B transfer supportMid-level Full TimeUnited States - Remote R1d ago
-
Robotics Software Engineer USD 100K-150KBehavior Tree | C++ | Concurrent programming | Control Systems | DebuggingMid-level Full TimeUnited States - Remote R1d ago
-
Senior AI Engineer, Enterprise Agentic Solutions USD 142K-196KAPI Development | AWS | Agent systems | Autogen | AzureComprehensive health benefits | Onsite onboarding travel reimbursement | Paid time off | Remote work | Retirement benefitsSenior-level Full TimeRemote - Minnesota, United States R1d ago
-
Manager, AI Engineering - Analytics USD 197K-267KAgent systems | Artificial Intelligence | Data Modeling | Data Warehousing | EvalsHybrid work flexibility | Professional growth opportunities | Stock equity | Work-life balanceMid-level Full TimeHybrid - San Francisco R1d ago
-
Machine Learning Operations Engineer USD 133K-167KAWS SageMaker | Docker | GitHub Actions | Machine Learning | NumPyCareer development | Communities | Commuting cost coverage | Corporate giving programs | Daily free lunchMid-level Full TimeBoston, Massachusetts, United States R1d ago
-
Applied AI Engineer, Investments USD 134K-183KAPIs | Artificial Intelligence | Cloud technologies | Data Pipelines | Data Processing401k match | Family-forming benefits | Paid time off | Relocation support | Volunteer time offEntry-level Full TimeRedwood City, CA (Hybrid) R1d ago
-
Senior-level Full TimeRemote - United States R1d ago
-
Forward Deployed AI Engineer, West USD 125K-175KAWS | Azure | Docker | GCP | Generative AI401k plan | Dental insurance | Medical insurance | Parental leave | Unlimited paid time offMid-level Full TimeRemote (San Francisco) R1d ago
-
Senior Machine Learning Engineer, Reinforcement Learning USD 150K-250KDomain Randomization | Embedded Systems | Gazebo | Isaac-Gym | Mujoco401k retirement plan | Dental insurance | Employee referral bonus | Flexible PTO | Free lunchSenior-level Full TimeColumbus, Ohio or Remote R1d ago
-
Senior Data Platform Engineer USD 140K-220KApache Hudi | Apache Spark | CI/CD | Delta Lake | Distributed StorageSenior-level Full TimePittsburgh, PA or Remote R1d ago