AI Performance Optimization Engineer
Tasks
- Build benchmark suites and regression frameworks
- Collaborate with ML and platform engineering teams
- Document performance tuning playbooks
- Drive compiler-level optimization with Triton, XLA, TorchInductor, and TVM
- Evaluate new hardware and software offerings
- Identify and eliminate bottlenecks in data loading, model compute, communication, and memory
- Improve cost efficiency through model architecture, hardware selection, and scheduling
- Optimize AI training and inference pipelines for throughput, latency, and cost
- Optimize KV cache, continuous batching, and speculative decoding for LLM serving
- Optimize data pipelines, sharding strategies, and storage access patterns
- Optimize distributed training with tensor parallelism, pipeline parallelism, FSDP, and ZeRO-style sharding
- Stay current with AI systems research and apply it to production
- Tune attention implementations with FlashAttention and paged attention
- Tune quantization, sparsity, and pruning strategies
Perks/Benefits
Skills/Tech-stack
Attention Mechanisms | Benchmarking | C++ | Compiler optimization | Continuous batching | DeepSpeed | Distributed Training | FSDP | FinOps | FlashAttention | GPU Architecture | HPC | KV cache | Memory Management | Model Parallelism | Paged Attention | Pipeline parallelism | Profiling | Pruning | Python | Quantization | Regression testing | Sparsity | Speculative decoding | TVM | Tensor Parallelism | TensorRT-LLM | TorchInductor | Triton | vLLM | XLA | ZeRO
Education
Bachelor of Engineering | Bachelor of Science | Master of Science