AI Performance Optimization Engineer
Tasks
- Apply compiler-level optimizations for end-to-end performance
- Build benchmark suites and regression frameworks
- Collaborate with ML and platform teams to standardize best practices
- Document performance tuning playbooks and share findings
- Evaluate new hardware and software offerings
- Identify and eliminate performance bottlenecks
- Implement and tune quantization, sparsity, and pruning
- Improve cost efficiency through architecture, hardware, and scheduling choices
- Optimize AI training and inference pipelines for throughput, latency, and cost
- Optimize KV cache and batching for LLM serving
- Optimize distributed training with parallelism and sharding
- Translate AI research into production performance improvements
- Tune attention implementations for inference efficiency
Skills/Tech-stack
Attention Mechanisms | C++ | Compiler optimization | Continuous batching | Custom Kernel | Custom Kernel Authoring | CUTLASS | DeepSpeed | Distributed Training | FSDP | FinOps | FlashAttention | GPU Architecture | HPC | KV cache | Kernel authoring | Memory Management | Model Parallelism | Paged Attention | Pipeline parallelism | Profiling | Pruning | Python | Quantization | Sparsity | Speculative decoding | TVM | Tensor Parallelism | TensorRT-LLM | TorchInductor | Triton | vLLM | XLA | ZeRO
Education
Bachelor of Engineering | Bachelor of Science | Master of Science