AI Performance Optimization Engineer
USD 136K-258K (estimate) | Mid-level | Full Time
Tasks
- Build benchmark suites and regression frameworks
- Collaborate with ML and platform engineering teams
- Document performance tuning playbooks
- Drive compiler-level optimizations
- Evaluate hardware and software for adoption
- Identify and eliminate throughput, latency, and memory bottlenecks
- Implement and tune quantization, sparsity, and pruning
- Improve AI cost efficiency through architecture and scheduling
- Optimize KV cache, continuous batching, and speculative decoding
- Optimize data pipelines, sharding, and storage access patterns
- Optimize distributed training with parallelism and sharding
- Profile and optimize AI training and inference pipelines
- Translate AI research advances to production
- Tune attention implementations for performance
Perks/Benefits
- N/A
Skills/Tech-stack
Access Optimization | Attention Optimization | Benchmarking | C++ | Compiler optimization | Continuous batching | Data loading | Data loading optimization | Deep learning | Distributed Training | FSDP | FlashAttention | GPU Architecture | KV cache | Loading Optimization | Memory Management | Model Compression | Model Parallelism | Paged Attention | Pipeline parallelism | Profiling | Pruning | Python | Quantization | Quantization aware training | Regression testing | Sparsity | Speculative decoding | Storage Access | Storage Access Optimization | TVM | Tensor Parallelism | TorchInductor | Triton | XLA | ZeRO
Education