AI Performance Optimization Engineer
Tasks
- Build benchmark suites and regression frameworks
- Collaborate to embed best practices in production pipelines
- Document performance tuning playbooks and share findings
- Drive compiler level optimizations for end to end performance
- Evaluate new hardware and software and recommend adoption
- Identify and eliminate performance bottlenecks
- Implement and tune quantization sparsity and pruning
- Improve cost efficiency through model hardware and scheduling
- Optimize AI training and inference pipelines for throughput latency and cost
- Optimize KV cache continuous batching and speculative decoding
- Optimize data pipelines and sharding for training throughput
- Optimize distributed training with tensor and pipeline parallelism
- Translate AI research advances into production improvements
- Tune attention implementations for faster inference
Perks/Benefits
- N/A
Skills/Tech-stack
Access Optimization | Attention Mechanisms | Benchmarking | C plus plus | CPU | Cache optimization | Compiler optimization | Continuous batching | Data Sharding | Data loading | Data loading optimization | DeepSpeed | Distributed Training | FSDP | FlashAttention | GPU Architecture | GPU Profiling | KV cache | KV cache optimization | Loading Optimization | Memory Management | Model Parallelism | Paged Attention | Pipeline parallelism | Profiling tools | Pruning | Python | Quantization | Regression testing | Sparsity | Speculative decoding | Storage Access | Storage Access Optimization | TVM | Tensor Parallelism | TensorRT | TorchInductor | Triton | XLA | Zero
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Related jobs
-
Senior AI/ML Engineer USD 125K-157KAWS | Access Controls | Agent Orchestration | Audit trails | CI/CD401k | Commuter benefits | Employee referral program | Fertility care benefits | Free testingSenior-level Full TimeUS Remote R9h ago
-
Bioinformatics Production Analyst/Production Engineer USD 102K-128KData Visualization | Genomics | HIPAA | Health information | Linux401k benefits | Commuter benefits | Disability insurance | Employee referral program | Fertility care benefitsEntry-level Full TimeUS Remote R11h ago
-
IT Data Engineer USD 117K-124KANSI X12 | ARM Templates | Automl | Azure | Azure DataBackground screening provided | Flexible work hours | Mentorship | Professional development opportunitiesSenior-level Full TimeUS - Remote R17h ago
-
Analytics Engineer USD 147K-225KApache Airflow | BigQuery | DBT | Data Modeling | Data Visualization401k | Comprehensive benefits | Equity | Flexible time offSenior-level Full TimeUS Remote, Los Angeles, CA; San … R1d ago
-
Autonomy | C++ | CPU GPU | CPU GPU Debugging | Critical Systems401k | Health insurance | Paid Company Holidays | Paid time off | Phone stipendSenior-level Full TimeSan Carlos - Hybrid R1d ago
-
Autonomy | C++ | Data Ingestion | Data Ingestion Pipelines | Deployment401k | Health insurance | Paid Holidays | Paid time off | Phone stipendMid-level Full TimeSan Carlos - Hybrid R1d ago
-
Senior Databricks Engineer USD 180K-247KAWS | Autoscaling | Azure | CI/CD | CachingVisa sponsorshipSenior-level Full TimeCanada R1d ago
-
AI Research Engineer USD 100K-150KAblation Studies | Accelerator hardware | Agentic Systems | Data Quality | Data labeling100 percent remote work | Career growth opportunities | Visa transfer support for qualified candidatesMid-level Full TimeUnited States - Remote R1d ago
-
Hadoop Big Data Developer USD 100K-150KAWS EMR | Airflow | Apache Atlas | Apache Flink | Apache HBaseLong-term career growth | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Hadoop Big Data Developer USD 100K-150KAWS EMR | Airflow | Apache Atlas | Apache Flink | Apache HiveCareer growth | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Mid-level Full TimeUnited States - Remote R1d ago
-
AI Data Engineer USD 100K-150KActive Learning | Apache Beam | Apache Spark | CI/CD | CachingBenefits | Career growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Software Engineer (Remote) USD 50K-130KAgile | ETL | Git | Google BigQuery | Google DataformRemote workEntry-level Full TimeTEXAS - VIRTUAL - TX01, United … R1d ago
-
Sr Staff - Data Platform Engineer USD 220K-255KAWS EMR | AWS Lambda | AWS S3 | Airflow | Apache HudiDental insurance | Disability insurance | Flexible spending account | Health insurance | Health savings accountSenior-level Full TimeCalifornia - Remote Office, United States R1d ago
-
LLM Engineer USD 100K-150KAdapter-Tuning | Automated benchmarking | DPO | Dataset curation | Direct Preference OptimizationCareer growth potential | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Data & Integrations Engineer USD 100K-100KAPI Gateway | API Management | AWS API | AWS API Gateway | AWS LambdaCareer development | Fully remoteSenior-level Full TimeRemote Ohio, United States R1d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | C++100 percent remote | Career growth | H1B transfer supportSenior-level Full TimeUnited States - Remote R1d ago
-
Audio Machine Learning | Audio Separation | Cloud Computing | Fine Tuning | Language ModelsAviation immersion opportunities | Equity upside | Free flight training | Remote work | Technical leadership opportunitySenior-level Full TimeSan Francisco, CA; Onsite R1d ago
-
Senior MLOps & Generative AI Engineer - Remote USD 91K-152KAWS | Alerting | Azure | CI/CD | Deep learningAdoption reimbursement | Emergency backup care | Fertility and surrogacy reimbursement | Long-term disability | Medical/Dental/VisionSenior-level Full TimeCorp Facilities MPB - 350 Centre … R1d ago
-
Manager, Applied AI Engineer USD 171K-375KAI/ML | ASR | CRM Integration | Go | GuardrailsHybrid work | Remote work | Work-life balanceSenior-level Full TimeRemote (US), United States R1d ago
-
Deep learning | Distributed Computing | JAX | Machine Learning | PyTorch401k match | Dental insurance | Employee assistance program | Flexible work schedules | HolidaysSenior-level Full TimeUS-CT-EAST HARTFORD-ETC ~ 400 Main St … R1d ago
-
Senior-level Full TimeCanada R1d ago
-
Sr. AI Engineer (Applied AI & ML Systems) USD 132K-165KAgentic AI | Context engineering | Continuous Improvement | Data Engineering | Data PipelinesE learning license | Hackathons | Healthcare benefits | Home office setup allowance | Identity theft protectionSenior-level Full TimeUnited States R1d ago
-
Data Engineer USD 95K-140KApache Spark | Automated testing | Azure Databricks | CI/CD | Data ModelingMid-level Full TimeUS Remote R1d ago
-
Senior Analytics Engineer USD 180K-208KBigQuery | Cube | Dashboards | Data Modeling | Data orchestration401k with payroll match | Dental vision and mental health care | Employer sponsored medical care | Equity | Flexible PTOSenior-level Full TimeSan Francisco R1d ago