Performance Engineer, GPU
San Francisco, CA | New York City, NY | Seattle, WA
USD 280K-850K Senior-level Full Time
Tasks
- Architect GPU performance systems
- Build distributed GPU communication strategies
- Develop custom GPU kernels
- Develop performance modeling frameworks
- Implement GPU utilization optimizations
- Implement kernel fusion strategies
- Improve end to end training and inference efficiency
- Optimize tensor core performance
- Partner with hardware vendors
- Profile production ML performance bottlenecks
Perks/Benefits
- Flexible working hours
- Generous vacation
- Hybrid work 25 percent
- Optional equity donation matching
- Parental leave
- Visa sponsorship support
Skills/Tech-stack
Bandwidth Optimization | CUDA | Cluster Orchestration | Collective communication | Custom Operators | Cutlass | FP8 Quantization | Fault Tolerance | Flash Attention | Int8 Quantization | JAX | Kernel Fusion | Memory bandwidth | Memory bandwidth optimization | Mixed Precision | Model Parallelism | NCCL | NVLink | Nsight | PyTorch | Tensor Core | Tensor core optimization | Torch compile | Triton | XLA
Education
Regions
Countries
States
Related jobs
-
Data Scientist II - Computer Vision USD 140K-170KComputer Vision | Convolutional Neural Networks | Deep learning | Experiment tracking | Field extractionMid-level Full TimeRemote - US R1d ago
-
AI Developer Subcontractor USD 101K-195KAWS | Big Data | C plus plus | Computer Vision | Data ProcessingMid-level Contract Full TimeFL, United States1d ago
-
Senior Software Engineer (Machine Learning) USD 180K-213KAirflow | Apache Spark | BigQuery | CI/CD | Cloud infrastructureSenior-level Full TimeChicago1d ago
-
Senior Machine Learning Operations Engineer USD 140K-220KAWS | Azure | C++ | Computer Vision | Convolutional Neural NetworksCareer growth | Collaborative work environment | Professional development opportunitiesSenior-level Full TimeNew York, NY1d ago
-
Senior Staff ML Engineer USD 240K-310KAWS | Airflow | Apache Spark | Cloud platform | Deep learning401k match | Fertility and family building support | Flexible vacation policy | Gender affirming care cost coverage | Monthly Food StipendSenior-level Full TimePalo Alto1d ago
-
Senior Machine Learning Engineer, Perception USD 170K-215K3D Geometry | BEV Representation | Computer Vision | Deep learning | Distributed TrainingSenior-level Full TimeSanta Clara, CA1d ago
-
Senior Staff Machine Learning Engineer USD 264K-378KAgile | Experimentation | Generative AI | JAX | Java401k retirement plan | Health insurance | Meal allowance | Paid days off | Paid flexible holidaysSenior-level Full TimeNew York, NY1d ago
-
Machine Learning Operations Engineer USD 162K-210KAWS SageMaker | Amazon Web Services | CI/CD | Data labeling | DevOpsDental insurance | Equity | Health insurance | Professional development | Vision insuranceMid-level Full TimeBoston, Massachusetts1d ago
-
Sr. Java Full Stack Developer USD 103K-173KAPI Design | AWS | Ansible | Automated testing | BenchmarkingSenior-level Full TimeDallas, Texas, United States2d ago
-
LLM Post-Training Engineer, Research & Product USD 212K-389KData Pipelines | Deep learning | Distributed Training | Human preference learning | Instruction TuningSenior-level Full TimeSan Jose, California, United States2d ago
-
Senior-level Full TimeSan Francisco, US2d ago
-
Mid-level Full TimeChattanooga, TN, United States R2d ago
-
Partner 20, Applied ML, Engineer, ASG USD 362K-422KAirflow | CI/CD | Data Engineering | Docker | Feature EngineeringMid-level Full TimeSan Francisco, California, United States2d ago
-
Senior Machine Learning Engineer USD 218K-273KA/B | A/B Testing | B testing | CatBoost | Embeddings401k | Employee assistance program | Equity compensation | Flexible PTO | HSA/FSASenior-level Full TimeNew York, New York, United States; …2d ago
-
Staff Machine Learning SWE Infra USD 238K-302KAutoregressive models | Cloud Computing | Data Engineering | Distributed Training | Gradient ShardingCompany benefits program | Discretionary annual bonus | Equity incentive planSenior-level Full TimeMountain View, CA, USA2d ago
-
Senior Developer, Data & IT - AI Solutions USD 120K-142KAI Agents | API Integration | AWS | AWS Bedrock | AWS SageMakerDental insurance | Dependent Care Account | Health insurance | Health savings account | Mental health counseling supportSenior-level Full TimeNew York, NY, United States2d ago
-
AI Research Engineer USD 190K-280KAgentic AI | Clinical data | Data Pipelines | Data integration | Deep learningDiversity and inclusion initiatives | Flexible work environment | Friendly work environment | Professional developmentMid-level Full TimeSeattle, Washington, United States; South San …2d ago
-
Member of Technical Staff - Imagine Model USD 180K-440KAudio Processing | C++ | Computer Vision | Data Annotation | Data Augmentation401k | Dental insurance | Disability insurance | Employee discounts | Health insuranceSenior-level Full TimePalo Alto, CA; Seattle, WA2d ago
-
Lead Software Engineer - Full Stack-Generative AI USD 184K-215KAWS | CI/CD | CrewAI | DevOps | GitHub ActionsSenior-level Full TimePlano, TX, United States2d ago
-
Generative AI Inference Engineer USD 152K-287KAWS | CUDA | Cloud platform | Diffusion Models | DockerSenior-level Full TimeUnited States2d ago
-
Senior MLOps Platform Engineer {S} USD 120K-185KAWS EKS | Airflow | Amazon S3 | Argo CD | Batching401k match | Dental insurance | Employee assistance program | HSA contributions | Health insuranceSenior-level Full TimeColorado Springs, Colorado, United States R3d ago
-
Risk Management - Data Scientist Associate USD 173K-210KAgile | Auto-GPT | Data Pipelines | Deep learning | Drift monitoringBackup childcare | Financial coaching | Health care coverage | Mental health support | Onsite health and wellness centersMid-level Full TimeJersey City, NJ, United States3d ago
-
Senior Generative AI Engineer USD 125K-188KAI Safety | AI Search | AWS Bedrock | Amazon SageMaker | Amazon Web ServicesSenior-level Full TimeRidgefield Park, NJ, United States3d ago
-
Senior Staff Engineer - Data Scientist USD 150K-190KAmazon Web Services | Anomaly Detection | Apache Airflow | Apache Kafka | Apache SparkContract-to-hire | Remote work | Travel opportunitiesSenior-level Full TimeRemote, REMOTE, United States R3d ago
-
Software Engineer III - Machine Learning Platform USD 172K-210KAI Platform | AWS SageMaker | Airflow | Azure Machine Learning | CI/CDSenior-level Full TimeNew York, NY, United States3d ago