Performance Engineer, GPU
San Francisco, CA | New York City, NY | Seattle, WA
USD 280K-850K | Senior-level | Full Time
Tasks
- Architect GPU performance systems
- Build distributed GPU communication strategies
- Develop custom GPU kernels
- Develop performance modeling frameworks
- Implement GPU utilization optimizations
- Implement kernel fusion strategies
- Improve end-to-end training and inference efficiency
- Optimize tensor core performance
- Partner with hardware vendors
- Profile production ML performance bottlenecks
Perks/Benefits
- Flexible working hours
- Generous vacation
- Hybrid work 25 percent
- Optional equity donation matching
- Parental leave
- Visa sponsorship support
Skills/Tech-stack
Bandwidth Optimization | CUDA | Cluster Orchestration | Collective communication | Custom Operators | CUTLASS | FP8 Quantization | Fault Tolerance | Flash Attention | Int8 Quantization | JAX | Kernel Fusion | Memory bandwidth | Memory bandwidth optimization | Mixed Precision | Model Parallelism | NCCL | NVLink | Nsight | PyTorch | Tensor Core | Tensor core optimization | torch.compile | Triton | XLA