Member of Technical Staff, Performance Optimization
Tasks
- Analyze and improve latency throughput memory usage and compute efficiency
- Build performance benchmarking and monitoring infrastructure
- Collaborate with ML researchers on hardware efficient model design
- Evaluate and integrate optimizations for hardware accelerators and specialized runtimes
- Implement low level optimizations with CUDA and Triton
- Improve execution speed and resource utilization
- Improve mixed precision quantization and model graph optimization
- Optimize system and GPU performance for AI workloads
- Profile GPU and kernel bottlenecks
- Scale inference and training across multi GPU multi node environments
Perks/Benefits
- N/A
Skills/Tech-stack
CUDA | CUPTI | Distributed Systems | GPU Profiling | Infiniband | Kernel optimization | Kubernetes | Mixed Precision | Model Optimization | NVProf | Nsight | Parallel Programming | PyTorch | Quantization | RDMA | ROCm | RoCE | Torch compile | Triton | XLA
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Java Full Stack Developer-Software Engineer II USD 93K-155KAPI Design | AWS | Ansible | Artificial Intelligence | BenchmarkingMid-level Full TimeDallas, Texas, United States4h ago
-
C++ | Cloud Native | Container Orchestration | Deep learning | Distributed SystemsCareer growth | Open Source contribution | World Class CollaborationEntry-level Full TimeSan Jose, California, United States4h ago
-
Research Engineer, Robotics USD 184K-356KC++ | CUDA | Computer Graphics | GPU Architectures | GPU KernelsSenior-level Full TimeRedmond, WA5h ago
-
Partner Engineer, Generative AI USD 159K-223KAWS | Agent Orchestration | Azure | Bias Mitigation | C++Senior-level Full TimeMenlo Park, CA5h ago
-
Staff Research Engineer, MRS AI USD 146K-208KA/B | A/B Testing | Alignment techniques | B testing | BenchmarkingSenior-level Full TimeBellevue, WA5h ago
-
Senior Software Developer, Computer Vision, XR USD 100K-253KAr | Augmented Reality | C++ | Computer Vision | Data ProcessingSenior-level Full TimeSan Jose, CA, USA; Waterloo, ON, …5h ago
-
Research Engineer, Pretraining, DeepMind USD 174K-253KFine Tuning | Inference Optimization | JAX | Language Models | Large Language ModelsMid-level Full TimeNew York, NY, USA5h ago
-
Staff Datacloud Blackbelt Engineer, Data and AI USD 183K-266KAI/ML | AI/ML workflows | BigQuery | Cloud Architecture | Computer VisionSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA5h ago
-
Senior Staff Software Engineer, AI/ML, Google Cloud USD 262K-365KAlgorithms | Data Processing | Data Structures | Debugging | Distributed SystemsSenior-level Full TimeSeattle, WA, USA5h ago
-
Senior Software Engineer, AI/ML, Google Cloud Platforms USD 174K-253KC++ | Code Reviews | Data Processing | Data Structures | Data structures algorithmsSenior-level Full TimeKirkland, WA, USA5h ago
-
Staff Software Engineer, Infrastructure, Google Cloud AI USD 207K-301KCompute Technologies | Cross-Functional Collaboration | Cross-functional | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeKirkland, WA, USA; Sunnyvale, CA, USA5h ago
-
Senior Software Engineer, AI/ML GenAI, Google Cloud USD 174K-253KAlgorithms | C++ | Computer Vision | Data Processing | Data StructuresSenior-level Full TimeSunnyvale, CA, USA5h ago
-
C++ | Computer Vision | Data Processing | Debugging | Image classificationSenior-level Full TimeSan Diego, CA, USA5h ago
-
Software Engineer III, Core ML Performance USD 147K-211KAuto-tuning | Automation | Benchmarking | C++ | CUDASenior-level Full TimeSunnyvale, CA, USA5h ago
-
Technical Lead, Storage Distributed and Sovereign Cloud USD 207K-301KAI/ML | AI/ML Workloads | Access Control | Automated remediation | Block StorageSenior-level Full TimeRaleigh, NC, USA; Durham, NC, USA5h ago
-
Agent Construction | Agent Orchestration | Air Gapped Computing | Air-gapped | Data IngestionBonus | Equity | Security clearance travel availabilitySenior-level Full TimeWashington D.C., DC, USA; Maryland, USA5h ago
-
Senior Data Scientist, Machine Learning USD 194K-218KAWS | Active Learning | Airflow | Amazon Redshift | Automated Labeling100% TelecommutingSenior-level Full TimeRedwood City, CA R15h ago
-
Principal Scientist, Machine Learning - Biomolecules USD 208K-286KAWS Batch | AWS ECS | AWS EKS | AWS S3 | AWS SageMakerAnnual incentive program | Healthcare coverage | Retirement benefitsSenior-level Full TimeCambridge, MA USA15h ago
-
Mid-level Full TimeSan Francisco16h ago
-
Data Engineer Data Pipelines and ETL USD 99K-147KAnomaly Detection | Apache Airflow | CDC | Cloud Composer | Data Governance401k plan | Disability benefits | Life insurance | Life insurance coverage | Medical/Dental/VisionMid-level Full TimeBurbank, CA, US, 9150517h ago
-
Sr. Software Engineer, Data Streaming Systems USD 130K-195KAutoscaling | Blocking I/O | CI/CD | Concurrency | Distributed Systems401k plan | Dental insurance | Disability benefits | Life insurance | Medical insuranceSenior-level Full TimeBurbank, CA, US, 9150517h ago
-
Senior-level Full Time245 Summer St, Boston MA, United …17h ago
-
Machine Learning Engineer USD 140K-190KApache Flink | Apache Kafka | Apache Spark | Bigtable | CI/CDMid-level Full TimeRemote - USA R18h ago
-
Senior Data Engineer III USD 183K-205KAWS EMR | AWS S3 | Access Control | Amazon Redshift | Apache AirflowSenior-level Full TimeUnited States18h ago
-
Senior Embedded Software Engineer - Future Forward USD 153K-201KAgile | Authentication | Board Bring-up | Bring-up | C#Senior-level Full TimeSunnyvale, CA, United States R18h ago