Senior High-Performance LLM Training Engineer
US, CA, Santa Clara, United States
USD 184K-356K Senior-level Full Time
Tasks
- Analyze and profile LLM training workloads
- Automate workload analysis and optimization
- Build and support MLPerf Training submissions
- Collaborate on GPU and hardware performance roadmap
- Implement DL training workloads in processor and system simulators
- Implement production software across deep learning stack
- Optimize AI training performance on GPU systems
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | CUDA | Computer Architecture | Deep learning | GPU Architecture | GPU Profiling | JAX | Machine Learning | Machine Learning Benchmarking | Neural Networks | Performance Analysis | Performance optimization | PyTorch | Python
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Regions
Countries
States
Cities
Related jobs
-
Robotics Platform Security Engineer USD 90K-300KAppArmor | Auditd | C# | C++ | CIS BenchmarksHybrid work option | On-site collaboration | Remote work optionSenior-level Full TimeIrvine, CA3h ago
-
C++/CUDA Systems Engineer – Surgical Robotics Platform USD 140K-160KC++ | C++17 | C++20 | CPU GPU Scheduling | CUDAEquity | Health insurance | Paid time off | Performance bonusMid-level Full TimeLos Angeles, California9h ago
-
AWS Batch | AWS EC2 | AWS IAM | AWS Lambda | AWS S3Annual bonus | Company paid benefits | Equity | Paid time offSenior-level Full TimeLos Angeles, California9h ago
-
Staff Applied AI Engineer, Enterprise GenAI USD 216K-270KAWS | Cloud platform | Data Analysis | Generative AI | Google CloudCommuter stipend | Equity compensation | Health, dental, vision insurance | Learning and development stipend | Paid time offSenior-level Full TimeSan Francisco, CA; Seattle, WA; New …9h ago
-
AI/ML Engineer USD 130K-223KAgentic AI | Deep learning | Distributed Training | Docker | EmbeddingsMid-level Full TimeScottsdale, AZ10h ago
-
Principal Engineer, Data & ML Platform USD 119K-180KAPIs | Automated testing | Cloud Native | Cloud platform | Continuous DeploymentSenior-level Full TimeScottsdale, AZ10h ago
-
Sr Sales Engineer, West USD 160K-196KAnalytics | Apache Spark | Artificial Intelligence | Dataiku | Kubernetes401k match | Dental insurance | Employer paid disability coverage | Flexible spending accounts | Medical insuranceSenior-level Full TimeUnited States, Remote R10h ago
-
Machine Learning Engineer, Foundation Model USD 129K-247KAuto-regressive models | C plus plus | Deep learning | Diffusion Models | Distributed TrainingSenior-level Full TimeSan Jose10h ago
-
AI Engineer USD 53K-119KAPI Design | Cost Optimization | Embeddings | Evaluation | JSONDental insurance | Gym stipend | Health insurance | Medical membership | Offsite retreatsSenior-level Full TimeRemote, US R11h ago
-
Sr. Data Engineer USD 145K-160KAPI Development | AWS | AWS Glue | AWS Lambda | Amazon AthenaComplimentary club membership | Personal training | Pilates | Shop discounts | SpaßSenior-level Full TimeNew York, NY, United States11h ago
-
Distinguished Software Engineer, Data Infrastructure USD 248K-406KAI | Batch Processing | Data Infrastructure | Data Privacy | Data ProcessingExecutive-level Full TimeMountain View, CA, United States11h ago
-
Senior Machine Learning Engineer, Operations Research USD 173K-218KCombinatorial Optimization | Deep learning | Keras | Machine Learning | Mathematical ProgrammingAnnual refresh grants | Equity grant | Flexible work policy | Remote workSenior-level Full TimeUnited States - Remote R11h ago
-
Machine Learning & Game Tech Architect USD 186K-257KAPI Design | C# | C++ | Game engines | Go401k matching | Dental insurance | Flexible schedule | Flexible working hours | Health insuranceSenior-level Full TimeBoston, MA, United States R11h ago
-
Machine Learning Engineer USD 217K-260KAWS | Cloud platform | Dash | Database Management | Docker401k employer match | Comprehensive healthcare benefits | Family planning support | Flexible vacation | Gender-affirming careSenior-level Full TimeSan Francisco, CA12h ago
-
Senior Software Engineer, Machine Learning USD 216K-303KDjango | GNN | Graph Neural Networks | JavaScript | Keras401k with employer match | Comprehensive healthcare benefits | Family planning support | Flexible vacation | Gender-affirming careSenior-level Full TimeSan Francisco, CA12h ago
-
Machine Learning Engineer USD 260K-303KAmazon Web Services | Apache Beam | Apache Spark | Cassandra | Cloud platform401k employer match | Comprehensive healthcare benefits | Family planning support | Flexible vacation | Full-time Telecommuting OptionMid-level Full TimeNew York City, NY12h ago
-
A/B | A/B Testing | AUC | B testing | CTR401k match | Commuter benefits | Dental insurance | FSA | Flexible work scheduleSenior-level Full TimeSan Francisco, CA12h ago
-
Data/Analytics Engineering Co-op USD 67K-67KApache Airflow | Cloud infrastructure | Dagster | Looker | PowerBIHybrid workEntry-level Full TimeBoston or NYC12h ago
-
Data & Analytics Engineer USD 115K-150KCI/CD | CTE | Clustering | DBT | Data Lineage401k matching | Birthday time off | Cell phone reimbursement | Childcare expense reimbursement | Company-Paid HolidaysMid-level Full TimeUnited States13h ago
-
Staff Data Engineer USD 138K-221KAmazon Kinesis | Amazon Redshift | Amazon Web Services | Apache Airflow | BigQueryHybrid work modelSenior-level Full TimeBoston, MA14h ago
-
Data Engineer (remote) USD 85K-100KAgile | Apache Hive | Apache Impala | Apache Spark | Artificial Intelligence401k match | Employee assistance program | Flexible schedule | Health insurance | Life insuranceMid-level Full TimeWork From Home, United States R15h ago
-
Engineer 2, Embedded Software USD 105K-154KAgile | C# | C++ | Development Lifecycle | Device DriversMid-level Full TimeSan Antonio, TX, United States15h ago
-
Embedded Software Engineer I USD 106K-170KAlgorithms | C# | C++ | Code Quality | Data StructuresDiscretionary paid time off | Emotional & mental wellness support | Fitness programs | Learning and development programs | Medical, dental, vision plansMid-level Full TimeSeattle, Washington, United States R15h ago
-
Senior Forward Deployed AI Engineer (Remote Eligible) USD 227K-245KAWS Bedrock | Agent Framework | Agent Orchestration | Cloud infrastructure | Databricks401k match | Dental insurance | Flexible time off | Health insurance | Life insuranceSenior-level Full Time-REMOTE, USA- R15h ago
-
Quantitative Technologist (C++ Intern) USD 175K-250KAlgorithms | C# | C++ | Data Structures | Distributed SystemsEntry-level InternshipChicago15h ago