Senior Engineer 2: Inference Optimizations
Tasks
- Advise on GPU hardware and software ecosystem
- Collaborate with product teams to develop new features
- Conduct code reviews and mentor team members
- Engage with open-source AI community
- Engineer solutions for GPU performance bottlenecks
- Implement advanced model and kernel optimizations
- Lead performance optimization for inference engines
Perks/Benefits
- Conferences and training reimbursement
- Employee assistance program
- Equity compensation
- Flexible time off
- Professional development support
- Remote work
- Stock purchase program
Skills/Tech-stack
AI infrastructure | AI model | AI model families | CUDA | Deep learning | GPU Kernel Development | GPU Programming | Hardware Architecture | High Performance | High-Performance Computing | Kernel development | Memory Management | Model Optimization | Model families | OpenAI Triton | Parallelization | Performance Computing | PyTorch | ROCm | TensorFlow | TensorRT
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Principal Machine Learning Researcher (Physical AI) USD 200K-400KAnomaly Detection | C++ | Computational Geometry | Computer Vision | Deep learning401k | Casual dress | Company holidays | Employer paid Medical Dental Vision Insurance | Flexible work hoursSenior-level Full TimeLos Angeles, CA (On-site) R23h ago
-
Senior Machine Learning Engineer - Camera Model USD 177K-212K3D Perception | BEV | CNN | Camera Calibration | Computer Vision100 percent paid medical dental and vision premiums | 401k employer match | Accidental death and dismemberment insurance | Company paid holiday office closures | Flexible scheduleSenior-level Full TimeRemote - U.S, Ann Arbor, MI R1d ago
-
Sr. Staff Embedded AI Engineer USD 145K-185KBare Metal | C# | C++ | CMSIS NN | Code generationEmployee resource groups | Flexible work environment | Hybrid work model | Remote work optionSenior-level Full TimeColumbia, MARYLAND, United States R1d ago
-
Senior Software Engineer, LLM Performance USD 180K-339KC++ | CUDA | Cutlass | FlashAttention | FlashInferSenior-level Full TimeSF Bay Area (Hybrid) R1d ago
-
Machine Learning Engineer, AI Search - USDS USD 145K-250KComputer Vision | Deep learning | Generative AI | Language Models | Language ProcessingMid-level Full TimeSan Jose, California, United States R1d ago
-
Senior-level Full TimeSan Jose, United States R1d ago
-
Sr Machine Learning Engineer 5 -- AEP, Agentic System USD 172K-306KAgent Orchestration | Automated retraining | CI/CD | Context engineering | HuggingfaceSenior-level Full TimeSan Jose, United States R1d ago
-
Solutions Architect, Physical AI and Robotics USD 152K-241KBenchmarking | C++ | CUDA | Cosmos | Digital TwinsBenefits | EquitySenior-level Full TimeUS, CA, Remote, United States R1d ago
-
Senior Machine Learning Engineer USD 155K-220KDeployment Monitoring | Drift Detection | Error Analysis | Experiment tracking | Feature EngineeringCommuter benefits | Competitive vacation policy | Dental insurance | Flexible working hours | Health insuranceSenior-level Full TimeUS - Remote, Canada - Remote R2d ago
-
Senior Manager, Computer Vision USD 178K-319KCloud Computing | Computer Vision | Deep learning | Edge Computing | ExperimentationHealth insurance | Parental leave | Professional development stipend | Remote work | Travel up to 5 percentSenior-level Full TimeRemote - US R2d ago
-
Agent systems | Behavior Cloning | Data Ingestion | Debugging | Distributed Training401k match | Disability insurance | Hybrid work | Life insurance | Paid holidays office closuresSenior-level Full TimeRemote - U.S, Ann Arbor, MI R2d ago
-
Data Engineer USD 112K-160KAWS EMR | Anomaly Detection | Apache Spark | Batch Processing | Data LineageSenior-level Full TimeMcLean, Virginia, United States - Remote R2d ago
-
4109- Applied AI & Analytics Consultant USD 155K-210KAPI | AWS | Azure | Classification | DBTBasic life insurance | Dental insurance | Discounted Legal Aid | Long-term disability | Medical insuranceSenior-level Full TimeUnited States - Remote R2d ago
-
AWS | Airflow | Azure | Batch inference | C#Remote work eligibilitySenior-level Full TimeMcLean, VA, United States R2d ago
-
AI Engineer USD 85K-95KAWS | Azure | Data Analysis | Data Governance | Data Security401k retirement plan | Commuter benefits | Dental insurance | Employee assistance program | Family support programSenior-level Full TimeUS - Morrisville, NC - 3900 … R2d ago
-
Senior Systems Engineer, UDS Data Management - East USD 204K-265KAWS | Airflow | Amazon Redshift | Apache Kafka | Apache SparkSenior-level Full TimeRemote - Massachusetts, United States, United … R2d ago
-
AI Data Specialist USD 75K-158KAgile | Automated Scheduling | Automated reporting | Compliance tracking | Data PrivacyContinuing education | Flexible time off | Healthcare | Learning resources | Retirement planMid-level Full Time999 REMOTE, United States R2d ago
-
Engineer III, Machine Learning Software USD 226K-240KAlgorithms | Apache Spark | Automl | Data Structures | Deep learningPartial telecommutingSenior-level Full Time645 Clyde Avenue, Mountain View, CA, … R2d ago
-
AWS | Agentic AI | Autogen | Azure | CI/CDAccess to latest AI tooling | Collaborative low bureaucracy environment | Encouragement to experiment and innovate | Remote within Atlanta areaSenior-level ContractAtlanta, GA R3d ago
-
Senior AI Systems Engineer USD 122K-188KAlerting | Bash | CI/CD | CMMC | Cause analysisFully remote option | Hybrid option | Onsite optionSenior-level Full TimeRaleigh, North Carolina, United States; Albuquerque, … R5d ago
-
Senior ML Engineer USD 180K-200KBlack-box | Black-box optimization | C plus plus | CUDA C | CUDA C plus plusDental insurance | Health insurance | Offsites | Paid parental leave | Regular team eventsSenior-level Full TimeRemote - US R5d ago
-
Sr. Machine Learning Engineer USD 175K-230KAmazon Web Services | Apache Spark | C plus plus | Computer Vision | Data Preprocessing401k plan | Cell phone reimbursement | Dental insurance | Flexible paid time off | Health Savings Account contributionSenior-level Full TimeRemote - United States R5d ago
-
Senior-level Full TimeRemote, United States R5d ago
-
AI Governance | AWS | Amazon EC2 | Amazon S3 | Amazon SageMaker401k matching | Bonus eligibility | Commuting subsidy | Educational assistance | Equity awards (eligibility)Senior-level Full Time5000 - Vertex US - Fan … R5d ago
-
Decision Intelligence Engineer - Next Best Action USD 129K-177KA3C | Backtesting | Bellman Equation | Conservative Q Learning | Constraint Mapping401k retirement savings plan | Medical, dental, and vision benefits | Occasional travel | Remote work | Time offSenior-level Full TimeRemote US, United States R5d ago