Senior Engineer 2: Inference Optimizations
Tasks
- Advise on hardware procurement and integration
- Collaborate with product teams on feature development
- Contribute to open source AI communities
- Engineer solutions for GPU kernel performance
- Implement cutting-edge AI inference techniques
- Lead performance optimization for inference engine
- Mentor team through code reviews
Perks/Benefits
- Employee assistance program
- Flexible time off
- Health benefits
- Learning and training budget
- Professional development resources
- Remote work
- Stock options
Skills/Tech-stack
AI infrastructure | BF16 | Bandwidth Optimization | Batch size optimization | CUDA | FP8 | GPU Programming | High Performance | High-Performance Computing | Kernel Fusion | Memory bandwidth | Memory bandwidth optimization | Model Inference | OpenAI Triton | Parallelization | Performance Computing | ROCm | Size optimization | TensorRT | Transformers
Education
Roles
Related jobs
-
Senior Engineer 2: Inference Optimizations USD 167K-209KAI infrastructure | CUDA | Deep learning | GPU Kernels | GPU ProgrammingBonuses | Career development support | Conference reimbursement | Employee assistance program | Flexible time offSenior-level Full TimeDenver R17h ago
-
Senior Engineer 2: Inference Optimizations USD 167K-209KAI infrastructure | AI model | AI model families | CUDA | Deep learningConferences and training reimbursement | Employee assistance program | Equity compensation | Flexible time off | Professional development supportSenior-level Full TimeBoston R17h ago
-
Senior Engineer 2: Inference Optimizations USD 167K-209KAI Inference | AI infrastructure | CUDA | GPU Architecture | GPU kernel tuningCareer development resources | Employee assistance program | Equity compensation | Flexible time off | Remote workSenior-level Full TimeAustin R17h ago
-
Senior Engineer 2: Inference Optimizations USD 167K-209KAI Model Optimization | AI model | C# | CUDA | GPU ArchitectureCareer development resources | Competitive benefits | Equity compensation | Flexible remote work | Reimbursement for trainingSenior-level Full TimeSan Francisco R17h ago
-
Senior Applied AI Solutions Engineer USD 195K-255KCUDA | Deep learning | Distributed Training | Fine Tuning | HuggingfaceBenefits package | Competitive salary | Flexible working | Innovative environment | Professional growthSenior-level Full TimeAmsterdam, Netherlands; Remote - Europe; Remote … R1d ago
-
Research Scientist / Engineer – Training Infrastructure USD 200K-300KCUDA | Containerization | Distributed Systems | GPU clusters | LinuxSenior-level Full TimePalo Alto, CA, Remote - International, … R2d ago
-
AI Engineer (Remote) USD 200K-250KAI frameworks | Agent systems | Cloud infrastructure | Context engineering | LLM APIs401k plan | Dental insurance | Disability insurance | Flexible spending account | Health insuranceExecutive-level Full TimeFort Washington, PA, United States R4d ago
-
Research Software Engineer, AI/ML USD 110K-130KCI/CD | Containerization | GPU Computing | Git | Inference frameworksProfessional development opportunities | University shared governanceMid-level Full TimeBlacksburg, Virginia, Hybrid R4d ago
-
AI infrastructure | APIs | Distributed Systems | Embedding pipelines | Language ModelsCollaborative hybrid work environment | Health, dental, vision coverage | Impactful work on real-world AI systems | Relocation supportMid-level Full TimeSan Francisco, CA; Hybrid R5d ago
-
Applied AI Engineer - Federal (TS Required) USD 160K-250KAirflow | Chroma | CrewAI | Data Generation | Deep learningCareer development | Health insurance | Paid time off | Work in federal security environmentsMid-level Full TimeUnited States (Remote); Washington, D.C. (Remote) R5d ago
-
Applied AI Engineer - AI Solutions USD 172K-300KAPI Development | Chroma | CrewAI | Data Processing | Deep learningCareer growth opportunities | Equity | Flexible work options | Learning and development opportunitiesMid-level Full TimeNew York City, NY (Hybrid); Redwood … R5d ago
-
Senior Software Engineer, AI Inference USD 133K-220KAI infrastructure | Ansible | C++ | CI/CD | CUDADental coverage | Employee assistance program | Flexible work arrangements | Medical coverage | Paid time offSenior-level Full TimeBoston, United States R7d ago
-
Machine Learning Engineer, Senior Manager USD 184K-270KChain-of-Thoughts | DAGs | Data Science | Databricks | Deep learning401k match | Adoption Assistance | Flexible work arrangements | Medical/Dental/Vision | Parental leaveSenior-level Full TimeSilver Triangle Building, United States R7d ago
-
ML Systems Engineer, ML Acceleration USD 144K-160KCUDA | Distributed Training | Machine Learning | Profiling tools | PyTorch401k plan | Dental insurance | Hybrid work | Medical insurance | Vision insuranceSenior-level Full TimeRemote U.S. R7d ago
-
ML Systems Engineer, ML Acceleration USD 144K-160KCUDA | Distributed Systems | Machine Learning | PyTorch | Python401k | Dental insurance | Health savings account | Life insurance | Medical insuranceSenior-level Full TimePittsburgh, Pennsylvania, United States R7d ago
-
ML Systems Engineer, ML Acceleration USD 144K-160KCUDA | Distributed Training | Profiling tools | PyTorch | Python401k | Dental | Health savings account | Life insurance | MedicalSenior-level Full TimeBoston, Massachusetts, United States R7d ago
-
Sr. Engineer - Applied AI USD 155K-180KAWS | Azure | CI/CD | Data Pipelines | Docker401k plan | Dental | Disability insurance | Holidays | Life insuranceSenior-level Full TimeRemote (United States) R8d ago
-
Machine Learning Engineer USD 80K-110KAI Deployment | API Development | Automated testing | Data Analysis | Data ProcessingFlexible work schedule | Health benefits | Remote work opportunitiesMid-level Full TimeDenver, Colorado, United States; Remote R8d ago
-
Machine Learning Engineer USD 185K-303KAirflow | BigQuery | Convolutional Neural Networks | Data Processing | Distributed Systems401k with employer match | Family planning support | Flexible vacation | Gender-affirming care | Healthcare benefitsMid-level Full TimeRemote - United States R8d ago
-
Senior Applied Researcher AI/ML (US) USD 124K-138KAWS | Azure | Data Engineering | Google Cloud | JavaAdoption support | Employee assistance program | Fertility support | Flexible paid time off | Parental leaveSenior-level Full TimeRemote - US R8d ago
-
Technical Architect - Machine Learning USD 175K-265KAPI Gateway | AWS | Airflow | Amazon Bedrock | Deep learningCareer growth opportunities | Exposure to AI and cloud technologies | Innovative tech environment | Remote workSenior-level Full TimeUSA - Remote, United States R11d ago
-
Machine Learning Engineer, Images USD 200K-265KCUDA | Cloud Platforms | Deep learning | Diffusers | Diffusion TransformersFree lunch and snacks | Lifestyle spending | Medical membership | Medical/Dental/Vision insurance | Paid time offMid-level Full TimeBay Area or Remote R13d ago
-
Principal Machine Learning Engineer USD 155K-170KA/B | A/B Testing | B testing | Big Data | Data EngineeringHealth insurance | Paid time off | Professional development | Remote workSenior-level Full TimeBoston, MA, United States R14d ago
-
AI Automation Engineer _ Virtual USD 160K-190KAI Model Evaluation | AI infrastructure | Bias Mitigation | CI/CD | Data LineageContinuing education | Flexible work arrangements | Health coverage | Retirement plans | Wellbeing programsMid-level Full TimeUS-IL-Illinois-Virtual, United States R14d ago
-
ML Engineer II, Navigation USD 140K-210KDecision Making | Imitation Learning | PyTorch | Python | ROSMid-level Full TimeAnywhere in the US R14d ago