Senior Engineer 2: Inference Optimizations
Tasks
- Advise on hardware procurement and integration
- Collaborate with product teams on feature development
- Contribute to open source AI communities
- Engineer solutions for GPU kernel performance
- Implement cutting-edge AI inference techniques
- Lead performance optimization for inference engine
- Mentor team through code reviews
Perks/Benefits
- Employee assistance program
- Flexible time off
- Health benefits
- Learning and training budget
- Professional development resources
- Remote work
- Stock options
Skills/Tech-stack
AI infrastructure | BF16 | Bandwidth Optimization | Batch size optimization | CUDA | FP8 | GPU Programming | High Performance | High-Performance Computing | Kernel Fusion | Memory bandwidth | Memory bandwidth optimization | Model Inference | OpenAI Triton | Parallelization | Performance Computing | ROCm | Size optimization | TensorRT | Transformers
Education
Roles
Related jobs
-
Senior Machine Learning Engineer - Camera Model USD 177K-212K3D Perception | BEV | CNN | Camera Calibration | Computer Vision100 percent paid medical dental and vision premiums | 401k employer match | Accidental death and dismemberment insurance | Company paid holiday office closures | Flexible scheduleSenior-level Full TimeRemote - U.S, Ann Arbor, MI R1d ago
-
Senior Software Engineer, LLM Performance USD 180K-339KC++ | CUDA | Cutlass | FlashAttention | FlashInferSenior-level Full TimeSF Bay Area (Hybrid) R1d ago
-
Solutions Architect, Physical AI and Robotics USD 152K-241KBenchmarking | C++ | CUDA | Cosmos | Digital TwinsBenefits | EquitySenior-level Full TimeUS, CA, Remote, United States R1d ago
-
Agent systems | Behavior Cloning | Data Ingestion | Debugging | Distributed Training401k match | Disability insurance | Hybrid work | Life insurance | Paid holidays office closuresSenior-level Full TimeRemote - U.S, Ann Arbor, MI R2d ago
-
Data Engineer USD 112K-160KAWS EMR | Anomaly Detection | Apache Spark | Batch Processing | Data LineageSenior-level Full TimeMcLean, Virginia, United States - Remote R2d ago
-
Senior Machine Learning Engineer USD 175K-230KAWS Bedrock | AWS SageMaker | Data Engineering | Experimentation | Language ModelsCase by case accommodation for medical beliefs | Case by case accommodation for religious beliefs | Remote work flexibilitySenior-level Full TimeUnited States R3d ago
-
Senior AI Systems Engineer USD 122K-188KAlerting | Bash | CI/CD | CMMC | Cause analysisFully remote option | Hybrid option | Onsite optionSenior-level Full TimeRaleigh, North Carolina, United States; Albuquerque, … R5d ago
-
A/B | A/B Testing | Anomaly Detection | Assortment optimization | Azure MLBackground check | Remote workSenior-level Full TimeRemote - US, United States R5d ago
-
Sales Engineer - US Fed USD 132K-228KArtificial Intelligence | File systems | GPU clusters | High Performance | High-Performance ComputingSenior-level Full TimeRemote, United States R6d ago
-
Sales Engineer - East USD 132K-228KAI | Cloud Platforms | Data Architecture | File systems | GPU clustersSenior-level Full TimeRemote, United States R6d ago
-
Sales Engineer - FSI USD 132K-228KAI | Cloud Platforms | Data Protocols | File systems | GPU clustersSenior-level Full TimeRemote, United States R6d ago
-
Sales Engineer - West USD 132K-228KAI infrastructure | Data Security | File systems | GPU Computing | HPCSenior-level Full TimeRemote, United States R6d ago
-
Senior Forward Deployed Engineer (AI/ML) USD 140K-174KBenchmarking | CUDA | Continuous batching | CrewAI | DatabasesConference reimbursement | Employee assistance program | Flexible time off | LinkedIn Learning access | Local Employee MeetupsSenior-level Full TimeSeattle R7d ago
-
Senior Forward Deployed Engineer (AI/ML) USD 140K-174KArtificial Intelligence | CUDA | Continuous batching | CrewAI | DatabasesConference reimbursement | Employee assistance program | Flexible time off | LinkedIn Learning access | Remote workSenior-level Full TimeSan Francisco R7d ago
-
Staff Machine Learning Engineer, GenAI Platform USD 253K-354KCUDA | DeepSpeed | Distributed Systems | Docker | FSDP401k employer match | Family planning support | Flexible vacation | Gender-affirming care | Healthcare benefitsSenior-level Full TimeRemote - United States R7d ago
-
AWQ | C++ | CRD | CUDA | DockerCareer development | Employee resource groups | Flexible work from home | Generous paid time off | Volunteer timeSenior-level Full TimeUS-Texas-Austin, United States R7d ago
-
Principal Machine Learning Engineer USD 220K-300KCUDA | Continuous Learning | Data Preparation | Drift monitoring | Embedding401k | Employee assistance program | Employee stock purchase plan | Health savings account | Medical/Dental/Vision insuranceSenior-level Full TimeUnited States | Remote R8d ago
-
Alerting | BMC | Bash | Data center | Data center networkingSenior-level Full TimeUS, TX, Remote, United States R9d ago
-
Sr. Embedded & Compute Software Developer USD 130K-160KC# | C++ | CUDA | DO-178 | Debugging401k matching | Dental insurance | Employee assistance program | Health insurance | Paid HolidaysSenior-level Full TimeRemote (United States); Canada R12d ago
-
ML Engineer - Inference USD 151K-270KCUDNN | Edge Computing | Model Compression | Model Evaluation | Model Monitoring401k matching | Continued Education | Health coverage | Life and disability coverage | Paid time offMid-level Full TimeUnited States R12d ago
-
Engineering Manager, Machine Learning (Caper) USD 201K-253KBigQuery | CI/CD | Computer Vision | Deep learning | DockerRemote workSenior-level Full TimeUnited States - Remote R12d ago
-
Engineer - HPC Platform USD 129K-243KAltair Grid Engine | Ansible | Apptainer | Automation | BashCareer advancement path | Professional development opportunitiesMid-level Full TimeUnited States - Remote R13d ago
-
AI Engineer - Responsible AI USD 150K-160KAI interaction | AWS | Adversarial ML | Airflow | ArgoCDMentorship | Remote work optionMid-level Full TimeRemote Work( USA), United States R13d ago
-
Sr Machine Learning Engineer USD 169K-243KETL | Knowledge graphs | LLM APIs | Langchain | Language ModelsHealthcare coverage | Hybrid work model | Mental health resources | Paid time offSenior-level Full TimeUSA - California - San Jose … R13d ago
-
Staff Machine Learning Engineer USD 230K-322KBias Mitigation | Convolutional Neural Networks | Data Pipelines | Feature Engineering | Fine Tuning401k employer match | Caregiving support | Comprehensive healthcare benefits | Family planning support | Flexible vacationSenior-level Full TimeRemote - United States R15d ago