Senior Engineer 2: Inference Optimizations
Tasks
- Advise on GPU hardware and software ecosystem
- Collaborate with product teams to develop new features
- Conduct code reviews and mentor team members
- Engage with open-source AI community
- Engineer solutions for GPU performance bottlenecks
- Implement advanced model and kernel optimizations
- Lead performance optimization for inference engines
Perks/Benefits
- Conferences and training reimbursement
- Employee assistance program
- Equity compensation
- Flexible time off
- Professional development support
- Remote work
- Stock purchase program
Skills/Tech-stack
AI infrastructure | AI model families | CUDA | Deep learning | GPU Kernel Development | GPU Programming | Hardware Architecture | High-Performance Computing | Memory Management | Model Optimization | OpenAI Triton | Parallelization | PyTorch | ROCm | TensorFlow | TensorRT