Senior Engineer 2: Inference Optimizations
Tasks
- Advise on the GPU hardware and software ecosystem
- Collaborate with product teams to develop new features
- Conduct code reviews and mentor team members
- Engage with the open-source AI community
- Engineer solutions for GPU performance bottlenecks
- Implement advanced model and kernel optimizations
- Lead performance optimization for inference engines
Perks/Benefits
- Conferences and training reimbursement
- Employee assistance program
- Equity compensation
- Flexible time off
- Professional development support
- Remote work
- Stock purchase program
Skills/Tech-stack
AI infrastructure | AI model families | CUDA | Deep learning | GPU Kernel Development | GPU Programming | Hardware Architecture | High-Performance Computing | Memory Management | Model Optimization | OpenAI Triton | Parallelization | PyTorch | ROCm | TensorFlow | TensorRT