Senior Engineer 2: Inference Optimizations
Tasks
- Advise on hardware procurement and software integration
- Collaborate with product teams
- Contribute to open source AI communities
- Engineer solutions for complex performance issues
- Implement cutting-edge optimization techniques
- Lead code and design reviews
- Lead performance benchmarking and optimization
Perks/Benefits
- Career development resources
- Employee assistance program
- Equity compensation
- Flexible time off
- Remote work
- Training and conferences
Skills/Tech-stack
AI Inference | AI infrastructure | CUDA | GPU Architecture | GPU kernel tuning | High Performance | High-Performance Computing | Kernel tuning | Memory Management | Model Optimization | OpenAI Triton | Parallelization | Performance Computing | ROCm | System design | TensorRT
Education
Related jobs
-
Senior Forward Deployed AI Engineer USD 106K-180KAWS | Automation | CI/CD | Distributed Systems | EmbeddingsBenefits | Bonus eligibility | Remote work optionSenior-level Full TimeUnited States - Remote R23h ago
-
Senior Software Engineer, LLM Performance USD 180K-339KC++ | CUDA | Cutlass | FlashAttention | FlashInferSenior-level Full TimeSF Bay Area (Hybrid) R1d ago
-
Solutions Architect, Physical AI and Robotics USD 152K-241KBenchmarking | C++ | CUDA | Cosmos | Digital TwinsBenefits | EquitySenior-level Full TimeUS, CA, Remote, United States R1d ago
-
Data Engineer USD 112K-160KAWS EMR | Anomaly Detection | Apache Spark | Batch Processing | Data LineageSenior-level Full TimeMcLean, Virginia, United States - Remote R2d ago
-
Associate Director, AI Engineer USD 94K-152KAPI Architecture | Autogen | Azure | CI/CD | Container Orchestration401k | Flexible paid time off | Life insurance | Long-term disability | Medical/Dental/Vision insuranceMid-level Full TimeUSA - Remote - Maryland, United … R2d ago
-
Senior AI Systems Engineer USD 122K-188KAlerting | Bash | CI/CD | CMMC | Cause analysisFully remote option | Hybrid option | Onsite optionSenior-level Full TimeRaleigh, North Carolina, United States; Albuquerque, … R5d ago
-
Full Stack AI Engineer (Staff level) USD 160K-226KAWS | Agent Orchestration | Agentic Workflows | Context engineering | Distributed SystemsSenior-level Full TimeUS Remote R6d ago
-
Sales Engineer - US Fed USD 132K-228KArtificial Intelligence | File systems | GPU clusters | High Performance | High-Performance ComputingSenior-level Full TimeRemote, United States R6d ago
-
Sales Engineer - East USD 132K-228KAI | Cloud Platforms | Data Architecture | File systems | GPU clustersSenior-level Full TimeRemote, United States R6d ago
-
Sales Engineer - FSI USD 132K-228KAI | Cloud Platforms | Data Protocols | File systems | GPU clustersSenior-level Full TimeRemote, United States R6d ago
-
Sales Engineer - West USD 132K-228KAI infrastructure | Data Security | File systems | GPU Computing | HPCSenior-level Full TimeRemote, United States R6d ago
-
Senior Forward Deployed Engineer (AI/ML) USD 140K-174KBenchmarking | CUDA | Continuous batching | CrewAI | DatabasesConference reimbursement | Employee assistance program | Flexible time off | LinkedIn Learning access | Local Employee MeetupsSenior-level Full TimeSeattle R7d ago
-
Senior Forward Deployed Engineer (AI/ML) USD 140K-174KArtificial Intelligence | CUDA | Continuous batching | CrewAI | DatabasesConference reimbursement | Employee assistance program | Flexible time off | LinkedIn Learning access | Remote workSenior-level Full TimeSan Francisco R7d ago
-
Staff Machine Learning Engineer, GenAI Platform USD 253K-354KCUDA | DeepSpeed | Distributed Systems | Docker | FSDP401k employer match | Family planning support | Flexible vacation | Gender-affirming care | Healthcare benefitsSenior-level Full TimeRemote - United States R7d ago
-
Software Engineer III USD 145K-187KAWS | Azure | C# | Docker | Elasticsearch401k match | Dental insurance | Health insurance | Health savings account | PTOSenior-level Full TimeRemote, United States R7d ago
-
AWQ | C++ | CRD | CUDA | DockerCareer development | Employee resource groups | Flexible work from home | Generous paid time off | Volunteer timeSenior-level Full TimeUS-Texas-Austin, United States R7d ago
-
Staff Software Engineer, Combinatorial Optimization USD 155K-213KC# | C++ | CI/CD | Cloud Native | Cloud Native ArchitectureCompany holidays | Health insurance | Learning and development reimbursement | Life insurance | Long-term disabilitySenior-level Full TimeTorrance, California, United States; US - … R8d ago
-
Senior Software Engineer, Data Infrastructure USD 191K-225KAWS EMR | Apache Airflow | Apache Iceberg | Apache Spark | Data ETLEmployee travel credits | Remote eligibleSenior-level Full TimeUSA - Remote R8d ago
-
Principal Machine Learning Engineer USD 220K-300KCUDA | Continuous Learning | Data Preparation | Drift monitoring | Embedding401k | Employee assistance program | Employee stock purchase plan | Health savings account | Medical/Dental/Vision insuranceSenior-level Full TimeUnited States | Remote R9d ago
-
APIs | Agent Orchestration | Agentic Systems | Air-gapped | Air-gapped environments401k option | Comprehensive health care | Equity Incentives Option | FSA option | Mental health benefitsSenior-level Full TimeSeattle, WA or McLean, VA or … R9d ago
-
AI/Machine Learning Engineer Intern USD 55K-86KAPIs | Artificial Intelligence | Benchmarking | Evaluation | LLM APIsFun events | Leadership speaker series | Mentorship | Professional network | Training and developmentEntry-level InternshipUnited States - Remote R9d ago
-
Jr. AI Engineer USD 70K-85KAPI Development | Backend Development | Database Design | Embeddings | Generative AI401k matching | Bonuses | Cell phone reimbursement | Dental insurance | Health insuranceEntry-level Full TimeNew York, NY; Remote/Hybrid R9d ago
-
Alerting | BMC | Bash | Data center | Data center networkingSenior-level Full TimeUS, TX, Remote, United States R9d ago
-
Sr. Embedded & Compute Software Developer USD 130K-160KC# | C++ | CUDA | DO-178 | Debugging401k matching | Dental insurance | Employee assistance program | Health insurance | Paid HolidaysSenior-level Full TimeRemote (United States); Canada R12d ago
-
ML Engineer - Inference USD 151K-270KCUDNN | Edge Computing | Model Compression | Model Evaluation | Model Monitoring401k matching | Continued Education | Health coverage | Life and disability coverage | Paid time offMid-level Full TimeUnited States R12d ago