Research Engineer - LLM/VLM Inference Optimization (Seed Infra)
Seattle, Washington, United States
USD 232K-427K | Mid-level | Full Time
Tasks
- Apply parallel computing and graph fusion
- Collaborate with research teams on model optimization
- Conduct performance analysis and identify bottlenecks
- Design high-performance inference systems for LLMs and VLMs
- Develop CUDA kernels
- Develop inference engines and serving frameworks
- Develop model toolchains
- Enable streaming inference
- Implement compiler-level optimizations
- Implement speculative decoding
- Optimize end-to-end deployment pipelines
- Optimize handling of high-concurrency requests
- Use low-precision computation
Perks/Benefits
- N/A
Skills/Tech-stack
CUDA | CUDA kernel | Compiler optimization | Deployment Pipelines | Graph Fusion | High concurrency | Inference Optimization | Language Models | Large Language Models | Low-precision computing | Parallel Computing | Performance Profiling | Speculative decoding | Streaming inference | Vision Language Models
Education
N/A