Research Engineer - LLM/VLM Inference Optimization (Seed Infra)

Seattle, Washington, United States

USD 232K-427K | Mid-level | Full Time

@ B...

Found 1mo ago

Tasks

Apply parallel computing and graph fusion
Collaborate with research teams on model optimization
Conduct performance analysis and identify bottlenecks
Design high-performance inference systems for LLMs and VLMs
Develop CUDA kernels
Develop inference engines and serving frameworks
Develop model toolchains
Enable streaming inference
Implement compiler-level optimizations
Implement speculative decoding
Optimize end-to-end deployment pipelines
Optimize handling of high-concurrency requests
Use low-precision computation

Education

N/A

