aijobs.net

Research Engineer - LLM/VLM Inference Optimization (Seed Infra)

Seattle, Washington, United States

USD 232K-427K | Mid-level | Full Time

Found 6h ago
Skills/Tech-stack

CUDA | CUDA kernel | Compiler optimization | Deployment Pipelines | Graph Fusion | High concurrency | Inference Optimization | Language Models | Large Language Models | Low-precision computing | Parallel Computing | Performance Profiling | Precision computing | Speculative decoding | Streaming inference | Vision Language Models | Vision-language

Education

N/A

Roles

Engineer | Research Engineer

Regions

North America

Countries

United States

States

Washington, US

Cities

Seattle, Washington, US

