aijobs.net

Research Engineer - LLM/VLM Inference Optimization (Seed Infra)

San Jose, California, United States

USD 244K-450K Mid-level Full Time

Apply Save
Found 8h ago
Tasks
Perks/Benefits
Skills/Tech-stack

CUDA | CUDA kernels | Compiler optimization | Graph Fusion | High Performance | High-Performance Computing | Inference Optimization | Low Precision | Low-precision computing | Parallel Computing | Performance Analysis | Performance Computing | Precision computing | Speculative decoding | Streaming inference

Education

N/A

Roles

Engineer | Learning Engineer | Machine Learning Engineer | Research Engineer

Regions

North America

Countries

United States

States

California, US

Cities

San Jose, California, US

Apply Save
Language: en Views: 2 Clicks: 0 Saves: 0

Related jobs