aijobs.net

Performance Engineer, GPU

San Francisco, CA | New York City, NY | Seattle, WA

USD 280K-850K Senior-level Full Time

Apply Save
Found 3d ago
Tasks
Perks/Benefits
Skills/Tech-stack

Bandwidth Optimization | CUDA | Cluster Orchestration | Collective communication | Custom Operators | Cutlass | FP8 Quantization | Fault Tolerance | Flash Attention | Int8 Quantization | JAX | Kernel Fusion | Memory bandwidth | Memory bandwidth optimization | Mixed Precision | Model Parallelism | NCCL | NVLink | Nsight | PyTorch | Tensor Core | Tensor core optimization | Torch compile | Triton | XLA

Education

Bachelor of Science

Roles

Engineer | GPU Performance Engineer | Performance Engineer

Regions

North America

Countries

United States

States

New York, US | California, US | Washington, US

Cities

New York City, New York, US | San Francisco, California, US | Seattle, Washington, US

Apply Save
Language: en | Views: 0 | Clicks: 0 | Saves: 0

Related jobs