aijobs.net

AI Performance Optimization Engineer

United States - Remote R

USD 136K-258K (estimate) Mid-level Full Time

Apply Save
Found 22h ago
Tasks
Perks/Benefits
Skills/Tech-stack

Access Optimization | Attention Optimization | Benchmarking | C++ | Compiler optimization | Continuous batching | Data loading | Data loading optimization | Deep learning | Distributed Training | FSDP | FlashAttention | GPU Architecture | KV cache | Loading Optimization | Memory Management | Model Compression | Model Parallelism | Paged Attention | Pipeline parallelism | Profiling | Pruning | Python | Quantization | Quantization aware training | Regression testing | Sparsity | Speculative decoding | Storage Access | Storage Access Optimization | TVM | Tensor Parallelism | TorchInductor | Triton | XLA | Zero

Education

Bachelor of Science | Master of Science

Roles

AI | AI Performance Optimization Engineer | Engineer | Optimization Engineer | Performance Optimization Engineer

Regions

North America

Countries

United States

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs