aijobs.net

AI Performance Optimization Engineer

United States - Remote

USD 100K-150K | Mid-level | Full Time

Found 2d ago
Skills/Tech-stack

Batching | Benchmarking | C++ | CUDA | Compiler optimization | Deep learning | DeepSpeed | Distributed Training | FSDP | Flash Attention | GPU | KV cache | LLM | LLM Inference | Machine Learning | Memory Management | Model Pruning | Model Quantization | Model Sparsity | Paged Attention | Pipeline parallelism | Profiling | Python | Regression testing | Speculative decoding | TVM | Tensor Parallelism | TensorRT-LLM | Torch | TorchInductor | Triton | vLLM | XLA | ZeRO

Education

Bachelor of Science | Master of Science

Roles

AI | AI Performance Optimization Engineer | Engineer | Learning Engineer | Machine Learning Engineer | Optimization Engineer | Performance Optimization Engineer

Regions

North America

Countries

United States
