aijobs.net

AI Research Engineer (Model Compression & Quantization) - 100% Remote Worldwide

Remote job R

USD 203K-330K (estimate) Senior-level Full Time

Apply Save
Found 18h ago
Tasks
Perks/Benefits
Skills/Tech-stack

Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism | Flash Attention | GPU Kernels | High Throughput | Inference Optimization | Inference Pipelines | KV cache | Low Latency | Machine Learning | Memory Optimization | Metal Shading Language | Mobile Devices | Model Compression | Model Pruning | Model Quantization | Model Serving | NLP | Pipeline parallelism | Shading language | Speculative decoding | Tensor Parallelism | Vision Transformers

Education

PhD

Roles

AI | AI Research Engineer | Engineer | Research Engineer

Apply Save
Language: en Views: 1 Clicks: 0 Saves: 0

Related jobs