aijobs.net

AI Research Engineer (Kernel & Inference Optimization)

Remote job R

USD 200K-332K (estimate) Senior-level Full Time

Apply Save
Found 19h ago
Tasks
Perks/Benefits
Skills/Tech-stack

Computer Vision | Deep learning | Diffusion Models | Distributed inference | Edge Computing | Embedded Systems | Expert parallelism | Flash Attention | GPU Kernels | GPU Programming | High Throughput | High-Throughput Systems | Inference Optimization | KV cache | Kernel optimization | Low Latency | Low-Latency Systems | Machine Learning | Memory Optimization | Metal Shading Language | Mobile optimization | Model Serving | NLP | On-device Inference | Pipeline parallelism | Pruning | Quantization | Shading language | Speculative decoding | Tensor Parallelism | Tokenization | Vision Transformers

Education

Bachelor of Science | Doctor of Philosophy | Master of Science | PhD

Roles

AI | AI Research Engineer | Engineer | Learning Engineer | Machine Learning Engineer | Research Engineer

Apply Save
Language: en Views: 3 Clicks: 0 Saves: 0

Related jobs