aijobs.net

Staff ML Engineer, Generative Model Performance & Efficiency

Mountain View, California, United States | New York City, New York, United States

USD 251K-310K | Senior-level | Full Time

Found 17h ago
Skills/Tech-stack

Data parallelism | Diffusion Models | Efficient Attention | Expert parallelism | Flax | GPU | High Throughput | JAX | Knowledge Distillation | Low Latency | Low-latency serving | Mixture of Experts | Model Compression | Model partitioning | NVIDIA Nsight | Perfetto | Pipeline parallelism | Pruning | PyTorch | Python | Quantization | TPU | Tensor Parallelism | TensorFlow | Transformers | XLA | Xprof

Education

Master of Science | PhD

Roles

Engineer | Learning Engineer | Machine Learning Engineer | Staff Machine Learning Engineer

Regions

North America

Countries

United States

States

New York, US | California, US

Cities

New York City, New York, US | Mountain View, California, US

