aijobs.net

ML Research Scientist -Deep Learning & Transformer Architectures

New York, New York, United States of America

USD 150K-200K Senior-level Full Time

Apply Save
Found 21h ago
Tasks
Perks/Benefits
Skills/Tech-stack

Attention Mechanisms | C++ | Decoder Only | Decoder-only Transformer | GPU parallelism | Gradient Checkpointing | Inference Optimization | Information theory | KV cache | Linear Algebra | Mixed Precision | Mixed-precision training | Model Evaluation | Model Quantization | Multi GPU Parallelism | Multi-GPU | Next Token Prediction | Optimization | Positional encoding | Probability theory | PyTorch | Python | Speculative decoding | Tokenization | Transformer

Education

PhD

Roles

Machine Learning Research Scientist | Research Scientist | Scientist

Regions

North America

Countries

United States

States

New York, US

Cities

New York City, New York, US

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs