aijobs.net

Senior Machine Learning Engineer, Runtime and Serving

Mountain View, CA, USA

USD 213K-263K Senior-level Full Time

Apply Save
Found 2d ago
Tasks
Perks/Benefits
Skills/Tech-stack

Benchmarking | Buffer management | C++ | CUDA | Concurrent Systems | Continuous batching | Custom silicon | Data transfer | Deep learning | Distributed Systems | GPUs | JAX | KV caching | Low Latency | Low-Latency Systems | Machine Learning | Machine Learning Runtime | Model Compression | Model Optimization | Model Serving | ONNX Runtime | OpenXLA | PJRT | Prefix caching | Profiling | PyTorch | Python | Shared Memory | TPUs | TVM | Tensor Buffer Management | Tensor operations | TensorRT | Triton | XLA | Zero copy | Zero-copy data transfer

Education

Bachelor of Science | Master of Science

Roles

Engineer | Learning Engineer | ML Engineer | Machine Learning Engineer

Regions

North America

Countries

United States

States

California, US

Cities

Mountain View, California, US

Apply Save
Language: en | Views: 0 | Clicks: 0 | Saves: 0

Related jobs