aijobs.net

Senior Software Engineer - AI Inference

New York

USD 160K-240K Senior-level Full Time

Apply Save
Found 4h ago
Tasks
Perks/Benefits
Skills/Tech-stack

Batching | CUDA | Caching | Distributed Systems | High Performance | High-Performance Computing | Inference Optimization | KServe | Kubernetes | Load Balancing | Machine Learning | Memory Aware Serving | NCCL | NVIDIA GPU | ONNX | Observability | Performance Computing | Prompt Caching | PyTorch | Request Routing | Request Scheduling | Structured Sampling | TensorRT | Traffic Management | Triton | VLLM

Education

N/A

Roles

Engineer | Learning Engineer | Machine Learning Engineer | Senior Software Engineer | Software Engineer

Regions

North America

Countries

United States

States

New York, US

Cities

New York City, New York, US

Apply Save
Language: en | Views: 1 | Clicks: 0 | Saves: 0

Related jobs