aijobs.net

Member of technical staff (Inference) - Paris

Paris

EUR 80K-120K (estimate) Senior-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

C++ | CUDA | Caching | Continuous batching | Distributed Computing | Flash Attention | GPU Programming | Ggml | Llama.cpp | Model Compression | NCCL | ONNX | ONNX Runtime | Paged Attention | PyTorch | Python | Quantization | Rust | SGLang | TensorRT-LLM | Triton | VLLM

Education

Master of Science | PhD

Roles

AI | AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer

Regions

Europe

Countries

France

States

Île-de-France, FR

Cities

Paris, Île-de-France, FR

Apply Save
Language: en | Views: 0 | Clicks: 0 | Saves: 0

Related jobs