Inference Engineer - Acceleration
Tasks
- Analyze token cost
- Apply quality regression gates
- Drive low-precision MoE serving
- Implement prefill/decode split
- Instrument inference stack
- Manage KV cache hierarchy
- Optimize throughput, latency, and uptime
- Tune scheduling and admission control
Perks/Benefits
- Commuting subsidy
- Learning and development budget
- Offsites and team events
- Pension plan
- Vacation days
Skills/Tech-stack
Admission control | CUDA | CUTLASS | FlashAttention | KV cache | Long context | Long context attention | MoE | Nsight | Quantization | RDMA | SGLang | Scheduling | TensorRT-LLM | Triton | vLLM
Education
N/A