Inference Engineer - Acceleration
Tasks
- Analyze token cost
- Apply quality regression gates
- Drive low precision MoE serving
- Implement prefill decode split
- Instrument inference stack
- Manage KV cache hierarchy
- Optimize throughput latency uptime
- Tune scheduling and admission control
Perks/Benefits
- Commuting subsidy
- Learning and development budget
- Offsites and team events
- Pension plan
- Vacation days
Skills/Tech-stack
Admission control | CUDA | Cutlass | FlashAttention | KV cache | Long Context | Long context attention | MOE | Nsight | Quantization | RDMA | SGLang | Scheduling | TensorRT-LLM | Triton | VLLM
Education
N/A
Roles
Related jobs
-
Robotics Software Intern - Sim-to-Real CHF 63K-69KC++ | CUDA | Control Systems | Isaac Lab | Isaac SimEntry-level Full Time InternshipSwitzerland, Zurich7d ago
-
Entry-level InternshipSwitzerland, Zurich7d ago
-
C++ | CMake | CUDA | Cache optimization | ClangContract extension to indefinite tenure | Family allowances | Health insurance | Hybrid work schedule | Paid time offSenior-level Contract Full TimeGeneva, GENEVA, Switzerland10d ago
-
3D Geometry | C++ | CUDA | Hardware acceleration | Image classificationCollaboration with software and hardware teams | Fast-paced startup environment | Mission-driven workEntry-level Full TimeZurich11d ago
-
C++ | CUDA | Deep learning | GDB | GPGPUHealth insurance | Language classes | On-the-job training | Paid time off | Pension fundEntry-level Full TimeGeneva, GENEVA, Switzerland19d ago
-
AI Infrastructure Engineer CHF 128K-192KAgentgateway | Ansible | Apache Kafka | C++ | Cloud Native24x7 on-call rotationSenior-level Full TimeGland, VD, Switzerland21d ago
-
CAD | CUDA | Co-simulation | Contact mechanics | Controller co simulationSenior-level Full TimeSwitzerland, Remote R28d ago
-
Autonomy Engineer - Deep Learning Model Acceleration CHF 128K-188KCUDA | Computer Vision | Data Preparation | Deep learning | Edge DeploymentSenior-level Full TimeZurich, Switzerland1mo ago
-
Autonomy Engineer - Deep Learning Infrastructure CHF 140K-166KCUDA | Computer Vision | Deep learning | Edge Computing | GPUSenior-level Full TimeZurich, Switzerland1mo ago
-
Applied AI Engineer CHF 128K-192KAgent systems | Cost Optimization | Evaluation Frameworks | Inference Optimization | JAXSenior-level Full TimeZurich, Switzerland1mo ago
-
Robotics Software Engineer (Senior) - Rust CHF 111K-150KC++ | Debugging | Input Output | Kernel | LinuxSenior-level Full TimeZürich1mo ago
-
Embedded Software Engineer (m/w/d) CHF 65K-88KC++ | CI/CD | Embedded Systems | Hardware-in-the-loop | Machine architectureMid-level Full TimeSargans, Switzerland1mo ago