aijobs.net

Associate Director, Software Engineering (Model Hosting/Inference Optimisation)

Shenzhen, Guangdong, China R

CNY 240K-360K (estimate) Mid-level Full Time

Apply Save
Found 3d ago
Tasks
Perks/Benefits
Skills/Tech-stack

AWQ | AWS | Accelerate | Azure | Batching | CUDA | Distributed Training | Docker | FP8 | Fine Tuning | GCP | GPTQ | Hugging Face | Hugging Face Transformers | Hyperparameter Tuning | INT4) | Inference Optimization | KV cache | Kubernetes | LLM | LoRA | Operator optimization | Python | QLoRA | Quantization | SGLang | TensorRT-LLM | VLLM

Education

Bachelor of Science | Master of Science | PhD

Roles

Engineer | Engineering Manager | Manager | Software Engineer | Software Engineering | Software Engineering Manager

Regions

Asia/Pacific

Countries

China

States

Guangdong, CN

Cities

Shenzhen, Guangdong, CN

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs