aijobs.net

Senior Consultant Specialist (Model Hosting/Inference Optimization)

Guangzhou, Guangdong, China

CNY 144K-184K (estimate) Senior-level Full Time

Apply Save
Found 1d ago
Tasks
Perks/Benefits
Skills/Tech-stack

AWQ | AWS | Accelerate | Benchmarking | CUDA | Distributed Training | Docker | FP8 | GPTQ | Google Cloud | Hugging Face | Hugging Face Transformers | Hyperparameter Tuning | INT4) | KV cache | Kubernetes | LoRA | Microsoft Azure | Monitoring | Python | QLoRA | Quantization | SGLang | TensorRT-LLM | VLLM

Education

Bachelor of Science | Master of Science | PhD

Roles

Consultant Specialist | Engineer | Learning Engineer | ML Engineer | Machine Learning Engineer | Senior Consultant | Senior Consultant Specialist | Specialist

Regions

Asia/Pacific

Countries

China

States

Guangdong, CN

Cities

Guangzhou, Guangdong, CN

Apply Save
Language: en Views: 3 Clicks: 1 Saves: 0

Related jobs