aijobs.net

Senior Consultant Specialist (Model Hosting/Inference Optimization)

Guangzhou, Guangdong, China

CNY 144K-240K (estimate) Senior-level Full Time

Apply Save
Found 18h ago
Tasks
Perks/Benefits
Skills/Tech-stack

AWQ | AWS | Batching | CPU architecture | CUDA | Distributed Training | Docker | FP8 | Fine Tuning | GPTQ | GPU Architecture | Google Cloud | HPC | Hugging Face | Hugging Face Accelerate | Hugging Face Transformers | Hyperparameter Tuning | INT4) | KV cache | Kubernetes | LoRA | Microsoft Azure | Monitoring | Operator optimization | Python | QLoRA | Quantization | SGLang | TensorRT-LLM | VLLM

Education

Bachelor of Science | Master of Science | PhD

Roles

AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer | Senior AI Engineer | Senior Machine Learning Engineer

Regions

Asia/Pacific

Countries

China

States

Guangdong, CN

Cities

Guangzhou, Guangdong, CN

Apply Save
Language: en Views: 0 Clicks: 0 Saves: 0

Related jobs