aijobs.net

Inference Optimization Architect, Speech AI

India, Bengaluru

INR 2500K-4500K (estimate) | Senior-level | Full Time

Found 3d ago
Skills/Tech-stack

Batching | CNN | CUDA | Computer Architecture | Dynamic Shapes | GPU Profiling | Inference Server | JAX | Kernel development | Knowledge Distillation | Latency Tuning | Memory Management | Model Compression | Nsight Compute | Nsight Systems | Operating Systems | Process scheduling | Pruning | PyTorch | Python | Quantization | RNN | TRT-LLM | TensorRT | Thread synchronization | TorchServe | Transformers | Triton Inference Server | vLLM

Education

Bachelor of Engineering | Master of Science

Roles

AI Engineer | Architect | Engineer | Inference Optimization Architect | Speech AI Engineer

Regions

Asia/Pacific

Countries

India

States

Karnataka, IN

Cities

Bengaluru, Karnataka, IN
