Inference Optimization Architect, Speech AI
Tasks
- Apply knowledge distillation
- Apply model pruning
- Apply model quantization
- Build automated model optimization pipelines
- Collaborate with model researchers to deploy to production
- Design speech model serving infrastructure
- Develop CUDA custom kernels
- Implement encoder caching
- Improve batching strategies
- Leverage CUDA and TensorRT acceleration
- Monitor inference costs and resource utilization
- Optimize inference across GPUs and platforms
- Optimize multi threaded inference pipelines
- Optimize streaming latency and throughput
- Profile and benchmark inference bottlenecks
- Use Nsight Compute for GPU debugging
- Use Nsight Systems for GPU profiling
Perks/Benefits
- N/A
Skills/Tech-stack
Batching | CNN | CUDA | Computer Architecture | Dynamic Shapes | GPU Profiling | Inference Server | JAX | Kernel development | Knowledge Distillation | Latency Tuning | Memory Management | Model Compression | Nsight Compute | Nsight Systems | Operating Systems | Process scheduling | Pruning | PyTorch | Python | Quantization | RNN | TRT-LLM | TensorRT | Thread synchronization | Torchserve | Transformers | Triton Inference | Triton Inference Server | VLLM
Education
Related jobs
-
Principal Lead - Data Engineer - 3132 INR 2000K-3500KAPIs | Agile | Apache Airflow | DBT | Data PipelinesFlexible working hours | Global exposure | Professional development | Rewards and recognition | Work-life balanceSenior-level Full TimeChennai, India3h ago
-
Senior Staff Software Engineer (AI/ML) INR 3000K-4000KA/B | A/B Testing | Access Control | Airflow | Audit LoggingSenior-level Full TimeHyderabad, India3h ago
-
Mid-level Full TimeHyderabad, TS, India4h ago
-
Mid-level Full TimeChennai, Tamil Nadu, India5h ago
-
Staff GenAI Engineer INR 2000K-4763KAWS | Agentic Workflows | Azure | CI/CD | Distributed SystemsCollaborative team environment | Leadership development program | Training sessionsSenior-level Full TimeMumbai, MH, India5h ago
-
Sr Data Engineering Manager INR 1500K-2400KAzure Data | Azure Data Factory | Azure Service | Azure Service Bus | CI/CDSenior-level Full TimeHyderabad, TS, India6h ago
-
Data Engineering Manager INR 1500K-2000KAzure Data | Azure Data Factory | Azure Service | Azure Service Bus | CI/CDMid-level Full TimeHyderabad, TS, India6h ago
-
Software Engineer – IB Tech & CRM/Analytics INR 3000K-4000KAWS EKS | Amazon Redshift | Amazon Web Services | CI/CD | DockerSenior-level Full TimePune, India7h ago
-
Sr. Snowflake Data Engineer - Risk Technology INR 3000K-4200KClustering keys | Cost Optimization | Data Quality | ELT | ETLSenior-level Full TimePune, India7h ago
-
Entry-level Full TimeIndia7h ago
-
Assistant Manager - Forward Deployed Engineer / AI Engineer INR 1800K-2500KAPI Integration | Autogen | Cloud Computing | CrewAI | FastAPIMid-level Full TimePune, Maharashtra, India8h ago
-
Azure Data Engineer - Associate Consultant INR 1500K-2000KAgile | Apache Hive | Apache Spark | Azure Data | Azure Data FactoryMid-level Full TimeBangalore, Karnataka, India8h ago
-
Mid-level Full TimeGurugram, Haryana, India9h ago
-
Sr. Software Engineer (AI/ML) (PhD / Postdoc) INR 1500K-2400KCausal Inference | Deep learning | Generative AI | Language Processing | Machine LearningAccess to large clinical dataset | Collaboration with healthcare partners | Conference publication support | Research freedomSenior-level Full TimeHyderabad, India9h ago
-
Mid-level Full TimeGurugram, Haryana, India9h ago
-
AI | AWS | Agile | Ansible | AutosysSenior-level Full TimeBengaluru, Karnataka, India10h ago
-
IT Data Engineer II INT INR 1800K-2500KApache Spark | Cloud Cost Optimization | Cloud platform | Cost Optimization | Data GovernanceMid-level Full TimeBangalore, India15h ago
-
Data Engineer - VOIS INR 2000K-2040KApache Airflow | Apache Spark | BigQuery | CI/CD | Cloud ComposerCollaborative environment | Flexible inclusive workplace | Learning opportunitiesMid-level Full TimePune, IN15h ago
-
AWS CloudFormation | Amazon Kinesis | Amazon Redshift | Amazon S3 | Amazon Web ServicesEqual opportunity employer | Fast track growth opportunities | Great place to workSenior-level Full TimeHyderabad, Office Level 3 & 4, …15h ago
-
TTT-SAP-Python Gen AI-Tax Senior INR 2000K-4512KAI Core | AI Launchpad | API Management | CI/CD | Cloud ALMSenior-level Full TimeBengaluru, KA, IN, 56001615h ago
-
Entry-level Full TimeBangalore, India15h ago
-
Senior-level Full TimeIndia - Chennai15h ago
-
Data Architect INR 2520K-3380KAmazon Redshift | Bash | Business Requirements | Case documentation | Customer DataSenior-level Full TimeBangalore, India R15h ago
-
Data engineer (Azure Databricks) INR 1500K-2500KAmazon Web Services | Apache Airflow | Apache Kafka | Apache Spark | AzureMid-level Full TimeChennai, Tamil Nadu, India15h ago
-
Data Engineer INR 3000K-4000KAgile | Azure | Azure IAM | Azure Networking | Azure StorageBe Well programs | Certification support | Coaching | Continuous feedback | Hybrid workMid-level Full TimeINMANBP Bangalore (INMANBP) Manyatha, India15h ago