Inference Optimization Architect, Speech AI
Tasks
- Build automated model optimization pipelines
- Collaborate with research teams
- Design scalable inference infrastructure
- Develop custom hardware accelerated kernels
- Implement model compression techniques
- Monitor and improve inference resource utilization
- Optimize across diverse GPU platforms
- Optimize inference performance
- Profile and benchmark models
Perks/Benefits
- N/A
Skills/Tech-stack
CNNs | CUDA | Deep learning | GPU Profiling | GPU debugging | Inference Optimization | Knowledge Distillation | Model Compression | Model Deployment | Model Inference | Model Inference Optimization | Model Serving | Nsight Compute | Nsight Systems | Operating Systems | Pruning | Quantization | RNNs | TensorRT | Thread synchronization | Transformers
Education
Related jobs
-
Principal Engineer, Data Analytics Engineering INR 2500K-3900KAI Governance | API | Agent Orchestration | Agentic Workflows | ComplianceSenior-level Full TimeBengaluru, KA, India7h ago
-
AI Architect INR 2500K-4500KAWS | Azure | Cloud platform | Computer Vision | Deep learningFlexible work arrangements | Professional growth opportunitiesSenior-level Full TimeGurgaon, Haryana1d ago
-
Architect- Embedded Development/ AI and Robotics & Edge AI INR 2800K-4110KC# | C++ | CPU | Cross-compilation | DDSAccess to fitness clubs | Creche facility for working parents | Employee assistance program | Food and beverage vouchers | Health insuranceSenior-level Full TimeIND - India Tech Center1d ago
-
AI & ML Architect INR 2500K-4200KAI Foundry | AI Search | Apache Spark | Azure AI | Azure AI Foundry401k matching | Dental insurance | Development and career opportunities | Health insurance | Recognition and rewardsSenior-level Full TimeIN Pune Pentagon Tower, India1d ago
-
AI Solutions Lead INR 2500K-4000KAWS | Data Lake | Data Pipelines | Data Warehousing | Deep learningSenior-level Full TimePune, MAHĀRĀSHTRA, India2d ago
-
AI Architect – Lilly Medicine Foundry (R5-6) INR 3000K-4725KAPI Design | AWS | Airflow | Artificial Intelligence | AzureSenior-level Full TimeIN: Lilly Hyderabad, India2d ago
-
AI Platform Architect- VP INR 2500K-4500KAPI Design | AWS | AWS Bedrock | Agentic Workflows | Amazon Elastic Container RegistrySenior-level Full TimeBCIT Bengaluru Office (MGS), India3d ago
-
Assistant Vice President INR 1800K-6000KAI Governance | API Orchestration | AWS | Agent systems | AzureExecutive-level Full TimeIndia3d ago
-
Large Language Model Architect INR 2500K-4000KData Processing | Deep learning | Language Models | Language Processing | Large Language ModelsSenior-level Full TimeBengaluru, BDC11A, India4d ago
-
Principal AI Architect INR 2500K-4500KAI Governance | AWS Lambda | AWS Step Functions | Amazon Bedrock | Amazon EC2Senior-level Full TimeIND19-01-Bengaluru-EPIP 122 (Phase II), India4d ago
-
Mid-level Full TimeBangalore, Karnataka, India7d ago
-
Lead AI Developer INR 2500K-3500KArtificial Intelligence | Backtesting | Cybersecurity | Data Architecture | Data GovernanceSenior-level Full TimeVadodara, India8d ago
-
Senior-level Full TimeBengaluru, Karnataka, India8d ago
-
Engineering Manager / Architect — Data Science & ML INR 3200K-4500K21 CFR | 21 CFR Part 11 | API Design | API Integration | AWSSenior-level Full TimeChennai9d ago
-
AI/ML Architect INR 2535K-4500KData Pipelines | Deep learning | Distributed Systems | Graph Analysis | Graphical ModelsSenior-level Full TimeDelhi NCR9d ago
-
Senior-level Full TimeBangalore–Embassy Business Hub, India10d ago
-
Technology Architect INR 2500K-3000KAPI Integration | Agent Builder | Avaya | Chatbots | Cloud FunctionsSenior-level Full TimePune, PDC2C, India10d ago
-
Senior-level Full TimeINDIA, Bangalore14d ago
-
Staff Software Engineer, Applied AI, Search INR 2500K-4500KData Processing | Debugging | Distributed Computing | Fine Tuning | Information RetrievalSenior-level Full TimeBengaluru, Karnataka, India15d ago
-
EY - GDS Consulting - AIA - AI Architect - senior INR 2500K-4500KAPI Development | Amazon Web Services | Caching | Cost Control | DistillationSenior-level Full TimeKolkata, WB, IN, 70009115d ago
-
Technical Architect (Python + AI) INR 2500K-4500KAWS | Azure | CI/CD | Data Governance | Deep learningSenior-level Full TimeBangalore, India15d ago
-
Senior Software Architect INR 3200K-4600KAutomated Training | Automated retraining | CI/CD | Data Drift | Data Drift DetectionSocial impact initiatives | Volunteering opportunitiesSenior-level Full TimeBangalore Office, India15d ago
-
Senior Software Architect INR 3200K-5000KCausal Inference | Deep learning | Distributed Training | Feature Engineering | GPU ComputingSenior-level Full TimeBangalore Office, India15d ago
-
Software Engineering Expert- C++ INR 3125K-4600KAgile | Algorithms | Apache Airflow | Apache Spark | Apache StormSenior-level Full TimeChennai,IND, India15d ago
-
AI ML Architect INR 2500K-3500KAWS | AWS Glue | AWS Lambda | Agentic AI | CI/CDContinuing education and training | Global agile team collaborationSenior-level Full TimeIN-UP-Noida-Candor TechSpace Tower 1, India15d ago