Inference Optimization Architect, Speech AI
Tasks
- Build automated model optimization pipelines
- Collaborate with research teams
- Design scalable inference infrastructure
- Develop custom hardware-accelerated kernels
- Implement model compression techniques
- Monitor and improve inference resource utilization
- Optimize across diverse GPU platforms
- Optimize inference performance
- Profile and benchmark models
Skills/Tech-stack
CNNs | CUDA | Deep Learning | GPU Profiling | GPU Debugging | Inference Optimization | Knowledge Distillation | Model Compression | Model Deployment | Model Inference | Model Inference Optimization | Model Serving | Nsight Compute | Nsight Systems | Operating Systems | Pruning | Quantization | RNNs | TensorRT | Thread Synchronization | Transformers