AI Platform Engineer
Tasks
- Build benchmarking suites
- Configure vLLM and Triton components
- Evaluate and implement inference optimizations
- Improve monitoring, tracing, and alerting
- Increase throughput under production traffic
- Manage batching and KV cache
- Measure latency, throughput, GPU utilization, memory, and cost
- Optimize inference runtimes and servers
- Own production inference deployment
- Participate in incident response and postmortems
- Reduce end-to-end latency
- Scale LLM inference services
- Translate business requirements into SLOs
Perks/Benefits
- N/A
Skills/Tech-stack
Alerting | CUDA | Continuous batching | GPU profiling | Go | KV cache | Kernel programming | Latency optimization | Monitoring | PyTorch | Python | Quantization | Root cause analysis | Rust | SGLang | Speculative decoding | TensorRT | Throughput optimization | Tracing | Triton | vLLM
Education
N/A