Lead - Cloud Reliability Engineering

Off Embassy Golf Links Business Park, Bangalore, India

Job Description:

Job Title: Lead Cloud Reliability Engineer

Team: Enterprise Cloud Enablement – Cloud Reliability Engineering

The Purpose of the Role

We are seeking a Lead Cloud Reliability Engineer to join our Enterprise Infrastructure Cloud Reliability Engineering team. You will play a key role in evolving our event-driven platform, which aggregates and enriches cloud-native events across multiple Cloud Service Providers. Your mission is to unlock enterprise-wide insights, identify patterns in cloud usage and reliability, and drive intelligent automation that improves resilience and operational efficiency.

The Value You Deliver

• Event Intelligence Development

• Design and implement systems to analyze and correlate cloud events with internal enterprise data.

• Build classification models and anomaly detection pipelines to surface trends and outliers.

• Correlate cloud-native events with internal data (e.g., CMDB, deployment metadata).

• Build classification and trend detection logic to surface enterprise-wide themes.

• Develop dashboards and reports that highlight reliability, security, and operational themes.

• Automation & Remediation

• Identify opportunities for intelligent automation based on recurring event patterns.

• Build or integrate with remediation workflows (e.g., auto-ticketing, self-healing scripts).

• Collaborate with SREs and platform teams to implement proactive mitigation strategies.

• Data Enrichment & Integration

• Enhance event data with metadata from CMDB, asset inventory, and deployment pipelines.

• Ensure data quality, consistency, and traceability across the platform.

• Collaboration & Enablement

• Partner with security, operations, and application teams to understand pain points and use cases.

• Contribute to post-incident reviews and help translate findings into platform improvements.

• Contribute to a culture of learning by sharing insights and helping teams adopt best practices.

The Expertise You Have:

• Over 5 years of strong experience with cloud platforms (AWS and Azure) and their native event systems.

• 5 years of development experience with Python, Go, or similar languages for backend and automation work.

• Experience with event streaming platforms (e.g., Kafka, EventBridge, Pub/Sub).

• Working experience with observability tools (e.g., Datadog, Prometheus, ELK, Grafana).

• Strong experience with CI/CD pipelines, infrastructure as code, and DevOps practices.

• Experience with data modeling, ETL pipelines, and SQL/NoSQL databases.

Non-Technical Skills:

• Strong analytical thinking and ability to derive insights from complex data.

• Excellent communication skills – able to translate technical findings into business value.

• Collaborative mindset with a passion for cross-functional teamwork.

• Curiosity and a continuous improvement mindset – always looking for better ways to do things.

• Ability to prioritize and manage ambiguity in a fast-paced environment.

• Growth mindset – eager to learn, experiment, and continuously improve.

Preferred Skills:

• Exposure to machine learning or rule-based systems for pattern detection.

• Understanding of security advisories, vulnerability management, and compliance frameworks.

• Familiarity with ITSM tools (e.g., ServiceNow), incident management, and CMDB integration.

Company Overview

At Fidelity, we are focused on making our financial expertise broadly accessible and effective in helping people live the lives they want. We are a privately held company that places a high degree of value in creating and nurturing a work environment that attracts the best talent and reflects our commitment to our associates. We are proud of our diverse and inclusive workplace where we respect and value our associates for their unique perspectives and experiences.

For information about working at Fidelity, visit FidelityCareers.com. Fidelity Investments is an equal opportunity employer.

Category: Information Technology
