SRE Analytics Lead - C14 - SINGAPORE
3 CHANGI BUSINESS PARK CRESCENT CHANGI BUSINESS PARK SINGAPORE, Singapore
Citi
Citi is a leading global bank for institutions with cross-border needs, a global provider in wealth management and a U.S. personal bank.
The SRE Analytics Lead is a strategic professional who thrives at the intersection of engineering, data, and operations. This role reports to the Head of SRE Services and is crucial for building a comprehensive metrics ecosystem for the Services business that reflects the true state of our platforms and progress against Production engineering goals.
We are looking for someone who will influence engineering and production teams across Services and wider SMBF Production to adopt meaningful, actionable metrics, helping shift the culture from reactive reporting to proactive reliability management.
This role is key to uplifting maturity across the enterprise: not just building dashboards, but helping teams internalize what good looks like and supporting them in closing the gap.
Responsibilities:
Design, build, and own key Production Engineering dashboards and metrics pipelines, with hands-on ownership across enterprise tools like Tableau, Grafana, Jira, and ServiceNow, giving teams the visibility to make smarter, faster decisions in day-to-day operations and incident response.
Establish consistent, enterprise-aligned frameworks and guide teams in adopting them, helping to mature how the wider production organization defines, tracks, and acts on engineering health and operational risk.
Own the end-to-end data pipeline, from extraction (via APIs or queries) through transformation, validation, and delivery, for SRE and wider Production metrics, ensuring full alignment with the bank's Agile workflows.
Have an automation-first mindset: challenge the status quo, collaborate, and contribute innovative solutions to the wider SMBF Production capabilities to improve visibility of key engineering metrics.
Track and improve critical production OKRs across Services Production such as MTTR, MTTD, change success rate, recovery automation/Swing tests, alert volume, and toil, by providing actionable insights.
Utilise and reuse existing enterprise solutions to create a unified view of reliability and recovery trends within Services.
Collaborate with other central Observability, Architecture and Infrastructure teams to ensure the availability, quality, and consistency of engineering data.
Build out data models and repositories that support historical analysis, trend forecasting, and anomaly detection.
Drive executive and operational reporting that tells a real story of engineering progress, platform health, and critical business impact, enabling LoBs to make data-driven decisions.
Support SRE tooling strategy by identifying gaps in telemetry, metrics maturity, and automation opportunities.
Define and operationalize SLIs, SLOs, and error budgets in partnership with other SREs and development teams across Services, ensuring continuous refinement.
Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behaviour, conduct and business practices, and escalating, managing and reporting control issues with transparency.
Qualifications:
15+ years of experience in SRE, Observability, Engineering Productivity, or Data Engineering roles.
Hands-on experience with Tableau and Grafana for visualization and reporting.
Strong command of data integration and engineering techniques (e.g., REST APIs, SQL, Python, ETL tools, data modelling).
Experience building metrics pipelines and data workflows across ServiceNow, Jira, Grafana, cloud telemetry, and operational systems.
Familiarity with defining and implementing SLIs, SLOs, and error budget-based engineering workflows.
Deep understanding of incident response, recovery processes, and engineering operations in enterprise environments, and of the related KPIs.
Demonstrated ability to influence enterprise outcomes using data – from post-incident reviews to quarterly engineering OKRs.
Strong communication skills with the ability to engage both senior technical and non-technical audiences.
Demonstrated sociable, positive, can-do attitude, with the ability to learn quickly and take the initiative to deliver creative and productive solutions.
Ability to communicate, network, and influence at all levels.
Ability to balance multiple demands and work both independently and as part of a matrix organisation to develop solutions.
Education:
Bachelor’s degree in Computer Science, Engineering, Data Science, or a related technical field, or equivalent practical experience.
------------------------------------------------------
Job Family Group:
Technology
------------------------------------------------------
Job Family:
Applications Support
------------------------------------------------------
Time Type:
Full time
------------------------------------------------------
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.