SRE Lead

Telstra ICC Bangalore, India

Telstra

Join Australia's largest mobile network, view our plans for NBN broadband internet, mobile phones, 5G & on demand streaming services.

View all jobs at Telstra

Apply now Apply later

Employment Type

Permanent

Closing Date

10 Mar 2025 11:59pm

Job Title

SRE Lead

Job Summary

You will provide scalable, reliable, durable, and secure applications for our customers and internal users. You will help build highly reliable applications using a customer-first approach while innovating technically. You will understand our customer's needs and how we can meet them.
You will be joining the Telstra Software Engineering, in Telstra in one of our ICC locations.

Job Description

About Us:

At Telstra, our purpose is to build a connected future so everyone can thrive. It's a future that won't happen on its own, it has to be delivered — and only Telstra can bring together all the parts to create it. Telstra is on a mission to redesign the way we all connect - with leading-edge technologies and solutions that are changing the world. And this is where YOU come in, by playing your part to building in help our customers connect: faster, better, and smarter.

 

Why Telstra?

Telstra is a well-known Australian company that has been around for over 100 years. We are the leading telecommunications and technology company in Australia and have been operating internationally for over 70 years. We have a strong presence in over 20 countries. In India, we have offices in Bangalore, Pune, Mumbai, and Delhi. We are focused on using innovation, automation, and technology to solve major technological challenges in areas such as IoT, 5G, AI, and machine learning. Joining Telstra gives you the chance to make a difference in the lives of millions of people and have a rewarding career with flexibility.

The role with us:

As a Site Reliability Engineer, you will have the opportunity to manage the complex challenges of scale which are unique to Telstra’s digitisation, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. You will provide scalable, reliable, durable, and secure applications for our customers and internal users. You will help build highly reliable applications using a customer-first approach while innovating technically. You will understand our customer's needs and how we can meet them.

You will be joining the Telstra Software Engineering, in Telstra in one of our ICC locations

We're interested in hearing from people who have:

  • Critical thinking mindset, strong sense of accountability for product delivery, passion to develop quality software.
  • Good communication skills, and team player.
  • Experience working (or willing to work) with geographically distributed teams.
  • Strong technical background
  • Develop own and peer’s skills and be a mentor to junior peers
  • Understanding of Incident and Problem Management/ITIL Certification.

Responsibilities:

  • Within the Site Reliability Engineering team, you will be working with development team, and other partner teams to ensure that applications reliability, efficiency, and performance meets our customer's needs, while keeping the service's operation's reliable, scalable, and automated.
  • Develop tools and automation to streamline operations and improve system reliability, efficiency, and performance.
  • Partner with development teams on feature launches to ensure our customers are delivered reliable and scalable functionality.
  • Build a deep knowledge on production infrastructure and using that to debug distributed systems problems and identify improvements to the system.
  • Operations, SLO, SLA management
  • Metrics reporting and progress tracking.
  • Work with security teams to ensure compliance with security policies and procedures.
  • Participate in on-call rotations to provide 24/7 support for our systems.
  • Observability (Alarms, monitoring, synthetics).
  • Error management

Qualifications:

  • Bachelor’s degree in computer science or a related engineering degree
  • 10+ years of IT industry experience
  • Experience in
    • Observability using Splunk, NewRelic
    • APIs and event-driven approaches
    • Security patterns
    • Infrastructure as Code using terraform
    • Java, Nodejs, microservices, NoSQL
    • AWS EC2, S3, Lambda, IAM, ECS, EKS, SQS, Kinesis
  • Strong Experience in analysing and troubleshooting large-scale distributed systems. Quick reaction on high severity customer impacts.
  • Ability to debug and optimise code and automate routine tasks.
  • Familiarity with containerisation and orchestration technologies such as Docker
  • Knowledge in modern software engineering practices and tools - Agile and DevOps
  • Strong communication skill and the ability to explain complex technical matters in an easy-to-understand way.

Nice to have:

  • Knowledge on additional tools
    • Python
    • APIGEE
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0
Category: Leadership Jobs

Tags: Agile APIs AWS Computer Science DevOps Distributed Systems Docker EC2 ECS Engineering ITIL Java Kinesis Lambda Machine Learning Microservices Node.js NoSQL Python Security Splunk Terraform

Perks/benefits: Career development

Region: Asia/Pacific
Country: India

More jobs like this