Staff Site Reliability Engineer

Bangalore, India

Zscaler

Zscaler, the zero trust cybersecurity leader, accelerates digital transformation with fast, secure connections between users, devices and apps over any network.

View all jobs at Zscaler

Apply now Apply later

About Zscaler

Serving thousands of enterprise customers around the world including 40% of Fortune 500 companies, Zscaler (NASDAQ: ZS) was founded in 2007 with a mission to make the cloud a safe place to do business and a more enjoyable experience for enterprise users. As the operator of the world’s largest security cloud, Zscaler accelerates digital transformation so enterprises can be more agile, efficient, resilient, and secure. The pioneering, AI-powered Zscaler Zero Trust Exchange™ platform protects thousands of enterprise customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location. 

Named a Best Workplace in Technology by Fortune and others, Zscaler fosters an inclusive and supportive culture that is home to some of the brightest minds in the industry. If you thrive in an environment that is fast-paced and collaborative, and you are passionate about building and innovating for the greater good, come make your next move with Zscaler. 

Our Engineering team built the world’s largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your vision and passion to our team of cloud architects, software engineers, security experts, and more who are enabling organizations worldwide to harness speed and agility with a cloud-first strategy.

We're looking for an experienced Site Reliability Engineer to join our Platform & Tooling team. Reporting to the Senior Director, you'll be responsible for:

  • You will design the infrastructure, tools, services, and platforms that allow our operations and primary product teams to deliver high-quality, reliable, and scalable services, enhancing the customer experience.
  • We are looking for close collaboration with multiple teams to improve our observability, automation, configuration management, continuous deployment, and reliability practices.

Job Location - Bangalore / Hyderabad

What We’re Looking for (Minimum Qualifications) 

  • 6+ years of software development experience in Cloud-SRE/DevOps/System Engineering
  • Technical Skills: Proficiency in a combination of SRE and software development(any).
  • Operating Systems: Linux, FreeBSD
  • Scripting and Programming: Python, Go, Shell
  • Observability: OpenTelemetry, Prometheus, Grafana, ELK stack, or any enterprise monitoring platform
  • Databases: PostgreSQL, OLAP/Time Series/Analytics DBs, Redis
  • Other SRE Tools: Terraform, Puppet, Ansible, ETL, Docker, Kubernetes, JIRA, Jenkins, Maven, Splunk, Tomcat/Nginx, Kafka, Apache Spark, Flink.
  • QA/Testing: Experience with relevant testing tools and frameworks
  • AI/ML: MLOps, GenAI, SRE/AI Co-Pilot

What Will Make You Stand Out (Preferred Qualifications)

  • Infrastructure Management: Design, manage scalable and reliable infrastructure solutions to support Zscaler’s cloud services.
  • Observability: Develop systems for monitoring, logging, tracing, and alerting to ensure the health and performance of our infrastructure and applications.
  • Automation: Design automation and orchestration platforms and tools to stimproveeployment, patching, upgrades, scaling, and management of infrastructure and applications.
  • Continuous Deployment (CD) Platform: Design and maintain a Continuous Deployment platform and tools to deploy our products into cloud environments with high reliability and minimal human intervention.
  • Configuration Management: Design, and maintain configuration management platforms and tools to improve configurations within our cloud environment.
  • Chaos Engineering: Design a Chaos Engineering platform to drive failure modes and effects analysis, ensuring maximum resilience and scalability of infrastructure and applications.
  • Portal Development: Design scalable portals for SRE dashboards, SLI/SLO/SLA management, error budgets, and executive dashboards to support data-driven decision-making.
  • SRE/AI Co-Pilot Platform: Develop and integrate SRE/AI Co-Pilot solutions to enhance the efficiency of operations and engineering teams.
  • Collaboration: Work with product, operations, and security teams to ensure seamless integration and deployment of new tools, services, features, and updates across the cloud.
  • Documentation: Maintain comprehensive documentation of infrastructure, and toolso ensure knowledge sharing and continuity.
  • Continuous Improvement: Advocate for and implement best practices in site reliability engineering, promoting a culture of continuous improvement and learning.

By applying for this role, you adhere to applicable laws, regulations, and Zscaler policies, including those related to security and privacy standards and guidelines.

Zscaler is proud to be an equal opportunity and affirmative action employer. We celebrate diversity and are committed to creating an inclusive environment for all of our employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy or related medical conditions), age, national origin, sexual orientation, gender identity or expression, genetic information, disability status, protected veteran status or any other characteristics protected by federal, state, or local laws.

See more information by clicking on the Know Your Rights: Workplace Discrimination is Illegal link.

Pay Transparency

Zscaler complies with all applicable federal, state, and local pay transparency rules. For additional information about the federal requirements, click here.

Zscaler is committed to providing reasonable support (called accommodations or adjustments) in our recruiting processes for candidates who are differently abled, have long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support.

Apply now Apply later
  • Share this job via
  • 𝕏
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  0  0

Tags: Agile Ansible Architecture CX DevOps Docker ELK Engineering ETL Flink Generative AI Grafana Jenkins Jira Kafka Kubernetes Linux Machine Learning Maven MLOps OLAP PostgreSQL Privacy Puppet Python Security Spark Splunk Terraform Testing

Perks/benefits: Career development Health care Transparency

Region: Asia/Pacific
Country: India

More jobs like this