Senior Site Reliability Engineer
IND.Chennai, India
Workday
Workday unites HR and finance on one AI platform to help elevate humans and supercharge work to keep business moving forever forward.Your work days are brighter here.
At Workday, it all began with a conversation over breakfast. When our founders met at a sunny California diner, they came up with an idea to revolutionize the enterprise software market. And when we began to rise, one thing that really set us apart was our culture. A culture which was driven by our value of putting our people first. And ever since, the happiness, development, and contribution of every Workmate is central to who we are. Our Workmates believe a healthy employee-centric, collaborative culture is the essential mix of ingredients for success in business. That’s why we look after our people, communities and the planet while still being profitable. Feel encouraged to shine, however that manifests: you don’t need to hide who you are. You can feel the energy and the passion, it's what makes us unique. Inspired to make a brighter work day for all and transform with us to the next stage of our growth journey? Bring your brightest version of you and have a brighter work day here.
About the Team
Workday VNDLY is a next-generation vendor management SaaS platform crafted for procurement executives, talent acquisition teams, suppliers and Managed Service Providers (MSPs) to collaborate on the corporation's contingent workforce needs. We develop cloud software that helps our customers streamline the talent sourcing and acquisition process across different sources including staffing agencies, job boards and freelance management systems. We want our software to be so easy to use, our company name is short for "Vendor Friendly". At Workday VNDLY, our software does all of the work for the client, putting algorithms and data science to work in acquiring the right talent (Contractors or Full-time employees) and incorporating dashboards to provide complete transparency of the talent acquisition process.We are a team within Workday focused on becoming the most customer centric enterprise software company in the world. We care about enabling our customers to connect with the talent they need, so they can succeed. In order to complete our mission we believe in retaining an exceptional engineering team with a passion for code quality and phenomenal design.
About the Role
Workday VNDLY is seeking a dynamic Site Reliability Engineer (SRE) to join our Platform Operations team. You'll play a crucial role in building solutions that scale our operations and deliver reliable support to our customers. We're committed to optimizing our product delivery pipeline, balancing innovation with unwavering stability.
About You
Do you thrive on solving complex customer problems? Are you a creative SRE/DevOps Engineer seeking opportunities to automate and improve reliability, or a developer passionate about building tools to reduce manual effort? If so, we want to hear from you. You'll be empowered to design and implement solutions that not only resolve immediate issues but also drive lasting improvements by preventing future occurrences.
Responsibilities
Develop in-depth understanding of the product, architecture, and supporting technologies
Design, build, enhance, and maintain large-scale customer facing systems using AWS and GCP
Respond to production monitoring: triage, fix, and resolution. Retro and act on incidents to continually improve our systems and processes.
Collaborating with internal teams to investigate and resolve non-standard customer incidents.
Improve operational efficiency to help scale the team by automating where reasonable
Build platforms that empower application developers to interact with production in a self service manner
Build and maintain strong relationships with peers and partners to deliver results through collaboration
Develop and maintain processes, documentation, training, and other materials based on technology requirements and solutions
Engage in a culture of learning and innovation through hackathons, online course offerings, and employee-led special interest guilds
Be a part of Workday's people-first culture and experience the numerous benefits.
About You
Basic Qualifications
7+ years of SRE/DevOps or related experience
Enterprise experience in public cloud platform (ideally on AWS or GCP)
Experience configuring and using monitoring tooling (e.g., CloudWatch, Prometheus, etc.) for real-time monitoring and alerting
Established history of successfully handling complex customer issues and critical incidents
Other Qualifications
Degree in Computer Science or related field, or equivalent practical experience
Proficiency with at least one deployment automation / configuration management tool (e.g. Terraform, Jenkins, etc.)
Proven track record of building tooling, automation, and/or services in one or multiple languages (e.g., Python, Go, etc.)
Experience debugging and optimizing systems and code
Ability to architect, analyze, and support complex distributed systems
Passion to optimize and automate
Automating deployment, scaling, and management of containerized applications with Kubernetes
Strong organization and time management skills
Ability to take initiative, work independently and be dedicated to drive the product forward
Pursuant to applicable Fair Chance law, Workday will consider for employment qualified applicants with arrest and conviction records.
Workday is an Equal Opportunity Employer including individuals with disabilities and protected veterans.
Are you being referred to one of our roles? If so, ask your connection at Workday about our Employee Referral process!
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture AWS Computer Science DevOps Distributed Systems Engineering GCP Jenkins Kubernetes Python Terraform
Perks/benefits: Career development Transparency
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.