Senior Machine Learning Ops Engineering
IN Bangalore, India
ResMed
ResMedin langattomat ratkaisut parantavat miljoonien uniapneaa ja muita hengitystiesairauksia sairastavien elämää – katso, mitä he voivat tehdä hyväksesi.Global Technology Solutions (GTS) at ResMed is a division dedicated to creating innovative, scalable, and secure platforms and services for patients, providers, and people across ResMed. The primary goal of GTS is to accelerate well-being and growth by transforming the core, enabling patient, people, and partner outcomes, and building future-ready operations.
The strategy of GTS focuses on aligning goals and promoting collaboration across all organizational areas. This includes fostering shared ownership, developing flexible platforms that can easily scale to meet global demands, and implementing global standards for key processes to ensure efficiency and consistency.
As a Sr. Machine Learning Ops Platform Engineer, you will be responsible for building automation and leading-edge architecture around Data and AI/ML engineering on an AWS & Kubernetes platform. Specifically, you will code and help architect a production-grade, scalable platform to be used by dozens of data scientists. You will help define and ensure best coding & CICD practices within the team of excellent and engaged engineers. You will be given creative freedom and work in supportive team environment. You will do hands-on code development and interact with business stakeholders.
Let's talk about responsibilities:
Build and maintain systems built using DevOps & LLM Ops (Kubernetes, Docker), AWS, Python and Terraform.
Stay informed of industry trends and enable successful DevOps/MLops AWS platform and Terraform solutions by leveraging best practices.
Participate in and set up Proof of Concepts (POCs) to demonstrate proposed solutions.
Enable team members through training, culture, and team building.
Identify, design, and implement internal process improvements: Automating manual processes, re-designing infrastructure for greater scalability, etc.
Build infrastructure needed for AWS platform, such as Lambdas, EC2, Docker, pipeline engineering, data monitoring alerting, and networking.
Experience in implementing observability stack like prometheus, loki, grafana/datadog
Build, Design, Implement and support Data & ML models pipelines using latest CICD and deployment technologies.
Work with stakeholders including the Executive, Product, Data and Design teams to help with technical issues and support their infrastructure needs.
Participate in Code Review and process improvement.
Let's talk about qualifications and experience:
7+ years of total experience in a complex, technical environment. Experience with developing production-grade code in Python, SQL & Pandas.
Experience with 3 or more of the following AWS tools: Lambda, EC2, EMR, S3, Glue, Athena, RDS, Networking, IAM, Batch processing, Sagemaker, Airflow
Experience or self-study in Terraform and, generally commonly used DevOps tools and techniques like Kubernetes , Docker, Github, Github Actions, Code pipeline & Jenkins.
Experience in Kubeflow & MLFlow would be a good advantage.
Experience in creating and working with CICD pipelines and APIs.
Experience with relational SQL and NoSQL databases. Snowflake experience is plus.
Experience working with AI/ML teams and cross-functional teams in a dynamic environment.
All listed duties, requirements and responsibilities are considered as essential functions to this position; however, business conditions may require reasonable accommodation for added tasks and responsibilities.
Let’s talk about what you can expect:A supportive environment that focuses on people development and best implementation
Opportunity to design, influence, and be innovative
Work with inclusive global teams and the open sharing of new ideas. We want your ideas!
Be supported both inside and outside of the work environment
The opportunity to build something meaningful and see a direct positive impact on people’s lives!
Dream big, iterate and experiment to drive innovation!!
Joining us is more than saying “yes” to making the world a healthier place. It’s discovering a career that’s challenging, supportive and inspiring. Where a culture driven by excellence helps you not only meet your goals, but also create new ones. We focus on creating a diverse and inclusive culture, encouraging individual expression in the workplace and thrive on the innovative ideas this generates. If this sounds like the workplace for you, apply now! We commit to respond to every applicant.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow APIs Architecture Athena AWS DevOps Docker EC2 Engineering GitHub Grafana Jenkins Kubeflow Kubernetes Lambda LLMOps LLMs Machine Learning MLFlow ML models MLOps NoSQL Pandas Pipelines Python SageMaker Snowflake SQL Terraform
Perks/benefits: Career development Flex hours
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.