Senior Machine Learning Engineer – Cloud Observability - Visa AI as Services
Austin, TX, United States
Visa
Visa digitaalinen ja mobiilimaksuverkko on eturintamassa uusien maksujen, sähköisten ja kontaktivarojen maksutekniikan, jotka muodostavat rahan maailmanCompany Description
Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose – to uplift everyone, everywhere by being the best way to pay and be paid.
Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa.
Job Description
Ready to make a global impact by industrializing AI?
Visa AI as a Service (AIaS) operationalizes the delivery of AI and decision intelligence to ensure their ongoing business values. Built with composable AI capabilities, privacy-enhancing computation, and cloud native platforms, AIaS powers and automates industrialization of data, models, and applications for predictive and generative AI. Combined with strong governance, AIaS optimizes the performance, scalability, interpretability and reliability of AI models and services. If you want to be in the exciting payment and AI space, learn fast, and make big impacts, Visa AI as a Service is an ideal place for you!
This role is for a Sr. ML Engineer – Cloud Observability. We are seeking for a talented professional with a solid background in public cloud and AI/ML production systems. This role offers ample opportunities for learning and growth, and the chance to be part of delivering the next big thing for our AI as Services team.
Key Responsibilities:
Implement and Maintain Cloud Observability Solutions: Build and maintain monitoring, logging and tracing systems (E.g. Prometheus, Grafana, Druid, ELK Stack) for cloud-native AI services on AWS/Azure/GCP. Partner with data engineers and data scientists to embed observability into ML workflows and ensure real-time insights.
Collaborate on AI Model Monitoring: Work closely with data scientists and product owners to design and implement observability solutions for monitoring AI/ML model performance (e.g. accuracy, latency, data drift) in production. Develop dashboards and alerts to detect anomalies, model degradation, or bias, ensuring alignment with business SLAs.
Automate Devops Practices: Develop tools for automated deployment, alerting and incident response using CI/CD pipelines like Jenkins and Github flows and infrastructure as code like Terraform.
Document & Reporting: Create and maintain clear documentation for observability processes and best practices. Generate reports to track system health and performance trends for business and technology stakeholders.
Incident Response: Assist in diagnosing and troubleshooting issues by analyzing metrics, logs and performance data and collaborate with cross functional teams to improve system level observability from the learning.
Stay Ahead of Trends: Explore emerging cloud and observability technologies to drive innovation.
If you are passionate about observability, cloud technology, AI, and machine learning, and are excited about making a significant impact, we would love to hear from you.
This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager.
Qualifications
Basic Qualifications:
- 2 or more years of work experience with a Bachelor’s Degree or an Advance Degree (e.g. Masters, MBA, JD, MD).
Preferred Qualifications:
- 3 or more years of work experience with a Bachelor’s Degree or more than 2 years of work experience with an Advanced Degree (e.g. Masters, MBA, JD, MD).
- Strong development experience in one or more the following programming languages: Java, Go, Rust, C++.
- 2 years of related experience with AWS, GCP, or Azure, preferably in an AI/ML production environment.
- Experience with one of the following: Prometheus, Grafana, Druid, ELK Stack- highly preferred.
- Experience in observability eco-system highly preferred.
Additional Information
Work Hours: Varies upon the needs of the department.
Travel Requirements: This position requires travel 5-10% of the time.
Mental/Physical Requirements: This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, frequently operate standard office equipment, such as telephones and computers.
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.
Visa will consider for employment qualified applicants with criminal histories in a manner consistent with applicable local law, including the requirements of Article 49 of the San Francisco Police Code.
U.S. APPLICANTS ONLY: The estimated salary range for a new hire into this position is 116,500.00 to 164,500.00 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for bonus and equity. Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401 (k), FSA/HSA, Life Insurance, Paid Time Off, and Wellness Program.
Tags: AWS Azure CI/CD DevOps ELK GCP Generative AI GitHub Grafana Java Jenkins Machine Learning Pipelines Privacy Rust Terraform
Perks/benefits: Career development Equity / stock options Health care Insurance Salary bonus Wellness
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.