Sr. Data Engineer (Python, SQL, ETL)
Bengaluru, India
Visa
Visa's digital and mobile payment network is at the forefront of new payment technologies, including electronic and contactless payments, that shape the world of money.
Company Description
Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose – to uplift everyone, everywhere by being the best way to pay and be paid.
Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa.
Job Description
Team Summary
The Risk and Identity Solutions (RaIS) team provides risk management services for banks, merchants, and other payment networks. Machine learning and AI models are the heart of the real-time insights used by our clients to manage risk. These models are created by the Visa Predictive Models (VPM) team, and their continual improvement and efficient deployment are essential for our future success. To support our rapidly growing suite of predictive models, we are looking for engineers who are passionate about managing large volumes of data, creating efficient, automated processes, and standardizing ML/AI tools.
This is a great opportunity to work with a new Engineering and MLOps team to scale and structure large-scale data engineering and ML/AI that drives significant revenue for Visa. As a member of the Risk and Identity Solutions modeling organization (VPM), your role will involve developing and implementing practices that will allow deployment of machine learning models in large data science projects.
You must be a hands-on expert able to navigate both data engineering and data science disciplines to build effective engineering solutions that support ML/AI models. You will partner closely with global stakeholders in RaIS Product, VPM Data Science and Visa Research to help create and prioritize our strategic roadmap. You will then leverage your expert technical knowledge of data engineering, tools and data architecture in the design and creation of the solutions on our roadmap.
The position is based at Visa's offices in Bangalore, India.
Essential functions
Team members in this role deploy, manage, and optimize data pipelines and machine learning models in production environments, ensuring smooth integration and efficient operations. The team also takes a data scientist’s model and makes it accessible to the software that uses it. The essential functions of this role include:
- ETL processes: The role involves developing and executing large-scale ETL processes to support data quality, reporting, data marts, and predictive modeling.
- Spark pipelines: The role requires building and maintaining efficient and robust Spark pipelines to create and access data sets and feature stores for ML models.
- Distributed computing: This role involves developing distributed applications.
- Performance optimization: This role involves substantial performance optimization of existing data pipelines developed in Spark or other distributed frameworks.
- Infrastructure management: This role involves managing VPM ML platform infrastructure, datasets, data governance, application asset management, and migrations.
- Data pipeline orchestration: The role involves working with tools such as Apache Airflow and Control-M to deploy and manage data workflows, and may involve developing custom tools for effective data pipeline orchestration.
- Collaboration with Technology teams: The role involves working with Data Science teams, Visa Research, and other Technology teams to leverage and provide feedback on ML systems and tools.
- Technical documentation and innovation: The role requires defining and building technical and data documentation, using code version control systems, ensuring data accuracy and consistency, and suggesting new ideas for innovation.
This is a hybrid position. Hybrid employees can alternate time between remote and office work. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.
Qualifications
Basic Qualifications:
• 3+ years of relevant work experience and a Bachelor’s degree, OR 5+ years of relevant work experience
• Working knowledge of the Hadoop ecosystem and associated technologies (e.g. HDFS, MapReduce, YARN, Spark, Kafka, MLlib, GraphX, IPython, scikit-learn, Pandas, etc.)
Preferred Qualifications:
• 3+ years of work experience with a Bachelor’s Degree or more than 2 years of work experience with an Advanced Degree (e.g. Masters, MBA, JD, MD)
• Experience working in a Linux/Unix environment and exposure to command-line utilities
• Strong development experience in one or more of the following: Golang, Java, Python, Rust, and C/C++
• Hands-on experience working with large-scale data ingestion, processing, and storage in the Hadoop ecosystem
• Experience with complex, high-volume, multi-dimensional data, as well as machine learning models based on unstructured, structured, and streaming datasets
• Experience in writing and optimizing SQL queries in big data environments
• Experience creating and supporting production software and systems, with expertise in resolving performance bottlenecks for production systems
• Experience working with scheduling tools (Airflow, Control-M) and building data processing orchestration workflows
• Strong written, verbal, and interpersonal skills needed to effectively communicate technical insights and recommendations to business customers and the leadership team
• Experience working with technology and business teams on Data Governance, Data Quality, and Data Architecture initiatives
Additional Information
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.