GCP Data Engineer
Aguascalientes, AGU, MX
Capgemini
A global leader in consulting, technology services and digital transformation, we offer an array of integrated services combining technology with deep sector expertise.RH: Raul García
Position: GCP Data Engineer
Location: Aguascalientes, Mexico
Industry - Sector: MALS
What you’ll do?
- Build, maintain, and optimize end-to-end ETL pipelines using Google Cloud Dataflow, Apache Beam, Cloud Composer, and Cloud Functions.
- Automate data ingestion, transformation, and integration processes from multiple sources (structured and unstructured data) into cloud-based data platforms.
- Design and implement scalable cloud-based data architectures using services like Google BigQuery, Google Cloud Storage (GCS), Cloud Pub/Sub, and Cloud Dataproc.
- Architect data solutions that are reliable, performant, and cost-effective for large-scale data processing and storage needs.
- Build and manage data lakes using Google Cloud Storage and ensure data is properly ingested, stored, and processed in an organized and secure manner.
- Implement Google BigQuery data warehouses, optimize data models for performance, and ensure they are scalable to handle large datasets.
- Leverage Google Cloud Dataflow or Apache Beam to transform data into formats suitable for reporting and machine learning models.
- Integrate diverse data sources such as relational databases, NoSQL databases, APIs, and flat files into the data pipeline architecture.
- Automate the deployment of data pipelines, processes, and infrastructure using Google Cloud Deployment Manager, Terraform, or Cloud Composer.
- Design and manage CI/CD pipelines for efficient deployment of data engineering solutions.
- Implement best practices for data governance, ensuring data privacy and compliance with security standards (e.g., encryption, data masking, and access control).
- Manage permissions and security with Google IAM, and ensure that sensitive data is securely handled and stored.
- Continuously monitor the performance of data pipelines and storage solutions, identifying opportunities for optimization and cost reduction.
- Troubleshoot and resolve data integration and pipeline issues to ensure smooth and efficient data processing.
- Collaborate with cross-functional teams (data scientists, analysts, business stakeholders) to understand data requirements and deliver high-quality data solutions.
- Provide support for data-related questions, troubleshooting, and assist with data visualization and reporting tasks.
- Document data engineering processes, including pipeline architectures, configurations, and best practices.
- Prepare and deliver regular updates on data engineering projects and their progress.
What you’ll bring:
- Expertise in Google Cloud BigQuery, Google Cloud Storage (GCS), Cloud Dataflow, Cloud Pub/Sub, Google Cloud Dataproc, Cloud Composer, Cloud Functions, and Cloud Bigtable.
- Experience in building ETL pipelines using Cloud Dataflow, Apache Beam, Google Cloud
- Ability to integrate structured and unstructured data sources into cloud-based systems.
- Strong experience working with Google BigQuery for building and managing data warehouses and running optimized analytical queries.
- Expertise in data ingestion, transformation, and integration using GCP tools, including Google Cloud Pub/Sub, Cloud Dataflow, and BigQuery.
- Proficiency in Python, SQL, and JavaScript for data engineering tasks, scripting, and automation.
- Experience with Apache Spark and PySpark is a plus for handling large-scale data processing.
- Experience with Google Cloud Deployment Manager, Terraform, or Cloud Composer to automate infrastructure and data pipelines.
- Strong understanding of Google IAM, data encryption, and secure data access practices.
- Proficiency in Git for version control and collaborative development.
Soft skills:
Work Underpressure, Quality at work, Results Oriented
What can YOU expect in a career with Capgemini?
- Working in a team environment, Consultants will focus on the analysis, design and development of technology-based solutions for Capgemini’s clients.
- You will work alongside technical, functional and industry specialists to assist with the development, implementation and integration of innovative system solutions including methods, techniques and tools.
- You will contribute to client satisfaction by providing timely and responsive value-added services and work products.
- Capgemini offers a competitive compensation and benefits package.
- Headquartered in Paris, France, Capgemini has a presence of more than 340 thousand professionals in Mexico distributed among 3 sites located in Mexico City, Monterrey and Aguascalientes. A deeply multicultural organization.
- Capgemini has developed its own way of working, the Collaborative Business ExperienceTM, and draws on Rightshore, its worldwide delivery model.
You will love this job because
- Capgemini focuses on giving each new hire a YOU-nique experience through our recruitment process and on-boarding program, as well as by helping you to build your own career and professional skills foundation.
- Capgemini provides a collaborative environment that embodies and holds the following stated values close to heart: Honesty, Boldness, Trust, Freedom, Team Spirit, Modesty, and Fun.
- Capgemini cultivates an atmosphere for development that enables YOU to be hands-on, planning for your growth, both horizontally and vertically.
“At Capgemini Mexico, we aim to attract the best talent and are committed to creating a diverse and inclusive work environment, so there is no discrimination based on race, sex, sexual orientation, gender identity or expression, or any other characteristic of a person. All applications welcome and will be considered based on merit against the job and/or experience for the position”
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Architecture BigQuery Bigtable CI/CD Dataflow Data governance Data pipelines Dataproc Data visualization Engineering ETL GCP Git Google Cloud JavaScript Machine Learning ML models NoSQL Pipelines Privacy PySpark Python RDBMS Security Spark SQL Terraform Unstructured data
Perks/benefits: Career development Competitive pay
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.