Lead Data Engineer - AI/ML - R01551402
Chicago, Illinois, United States
Brillio
From data ingestion and transformation to advanced analytics and visualization, we provide end-to-end solutions to help you drive business growth.Primary Skills
- Athena, SNS, SQS, CloudWatch, Macie, Spark - Scala, Kinesis, CloudFormation, EMR, Open Search, DynamoDB, Amazon API Gateway, SCT, Redshift, DMS, Oozie
Job requirements
- Location: Chicago, IL - Hybrid with 2-3 days on-site in a week
- Design and implement scalable ETL/ELT pipelines using AWS Glue, Spark (PySpark), and Step Functions.
- Work with structured and semi-structured data using Athena, S3, and Lake Formation to enable efficient querying and access control.
- Develop and deploy serverless data processing solutions using AWS Lambda and integrate them into pipeline orchestration.
- Perform advanced SQL and PL/SQL development for data transformation, analysis, and performance tuning. Build data lakes and data warehouses using S3, Aurora, and Athena.
- Implement data governance, security, and access control strategies using AWS tools including Lake Formation, CloudFront, EBS/EFS, and IAM. Develop and maintain metadata, lineage, and data cataloging capabilities.
- Participate in data modeling exercises for both OLTP and OLAP environments.
- Work closely with data scientists, analysts, and business stakeholders to understand data requirements and deliver actionable insights. Monitor, debug, and optimize data pipelines for reliability and performance.
- Strong experience with AWS data services: Glue, Athena, Step Functions, Lambda, Lake Formation, S3, EC2, Aurora, EBS/EFS, CloudFront.
- Proficient in PySpark, Python, SQL (basic and advanced), and PL/SQL.
- Solid understanding of ETL/ELT processes and data warehousing concepts.
- Familiarity with modern data platform fundamentals and distributed data processing.
- Experience in data modeling (conceptual, logical, physical) for analytical and operational use cases.
- Experience with orchestration and workflow management tools within AWS.
- Strong debugging and performance tuning skills across the data stack.
kill sets Must have : Python, SQL/PLSQL, AWS,Postgresql,S3, Glue Good to have : CDK, GitHub Job Description We are looking for an experienced AWS Lead Data Engineer to design, build, and manage robust, scalable, and high-performance data pipelines and data platforms on AWS. The ideal candidate will have a strong foundation in ETL fundamentals, data modeling, and modern data architecture, with hands-on expertise across a broad spectrum of AWS services including Athena, Glue, Step Functions, Lambda, S3, and Lake Formation.
Key Responsibilities:
Required Skills & Experience:
Know more about:DAI: https://www.brillio.com/services-data-analytics/Know what it’s like to work and grow at Brillio: https://www.brillio.com/join-us/ Equal Employment Opportunity DeclarationBrillio is an equal opportunity employer to all, regardless of age, ancestry, colour, disability (mental and physical), exercising the right to family care and medical leave, gender, gender expression, gender identity, genetic information, marital status, medical condition, military or veteran status, national origin, political affiliation, race, religious creed, sex (includes pregnancy, childbirth, breastfeeding, and related medical conditions), and sexual orientation. #LI-SR1
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Architecture Athena AWS AWS Glue CloudFormation Data governance Data pipelines Data Warehousing DynamoDB EC2 ELT ETL GitHub Kinesis Lake Formation Lambda Machine Learning OLAP Oozie Pipelines PostgreSQL PySpark Python Redshift Scala Security Spark SQL Step Functions
Perks/benefits: Medical leave
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.