Lead PySpark Developer (7233-1)
Owings Mills, Maryland, United States
Lumiere Systems
Job Description:
- 7+ years of experience in Amazon Web Service(AWS) Cloud Computing.
- 10+ years of experience in big data and distributed computing.
- Very Strong hands-on experience with PySpark, Apache Spark, and Python.
- Strong Hands on experience with SQL and NoSQL databases (DB2, PostgreSQL, Snowflake, etc.).
- Proficiency in data modeling and ETL workflows.
- Proficiency with workflow schedulers like Airflow.
- Hands on experience with AWS cloud-based data platforms.
- Experience in DevOps, CI/CD pipelines, and containerization (Docker, Kubernetes) is a plus.
- Strong problem-solving skills and ability to lead a team
- DBT, AWS Astronomer
- Lead the design, development, and deployment of PySpark-based big data solutions.
- Architect and optimize ETL pipelines for structured and unstructured data.
- Collaborate with Client, data engineers, data scientists, and business teams to understand requirements and provide scalable solutions.
- Optimize Spark performance through partitioning, caching, and tuning.
- Implement best practices in data engineering (CI/CD, version control, unit testing).
- Work with cloud platforms like AWS.
- Ensure data security, governance, and compliance.
- Mentor junior developers and review code for best practices and efficiency.
Additional Notes:
- Please submit the candidate's resume in PDF format.
- Please note that TCS does not consider former full-time employees (FTEs) for rehire. Additionally, individuals who have previously worked at TCS as contractors must observe a minimum waiting period of six months before being eligible for re-engagement.
MUST HAVE:
- 7+ years of experience in Amazon Web Service(AWS) Cloud Computing.
- 10+ years of experience in big data and distributed computing.
- Experience with PySpark, Apache Spark, and Python.
- Experience with SQL and NoSQL databases (DB2, PostgreSQL, Snowflake, etc.).
- Hands on experience with AWS cloud-based data platforms.
- Experience in DevOps, CI/CD pipelines, and containerization (Docker, Kubernetes) is a plus.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
0
0
0
Categories:
Engineering Jobs
Leadership Jobs
Tags: Airflow AWS Big Data CI/CD DB2 dbt DevOps Docker Engineering ETL Kubernetes NoSQL Pipelines PostgreSQL PySpark Python Security Snowflake Spark SQL Testing Unstructured data
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Data Engineer II jobsBI Developer jobsPrincipal Data Engineer jobsStaff Data Scientist jobsSr. Data Engineer jobsPrincipal Software Engineer jobsStaff Machine Learning Engineer jobsData Science Manager jobsDevOps Engineer jobsData Science Intern jobsSoftware Engineer II jobsData Manager jobsJunior Data Analyst jobsData Analyst Intern jobsLead Data Analyst jobsAccount Executive jobsStaff Software Engineer jobsData Specialist jobsBusiness Data Analyst jobsSenior Backend Engineer jobsAI/ML Engineer jobsBusiness Intelligence Analyst jobsData Governance Analyst jobsSr. Data Scientist jobsData Engineer III jobs
Consulting jobsAirflow jobsOpen Source jobsMLOps jobsLinux jobsKPIs jobsEconomics jobsJavaScript jobsTerraform jobsRDBMS jobsKafka jobsData Warehousing jobsNoSQL jobsGitHub jobsPostgreSQL jobsGoogle Cloud jobsComputer Vision jobsClassification jobsScikit-learn jobsPhysics jobsStreaming jobsHadoop jobsBanking jobsR&D jobsPrompt engineering jobs