AWS Pyspark MDM Developer
South San Francisco, California, United States
Saama
Saama automates key clinical development and commercialization processes, with artificial intelligence (AI), machine learning (ML) and advanced-analytics, accelerating your time to market.Does solving complex business problems and real world challenges interest you? Do you enjoy seeing the impact your contributions make on a daily basis? Are you passionate about using data analytics to provide game changing solutions to the Global 2000 clients? Do you thrive in a dynamic work environment that constantly pushes you to be the best you can be and more? Are you ready to work with smart colleagues who drive for excellence in everything they do? If you possess a solutions mindset, strong architecting skills, and commitment to be part of a tremendous journey, come join our growing, global team. See what Saama can do for your career and for your journey.
Saama Analytics has been on the forefront of data innovation for the last two decades and continues to offer cutting-edge data analytics solutions powered with big data, cloud, and AI/ML aptitudes for its customers in Life Sciences, Insurance, CPG, and other industries. Saama is committed to finding the best people because the innovations and discoveries that enabled together leads to better technologies, better treatments, and a better future. Responsibilities:- Lead the design, implementation, and optimization of scalable data pipelines and architectures utilizing AWS Glue, Elastic MapReduce (EMR), Lambda, Redshift, Athena, DynamoDB, OpenSearch, and S3.
- Use Spark on AWS for data transformation and processing across large datasets.
- Develop and maintain efficient data workflows with SQS for task queueing and orchestration.
- Integrate, transform, and manage data using Mulesoft for seamless data integration.
- Ensure high-performance data storage, retrieval, and analytics across Redshift, DynamoDB, and Athena.
- Oversee data consistency, integrity, and compliance through IQVIA MDM solutions.
- Apply best practices in data governance, security, and scalability within a collaborative and cross-functional team environment.
Qualifications:- Proven expertise in AWS data engineering, specifically with Glue, EMR, Lambda, Redshift, Athena, DynamoDB, OpenSearch, and S3.
- Some experience with data integration (Mulesoft, Talend)
- Working knowledge of master data management.
- Demonstrated ability to lead technical projects and mentor data engineering teams.
- Exceptional analytical and communication skills.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture Athena AWS AWS Glue Big Data Data Analytics Data governance Data management Data pipelines DynamoDB Engineering Lambda Machine Learning OpenSearch Pipelines PySpark Redshift Security Spark Talend
Perks/benefits: Insurance
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.