AWS Pyspark MDM Developer

South San Francisco, California, United States

Saama

Saama automates key clinical development and commercialization processes, with artificial intelligence (AI), machine learning (ML) and advanced-analytics, accelerating your time to market.

View all jobs at Saama

Apply now Apply later

  • Does solving complex business problems and real world challenges interest you?  Do you enjoy seeing the impact your contributions make on a daily basis?  Are you passionate about using data analytics to provide game changing solutions to the Global 2000 clients?  Do you thrive in a dynamic work environment that constantly pushes you to be the best you can be and more?  Are you ready to work with smart colleagues who drive for excellence in everything they do?  If you possess a solutions mindset, strong architecting skills, and commitment to be part of a tremendous journey, come join our growing, global team.  See what Saama can do for your career and for your journey.

     Saama Analytics has been on the forefront of data innovation for the last two decades and continues to offer cutting-edge data analytics solutions powered with big data, cloud, and AI/ML aptitudes for its customers in Life Sciences, Insurance, CPG, and other industries. Saama is committed to finding the best people because the innovations and discoveries that enabled together leads to better technologies, better treatments, and a better future.  Responsibilities:
    • Lead the design, implementation, and optimization of scalable data pipelines and architectures utilizing AWS Glue, Elastic MapReduce (EMR), Lambda, Redshift, Athena, DynamoDB, OpenSearch, and S3.
    • Use Spark on AWS for data transformation and processing across large datasets.
    • Develop and maintain efficient data workflows with SQS for task queueing and orchestration.
    • Integrate, transform, and manage data using Mulesoft for seamless data integration.
    • Ensure high-performance data storage, retrieval, and analytics across Redshift, DynamoDB, and Athena.
    • Oversee data consistency, integrity, and compliance through IQVIA MDM solutions.
    • Apply best practices in data governance, security, and scalability within a collaborative and cross-functional team environment.

    Qualifications:
    • Proven expertise in AWS data engineering, specifically with Glue, EMR, Lambda, Redshift, Athena, DynamoDB, OpenSearch, and S3.
    • Some experience with data integration (Mulesoft, Talend)
    • Working knowledge of master data management.
    • Demonstrated ability to lead technical projects and mentor data engineering teams.
    • Exceptional analytical and communication skills.
     Work EnvironmentThis job operates in a professional remote office environment. This role routinely uses standard office equipment, including but not limited to, computers, phones, and photocopiers.Physical DemandsThis position requires the frequent and repetitive use of a computer, keyboard, and mouse. Hand and finger dexterity is required.Other DutiesPlease note that this job description is not designed to cover or contain a comprehensive listing of activities, duties, or responsibilities required of the employee for this job. Duties, responsibilities, and activities may change at any time, with or without notice.EEO Saama Technologies, Inc. provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. 
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  0  0
Category: Engineering Jobs

Tags: Architecture Athena AWS AWS Glue Big Data Data Analytics Data governance Data management Data pipelines DynamoDB Engineering Lambda Machine Learning OpenSearch Pipelines PySpark Redshift Security Spark Talend

Perks/benefits: Insurance

Region: North America
Country: United States

More jobs like this