Data Engineer (Boston, USA)
Boston, Massachusetts, United States
NextGen Invent Corporation
Department: Data Engineering
Experience: 5+ Years
Job Location: Hybrid /Boston, MA
No. of Position: Multiple
Qualifications: Undergrad or Higher
Work Timings: 8:00 AM - 5:00 PM EST
Job Description:
We are seeking an experienced Data Engineer with 5+ years of hands-on experience in building scalable data pipelines and data ingestion frameworks. The ideal candidate must have a strong command of Python, excellent skills in data pipeline creation, and a deep understanding of healthcare data systems. Experience with Databricks and/or Snowflake is highly desirable.
Key Responsibilities:
- Design, develop, and maintain robust data pipelines to ingest, transform, and store data from multiple sources.
- Build efficient and scalable ETL processes using Python, PySpark, and SQL.
- Implement and optimize data workflows on AWS, Azure, or hybrid cloud environments.
- Leverage Databricks, Snowflake, and/or Azure Data Factory for advanced data engineering solutions.
- Collaborate with data scientists, analysts, and other engineers to ensure seamless data access and integration.
- Ensure healthcare data security, compliance, and governance best practices are embedded into solutions.
- Identify bottlenecks in data ingestion and recommend optimizations.
- Develop automated solutions for data validation, monitoring, and reporting.
- Stay current with evolving data engineering practices and tools, particularly in the healthcare sector.
Required Skills and Experience:
- Bachelors degree in Computer Science, Engineering, or a related technical field.
- Minimum of 5 years of experience in data engineering roles, with at least 2 years working with healthcare data (mandatory).
- Strong proficiency in Python and SQL for data engineering tasks.
- Solid experience with data ingestion, ETL/ELT pipeline creation, and large-scale data integration.
- Experience with web scraping and data gathering from third-party sources is a plus
- Familiarity with healthcare interoperability standards like HL7, FHIR, etc. is a plus
- Exposure to automation tools like UI Path or Power Automate is a plus.
- Familiarity with cloud platforms such as AWS or Azure.
- Hands-on experience with Databricks and/or Snowflake (preferred).
- Experience with PySpark for large data processing.
- Deep understanding of HIPAA compliance and healthcare-specific data regulations.
- Excellent analytical, troubleshooting, and communication skills.
- This position does not offer visa sponsorship; applicants must have valid authorization to work in the U.S.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: AWS Azure Computer Science Databricks Data pipelines ELT Engineering ETL HL7 Pipelines PySpark Python Security Snowflake SQL
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.