Senior Data Engineer
Philadelphia, PA
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
HealthVerity
HealthVerity applies data synchronization technologies with the nation's largest healthcare & consumer data ecosystem to advance the sciencePlease note: This is a hybrid role requiring 3 days in office at our Philadelphia HQ - 1818 Market Street.
How you will help
As a senior data engineer on the data platform team, you will be supporting and enhancing the platform that supports HealthVerity’s Petabyte-scale core data asset. You will work closely with other engineers, data scientists, and business leaders to ensure that our data platform is available, secure, and reliable. You will use your strong engineering and product mindset to understand business needs and develop scalable engineering solutions that support HealthVerity’s product roadmap and vision while continuously looking for opportunities to simplify, automate tasks, and build reusable components.
What you will do
- Engineer efficient, adaptable and scalable data pipelines to process structured and unstructured data
- Develop and maintain data pipelines to efficiently process and analyze large amounts of streaming data
- Collaborate with other data engineers to maintain a cohesive and standardized data infrastructure
- Work closely with the software engineering team to integrate data pipelines into the overall platform architecture
- Collaborate with cross-functional teams including software engineers, data scientists, product managers, and analysts to understand data needs and deliver valuable platform enhancements that support the overall HealthVerity vision and roadmap.
- Identify and implement solutions to optimize data storage, retrieval, and processing
- Continuously evaluate and improve data engineering processes and systems to increase efficiency and scalability
- Stay up-to-date with emerging technologies and industry trends in data engineering
- Ensure data security and compliance with privacy regulations
- Troubleshoot and resolve data-related issues in a timely manner
- Leverage large-scale distributed computing and serverless architecture including Spark, AWS Lambda, etc. to develop pipelines for transforming data
- Partner with the product teams to understand product goals and provide data that enables us to respond to customer and regulatory data requests
- Monitor data quality and proactively identify and resolve data issues
Our tech stack:
Our team leverages the following technologies in our day-to-day development process: Github (includes CI/CD Flow-GHA), Python, Postgres, AWS Cloud-native technologies (CDK, Lambda, S3, EMR, ECS, SQS, Eventbridge, AuroraDB, cloudwatch and more), Spark, Databricks (SQL, Delta Live Tables, Unity Catalog, Audit Logs, Workflow), Docker/Kubernetes, Airflow, Hive SQL, Infrastructure as Code (IaC) tools, such as Terraform, YAML, and Helm Charts
How Success is defined
- Lead the design and implementation of scalable data solutions
- Proactively identify and address data quality and compliance issues
- Share knowledge across teams
- Contribute to strategic decisions regarding data architecture and tooling
Required skills and experience
- You are proficient in at least one primary language (e.g., Java, Scala, Python) and Advanced SQL (any variant)
- You have experience with Databricks pipeline automation, AWS EMR, AWS S3 service, Snowflake, Spark, Docker
- You have 8+ years of industry experience and proficiency in building distributed data pipelines for both batch and real-time (experience with Spark, Hive, Iceberg, Kafka, Snowflake is helpful, but not strictly required)
- You have a product mindset to understand business needs and develop scalable engineering solutions
- You are always looking for opportunities to simplify, automate tasks, and build reusable components across multiple use cases and teams
- You have strong communication skills to collaborate with cross-functional partners and drive projects. You are curious and eager to work across a variety of engineering specialties (i.e., Data Science, Data Engineering, and Machine Learning to name a few)
- You have a strong knowledge of Databricks features and functionalities, such as Unity Catalog, Audit Logs, Databricks SQL and Delta Live Tables
- Experience with CI/CD pipelines and DataOps
- You have an eye for detail and like to spark joy amongst your partners with well-documented high-quality data products that are modeled and easy to understand
- You are able to successfully lead large, complex systems design and implementation challenges independently
- Experience using Infrastructure as Code (IaC) tools, such as Terraform, YAML, and Helm Charts
Base salary for the role is commensurate with experience and can range between $120,000 - 200,000 + annual bonus opportunity.
Hiring Locations
Our main office is located in Center City, Philadelphia, where we operate on a hybrid model with in-office work required three days a week for local employees. We believe collaboration is most effective when teams come together, which is why we prioritize hiring in the Philadelphia area.
For certain roles, we also hire from hub locations—regions where we have an established presence with multiple team members working remotely. While these employees primarily work from home, we bring them together in person at lease once a year for team-building, collaboration, and strategic planning.
Due to tax and labor regulations, we can only hire from specific states. Remote work is supported in the following key hub locations and approved states:
Hub Locations:
- Philadelphia, Pennsylvania
- Boston, Massachusetts
- New York City, New York
- Baltimore, Maryland
- Washington, D.C.
- Charlotte, North Carolina
- Raleigh-Durham, North Carolina
- Atlanta, Georgia
- Chicago, Illinois
Approved States for Remote Work:
CT, DE, FL, GA, IL, IN, MA, MD, MI, NC, NJ, NY, OH, PA, TN, and VA.
About HealthVerity
HealthVerity is the leader in privacy-protected real-world data exchange, transforming how healthcare and life sciences organizations connect and analyze disparate healthcare and consumer data. We continue to innovate HealthVerity Marketplace, the nation's first and largest real-world data ecosystem comprising more than 75 leading data providers and over 340 million US patients. Combined with Identity Manager, the industry's most accurate and efficient solution for patient identity, privacy and governance, we support critical applications in clinical development, commercial strategy, regulatory decision-making, population health, underwriting and more. HealthVerity has raised more than $140 million to date and works closely with its data providers, partners and clients to Synchronize the Science. To learn more about HealthVerity, visit healthverity.com.
Why you'll love working here
We are making a difference – Our technology is at the forefront of some of the biggest healthcare challenges in the world.
We are one team – Our people define our culture and always will. We take time out to celebrate each other, and acknowledge the value that each of us adds towards our greater mission. Come share all you have to offer.
We are learners – Every team member is continually learning, no matter if we've been in a role for one year or much longer. We are committed to learning and implementing what is best for our clients, partners, and each other.
Benefits & Perks
Our benefits package is thoughtfully designed to support and enrich the experience of our full-time employees, with eligibility limited to those in permanent positions.
- Compensation: competitive base salary & annual bonus opportunity (for non-commissioned roles)
- Benefits: We offer a 401(k) plan and stock options. Health, dental, and vision coverage start on day 1, while 401(k) eligibility and stock options follow soon after.
- Flexible location: Remote workdays and 3 days a week of in-office collaboration for team members in the Philadelphia area. Check location requirements with the recruiting team.
- Generous PTO: Take time off as needed, targeted at 4 weeks per year, including vacation, personal and sick time, plus paid parental leave.
- Parental Leave: 12 weeks paid leave for childbearing, surrogacy, and adoption; 6 weeks for non-childbearing parents.
- Comprehensive and individualized onboarding: mentorship program, departmental talks, and a library of resources are available beginning day 1 for each new team member to minimize the stress of starting a new job
- Professional development: biweekly 1:1s, hands-on leadership that is goal-and growth-oriented for each team member, and an annual budget to support professional development pursuits
We believe incorporating different ideas, perspectives and backgrounds make us stronger and encourages an environment where ageism, racism, sexism, ableism, homophobia, transphobia or any other form of discrimination are not tolerated. All qualified job applicants will be given consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or on the basis of disability. At HealthVerity, we’re working towards an innovative and connected future for healthcare data and believe the future is better together. We can only do that if everyone has a seat at the table.
If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to careers@healthverity.com
Remote opportunities are not available in all areas and require team members to work from a fixed location due to tax and labor law implications - specific questions about remote positions can be discussed during the interview process with your recruiter.
Tags: Airflow Architecture AWS CI/CD Databricks DataOps Data pipelines Data quality Docker ECS Engineering GitHub Helm Java Kafka Kubernetes Lambda Machine Learning Pipelines PostgreSQL Privacy Python Scala Security Snowflake Spark SQL Streaming Terraform Testing Unstructured data
Perks/benefits: Career development Competitive pay Equity / stock options Flex hours Flex vacation Health care Parental leave Salary bonus Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.