Senior Data Engineer
US - Remote
Pattern Data
Who We Are
The Pattern Data platform was created for the rapid analysis of thousands of medical records. Fueled by AI, our platform efficiently reviews and categorizes documents in minutes, significantly reducing validation time by highlighting pertinent information crucial for litigation in medical analysis and personal injury cases. Our platform has been successfully implemented in major national settlements to process, adjudicate, and value claims.
We transform terabytes of unstructured data into real-time, indexed medical and legal knowledge bases. Our document processing pipeline is the industry’s fastest and uses Scala on AWS. We accomplish this at scale using the following core stack of technologies:
- Scala, TypeScript, PostgresQL, GraphQL, Elasticsearch, AWS
At Pattern, our team is built on a foundation of collaborative ownership, visionary problem-solving, customer-centric solutions, and authenticity. We’re looking for a Senior Data Engineer to join our growing team.
What You’ll Do
As a Senior Data Engineer at Pattern Data, you will be responsible for the design, maintenance, and ongoing enhancement of Pattern’s data analytics pipeline. You will:
- Design, develop, and maintain our data warehouse and ETL pipelines using technologies like Scala, PostgresQL, Airflow, and Python to ensure the accuracy, consistency, and reliability of our data
- Operate within Pattern’s data warehouse, writing queries against large volumes of structured and unstructured information in PostgreSQL
- Build tools to monitor the health and responsiveness of Pattern’s analytic products
- Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and deliver high-quality data solutions
What You’ll Bring
- Bachelor's degree in Mathematics, Economics, Data Science/Analysis, Computer Science, or a related field, or equivalent certifications
- Proven experience as a Data Engineer or similar role at an early stage startup
- 5 + years working with SQL data sources, preferably PostgreSQL, with demonstrated success for both OLTP and OLAP workloads
- Strong emphasis on data quality, monitoring, and transparency within reporting products
- Passion for using data to solve problems, uncover new insights and trace underlying problems
Ready to meet us?
Please apply directly through our website or Linkedin. We are excited to hear from you!
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS Computer Science Data Analytics Data quality Data warehouse Economics Elasticsearch ETL GraphQL Mathematics OLAP Pipelines PostgreSQL Python Scala SQL TypeScript Unstructured data
Perks/benefits: Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.