Data Engineer

Boston

Full Time Mid-level / Intermediate USD 107K - 200K *

Manifold Bio

Our innovative technology combines protein barcoding and high-throughput in vivo capabilities to design drugs with more precise molecular testing.

View all jobs at Manifold Bio

Apply now Apply later

Posted 12 hours ago

Manifold Bio is a biotech company pursuing a pipeline of protein therapeutics using novel molecular measurement technologies and library-guided protein engineering. Our drug discovery engine is differentiated by massively parallel screening in vivo from the beginning of our discovery process. This unique platform is powered by a proprietary protein barcoding technology that allows multiplexed protein quantitation at unprecedented scale and sensitivity. We combine this and other high-throughput protein engineering approaches with computational design to create antibody-like drugs and other biologics. Our world-class team of protein engineers, biologists, and computational scientists are working together to aim the platform at therapeutic opportunities where precise targeting is the key to overcoming clinical challenges.

Manifold Bio is seeking an exceptional Data Engineer to join our team. This role will help lead the full life cycle of Manifold’s platform data including modeling, design, coding, testing and deployment of solutions across our scientific research effort. This role will help build standard-driven data integration and automation processes to manage the integrity and quality of Manifold platform data used for reporting and analytics. This role will work closely with the Computational Team and other wet-lab scientists to identify unmet data needs and implement novel solutions as Manifold grows. Expertise in the life sciences and experience with the Benchling Data Model is a plus. The ideal candidate will have experience implementing solutions from previous roles.

Responsibilities

Work closely with Manifold’s Computational Team and wet-lab scientists to identify and deploy solutions to augment our ability to capture, store, and make decisions based on our data
Create tools, models, algorithms and data pipelines to support novel data streams
Create interfaces for researchers to access data without engineering support
Present and report on data model and infrastructure updates to the team
Own interfaces and integrations with partner services, including Benchling

Qualifications

5+ years of relevant programming experience (including Python)
Demonstrated and proven experience modeling and building data solutions (e.g. Postgresql)
Experience developing, orchestrating and supporting ETL pipelines
Cloud computing experience with Amazon Web Services (AWS)
Experience with data profiling, data quality, master data management, metadata management
Experience across multiple operating systems: Unix/Linux, Mac, and [tolerance of] Windows
Detail-oriented with excellent problem identification and problem-solving skills
Demonstrated ability to work both independently and as part of a team
A deep passion for data modeling and developing new methods
PREFERRED: master’s degree, project management experience, relevant certs in data science or project management
PREFERRED: experience working with Next Generation Sequencing (NGS) data

If you’re excited to build a platform that combines these technologies to revolutionize how protein therapeutic discovery happens, please apply!

We value different experiences and ways of thinking and believe the most talented teams are built by bringing together people of diverse cultures, genders, and backgrounds.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 0 0 0

Category: Engineering Jobs

Tags: AWS Data management Data pipelines Data quality Drug discovery Engineering ETL Linux Pipelines PostgreSQL Protein engineering Python Research Testing