Data Engineer
Boston
Manifold Bio
Our innovative technology combines protein barcoding and high-throughput in vivo capabilities to design drugs with more precise molecular testing.
Manifold Bio is a biotech company pursuing a pipeline of protein therapeutics using novel molecular measurement technologies and library-guided protein engineering. Our drug discovery engine is differentiated by massively parallel screening in vivo from the beginning of our discovery process. This unique platform is powered by a proprietary protein barcoding technology that allows multiplexed protein quantitation at unprecedented scale and sensitivity. We combine this and other high-throughput protein engineering approaches with computational design to create antibody-like drugs and other biologics. Our world-class team of protein engineers, biologists, and computational scientists is working together to aim the platform at therapeutic opportunities where precise targeting is the key to overcoming clinical challenges.
Manifold Bio is seeking an exceptional Data Engineer to join our team. This role will help lead the full life cycle of Manifold’s platform data, including modeling, design, coding, testing, and deployment of solutions across our scientific research effort. It will also help build standards-driven data integration and automation processes to manage the integrity and quality of Manifold platform data used for reporting and analytics. The Data Engineer will work closely with the Computational Team and wet-lab scientists to identify unmet data needs and implement novel solutions as Manifold grows. Expertise in the life sciences and experience with the Benchling Data Model are a plus. The ideal candidate will have implemented similar solutions in previous roles.
Responsibilities
- Work closely with Manifold’s Computational Team and wet-lab scientists to identify and deploy solutions to augment our ability to capture, store, and make decisions based on our data
- Create tools, models, algorithms and data pipelines to support novel data streams
- Create interfaces for researchers to access data without engineering support
- Present and report on data model and infrastructure updates to the team
- Own interfaces and integrations with partner services, including Benchling
Qualifications
- 5+ years of relevant programming experience (including Python)
- Demonstrated experience modeling and building data solutions (e.g., PostgreSQL)
- Experience developing, orchestrating and supporting ETL pipelines
- Cloud computing experience with Amazon Web Services (AWS)
- Experience with data profiling, data quality, master data management, metadata management
- Experience across multiple operating systems: Unix/Linux, Mac, and [tolerance of] Windows
- Detail-oriented with excellent problem identification and problem-solving skills
- Demonstrated ability to work both independently and as part of a team
- A deep passion for data modeling and developing new methods
- PREFERRED: master’s degree, project management experience, relevant certifications in data science or project management
- PREFERRED: experience working with Next Generation Sequencing (NGS) data
If you’re excited to build a platform that combines these technologies to revolutionize how protein therapeutic discovery happens, please apply!
We value different experiences and ways of thinking and believe the most talented teams are built by bringing together people of diverse cultures, genders, and backgrounds.
Tags: AWS Data management Data pipelines Data quality Drug discovery Engineering ETL Linux Pipelines PostgreSQL Protein engineering Python Research Testing