Data Scientist

114 16th Street Charlestown (Building 114), United States

Mass General Brigham

Mass General Brigham is an integrated healthcare system, uniting great minds to solve the hardest problems in medicine for our communities and the world.

View all jobs at Mass General Brigham

Apply now Apply later

Site: The General Hospital Corporation


 

At Mass General Brigham, we know it takes a surprising range of talented professionals to advance our mission—from doctors, nurses, business people and tech experts, to dedicated researchers and systems analysts. As a not-for-profit organization, Mass General Brigham is committed to supporting patient care, research, teaching, and service to the community.  We place great value on being a diverse, equitable and inclusive organization as we aim to reflect the diversity of the patients we serve.

At Mass General Brigham, we believe a diverse set of backgrounds and lived experiences makes us stronger by challenging our assumptions with new perspectives that can drive revolutionary discoveries in medical innovations in research and patient care. Therefore, we invite and welcome applicants from traditionally underrepresented groups in healthcare — people of color, people with disabilities, LGBTQ community, and/or gender expansive, first and second-generation immigrants, veterans, and people from different socioeconomic backgrounds – to apply.


 


 

Job Summary

GENERAL SUMMARY/ OVERVIEW STATEMENT: The Albers Lab is a research laboratory of the MassGeneral Institute for Neurodegenerative Disease (MIND) that develops and applies integrative computational methods in biomedical and brain research to develop new therapeutic strategies for neurodegenerative diseases. We are seeking a highly motivated, innovative, and independent Data Scientist to be part of our team. While the position is primarily computational, the successful candidate will be working with a highly interdisciplinary team of computational, clinical and bench researchers at the Mass General Hospital and Harvard Medical School providing bioinformatics support for translational projects. You will have the opportunity to analyze both internal data (cellular and iPSC-derived neuron models of TDP43+ ALS and Alzheimer’s disease and Related Dementias) and external datasets (MGH-ADRC, UK Biobank, NYGC, electronic health record data from MGB, UK, and ENACT enclave etc) to assess the human relevance of disease phenotypes and pharmacological rescue in our cellular, iPSC-derived neuron and mouse models of disease. The Data Scientist will also interact with lab members emulating clinical trials in electronic health records using federated learning of target trial emulation with collaborators conducting molecular dynamic simulation for drug discovery, and with medicinal chemists conducting in silico screening. The Data Scientist will lead the standardization of bioinformatic pipelines and data management of data being generated across the teams. Scientific insights resulting from this research are expected to be presented at external scientific conferences and published in high impact journals. PRINCIPAL DUTIES AND RESPONSIBILITIES: § Establishing and standardizing bioinformatics pipelines relevant to translational projects § Analysis of genomic, transcriptomic (scRNA/bulk RNA), proteomic internal and external data sets § Further develop DRIAD (Drug Repurposingin Alzheimer’s Disease) platform by incorporating TDP43 driven cryptic exon detection/staging § Development of TRIALS (Therapeutic Repurposing in ALS) platform by using machine learning to develop predictors of ALS disease progression. § Provide support for federated learning efforts to emulate clinical trials using electronic health records from US, Europe and Asia. § Prepare data packages for regulatory bodies, such as NIH and FDA. § Build and process input datasets for machine-learning models. § Write code using a collaborative version control system, ensuring proper documentation and reproducible workflows. § Work closely with other computational scientists, researchers and physicians to design and perform analyses. § Assist in preparation of manuscripts as well as abstracts and presentations for scientific meetings. § Lead harmonization of data schema across different data ecosytems and data management across groups


 

Qualifications

SKILLS/ABILITIES/COMPETENCIES REQUIRED: § Independent, highly motivated, and highly collaborative with the ability to work together with multi-disciplinary teams of computational and clinical researchers as well as laboratory biologists § Enthusiastic about working in a drug discovery and development centric scientific environment § Curious and quick learner, with a willingness to explore about new areas and build expertise, takes initiative to see your ideas implemented § Strong programming skills in R and Python are required. § Comfortable using Linux environments. § Experience with statistical analysis and databases is strongly preferred. § Experience in multi-omics datasets is required § Excellent organizational and communication skills – demonstrated ability to work well within multi-disciplinary teams. § Ability to work independently and take initiative when necessary. § Highly motivated and able to meet deadlines. § Knowledge of neuroscience is desirable. EDUCATION: Bachelor’s Degree required. PhD in Computer Science, Bioinformatics, or related quantitative discipline is preferred. EXPERIENCE: 0-2 years of experience in data science projects is required. SUPERVISORY RESPONSIBILITY: 1 Bachelor’s level Data Analyst. WORKING CONDITIONS: Work will be performed in a typical office setting with remote work as a possibility.


 

Additional Job Details (if applicable)

Additional Job Description


 

Remote Type

Hybrid


 

Work Location

114 16th Street


 

Scheduled Weekly Hours

40


 

Employee Type

Regular


 

Work Shift

Day (United States of America)


 

EEO Statement:

The General Hospital Corporation is an Affirmative Action Employer. By embracing diverse skills, perspectives and ideas, we choose to lead. All qualified applicants will receive consideration for employment without regard to race, color, religious creed, national origin, sex, age, gender identity, disability, sexual orientation, military service, genetic information, and/or other status protected under law. We will ensure that all individuals with a disability are provided a reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment.


 

Mass General Brigham Competency Framework

At Mass General Brigham, our competency framework defines what effective leadership “looks like” by specifying which behaviors are most critical for successful performance at each job level. The framework is comprised of ten competencies (half People-Focused, half Performance-Focused) and are defined by observable and measurable skills and behaviors that contribute to workplace effectiveness and career success. These competencies are used to evaluate performance, make hiring decisions, identify development needs, mobilize employees across our system, and establish a strong talent pipeline.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  7  3  0
Category: Data Science Jobs

Tags: Bioinformatics Computer Science Data management Drug discovery Linux Machine Learning Nonprofit PhD Pipelines Python R Research Statistics Teaching

Perks/benefits: Career development Conferences Health care

Regions: Asia/Pacific North America
Country: United States