Co-op, Data Science

Cambridge, MA, United States

Biogen

Biogen is a leading global biotechnology company that pioneers science and drives innovations for complex and devastating diseases. Biogen is advancing a pipeline of potential therapies across neurology, neuropsychiatry, specialized immunology...

View all jobs at Biogen

Apply now Apply later

Job Description

This application is for a 6-month student role from January - June 2025. Resume review begins in October 2024.

Biogen’s Decision & Quality Analytics Innovation (DQAI) team is part of the RD&M, the vision of the DQAI team is to drive data-driven insights from trusted data. And the mission of the DQAIteam is to maximize the quality, efficiency and application of analytics, optimization opportunities, identification of compliance risk, and enhanced business analytics application. Specifically, the DQAI team has four major components: (1) Advanced analytics and simulation of portfolio data (Monte Carlo simulation, etc.); (2) Establish robust and scalable data management pipeline; (3) Apply Artificial Intelligence in different business domains (Data Science, Machine Learning, Statistical Analysis, etc.);  (4) Development of business intelligence tools (dashboards, websites, etc.) to transform data into actionable decisions.

Qualifications

In this role, you will work side-by-side with Biogen’s Data Scientists and Statisticians – you will have the opportunity to implement the latest methods from state-of-the-art (SOTA) research papers and get involved in the entire development lifecycle of the Artificial Intelligence (AI) products— from data ETL to model training, versioning, deploying, monitoring and validate models with feedback from subject matter experts. Below are some accountabilities of this role:

  • Collaborate closely with senior data scientists and statisticians to implement and deploy cutting-edge AI models
  • Facilitate clinical scenario simulation work
  • Develop and prototype data visualizations and dashboards
  • Conduct research works on the latest AI applications in Pharmaceutical areas
  • Engage with stakeholders to communicate key results to deliver predictive and prescriptive insights
  • Provide ad-hoc statistical and machine learning support to business partners

Example projects may include:

  • Develop explainable machine learning models and deploy them as interactive dashboards
  • Reproduce the latest methodologies from the top-tier machine learning research papers, apply them to Biogen’s internal data and use cases, and create comprehensive evaluation reports regarding the model performance and limitations
  • Create a simulation model for clinical programs

Qualifications:

Include the knowledge, skills, and abilities you may be seeking.

  • Demonstrated proficiency in at least one programming language (Python, R, etc)
  • Familiarity with concepts about NLP/NLG, topic modeling, text analytics, and text mining, and understanding of their mathematical foundations
  • Experience with Monte Carlo Simulation
  • Experience with NLP packages in Python, such as NLTK, spaCy, Gensim, etc.
  • Experience with deep learning frameworks, such as Pytorch, TensorFlow, HuggingFace
  • Ability to explore, discover and import data from multiple sources and make them ready for modeling with SQL and/or Pandas
  • Ability to communicate complex technical concepts in a clear and actionable manner
  • Willing to work in a collaborative environment to define a practical solution
  • Strong data visualization skills and experience with the Streamlit and/or Dash framework in Python is a plus
  • Experience with reproducing results from top-tier machine learning conferences is a plus
  • Familiarity with Github and Linux shell scripting in a cloud-based environment is a plus
  • Experience with Quality and Compliance data, Clinical Portfolio Data in the Pharmaceutical industry is a plus

To participate in the Biogen Internship Program, students must meet the following eligibility criteria:

  • Legal authorization to work in the U.S.
  • At least 18 years of age prior to the scheduled start date
  • Be currently enrolled in an accredited community college, college or university

Education

Currently pursuing a Master’s degree in Data Science, Statistics, Bioinformatics, Computer Science, Computational Biology, or related field

Additional Information

Why Biogen?

We are a global team with a commitment to excellence, and a pioneering spirit. As a mid-sized biotechnology company, we provide the stability and resources of a well-established business while fostering an environment where individual contributions make a significant impact. Our team encompasses some of the most talented and passionate achievers who have unparalleled opportunities for learning, growth, and expanding their skills. Above all, we work together to deliver life-changing medicines, with every role playing a vital part in our mission. Caring Deeply. Achieving Excellence. Changing Lives.

At Biogen, we are committed to building on our culture of inclusion and belonging that reflects the communities where we operate and the patients we serve. We know that diverse backgrounds, cultures, and perspectives make us a stronger and more innovative company, and we are focused on building teams where every employee feels empowered and inspired. Read on to learn more about our DE&I efforts.

All qualified applicants will receive consideration for employment without regard to sex, gender identity or expression, sexual orientation, marital status, race, color, national origin, ancestry, ethnicity, religion, age, veteran status, disability, genetic information or any other basis protected by federal, state or local law. Biogen is an E-Verify Employer in the United States.

 

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  9  1  0

Tags: Bioinformatics Biology Business Analytics Business Intelligence Computer Science Data management Data visualization Deep Learning ETL GitHub HuggingFace Linux Machine Learning ML models Model training Monte Carlo NLG NLP NLTK Pandas Pharma Python PyTorch R Research Shell scripting spaCy SQL Statistics Streamlit TensorFlow Topic modeling

Perks/benefits: Career development Conferences

Region: North America
Country: United States

More jobs like this