Data Engineer

New Brunswick - NJ

Bristol Myers Squibb

Bristol Myers Squibb is a global biopharmaceutical company committed to discovering, developing and delivering innovative medicines to patients with serious diseases.

View all jobs at Bristol Myers Squibb

Apply now Apply later

Working with Us
Challenging. Meaningful. Life-changing. Those aren’t words that are usually associated with a job. But working at Bristol Myers Squibb is anything but usual. Here, uniquely interesting work happens every day, in every department. From optimizing a production line to the latest breakthroughs in cell therapy, this is work that transforms the lives of patients, and the careers of those who do it. You’ll get the chance to grow and thrive through opportunities uncommon in scale and scope, alongside high-achieving teams rich in diversity. Take your career farther than you thought possible.

Bristol Myers Squibb recognizes the importance of balance and flexibility in our work environment. We offer a wide variety of competitive benefits, services and programs that provide our employees with the resources to pursue their goals, both at work and in their personal lives. Read more: careers.bms.com/working-with-us.

Summary of Research Data Ecosystem Project

BMS (Bristol Myers Squibb)' mission is to discover, develop and deliver innovative medicines that help patients prevail over serious diseases. To accelerate our ability to serve patients around the world, we must unleash the power of technology.  We are committed to being at the forefront of transforming the way medicine is made and delivered by harnessing the power of computer and data science, artificial intelligence, and other technologies to promote scientific discovery, faster decision-making, and enhanced patient care. BMS Research IT is investing in a step-change in our ability to leverage data, knowledge, and prediction to find new high quality development candidates. This major undertaking will result in a unified Research Data Ecosystem that enables BMS' scientists, engineers, and decision-makers to embrace predictive AI/ML and data-driven business outcomes to improve quality, speed, and efficiency in discovering new medicines.

The Data Ecosystem initiative will build a next-generation, business-driven, and self serve data experience to provide a best-in-class data-centric environment for accelerating our predictive capabilities, Insilco research, and discovery insights. This effort will aggressively engineer our data at scale, creating numerous functional and operational data products acting as a single unified asset to unlock the value of our unique collection of data and predictions in real time.

The Data Ecosystem initiative will bifurcate across small-molecule and large molecule modalities, while maintaining cross-domain alignment.

The LARGE MOLECULE Data Ecosystem effort is seeking an IT Data Engineer to play a crucial role in data product and R&D data ecosystem development, bridging the gap between business stakeholders and technical teams.

This autonomous role will enable the project through:

  • Data & Process Mapping: Understand the workflows, processes, and datasets involved in the various business processes and their evolution in a data-centric environment. The Analyst will document data pipelines from the business processes needed to enable computational scientists & data scientists. Conduct interviews and analysis to elicit and document clear and comprehensive functional and non-functional requirements. This includes clear mapping of business rules, security, and data governance needs of each data product Owner & Producer.
  • Data Analysis and Modeling: Collaborate with information modelers and domain experts to understand data requirements, data sources, and data processing needs. Facilitate the creation of data models, data mapping, and data transformation specifications to ensure data integrity, quality, and availability. Translate business needs into comprehensive System to Target Mappings (STTM) for each data product - connecting the business insight & modeling needs to the specific data elements in and across systems and data sources. Furthermore, identify necessary process & system remediations to ensure successful data capture for robust data products.
  • Solution Design: Collaborate with digital capability managers, architects, and data engineers to ensure successful development of effective, usable, and trustworthy data products.
  • Maintain Agile & Growth Mindset: Support agile approaches and participate in lessons learned sessions and provide feedback to improve future data product development processes. Stay updated with industry trends, emerging technologies, and best practices related to data product development and analysis.

Qualifications & Experience:

  • Requires thorough knowledge of the principles and concepts of a discipline and developed knowledge of other related disciplines, typically gained through a university degree (preferred) and 4-6 years of experience. Bachelor’s degree in computer science, Software Engineering, or a related field (preferred); or equivalent work experience. Master or other advanced degree is a plus.
  • Strong Biologics domain knowledge and familiarity with the drug discovery process, including understanding of early-stage drug discovery assays, NGS sequencing and other data types, and scientific principles. Knowledge of relevant bioinformatics and predictive biological modeling is beneficial; Knowledge/experience in learning and/or applying AI/ML to drug discovery.
  • Ability in gathering, tracing, translating, and managing complex requirements, business rules, and data from varied stakeholders. Inclusive of mapping business processes, user stories, and functional and non-functional requirements. Strong attention to detail to ensure accuracy and completeness of requirements, documentation, and deliverables necessary.
  • Exceptional proficiency in analyzing data, deriving insights, and presenting findings. Skills in data modeling, statistical analysis, data visualization, or machine learning techniques are advantageous.
  • Ability to think critically and analytically to understand complex business and technical requirements. The analyst should be adept at breaking down problems, identifying patterns, and deriving insights from data.
  • Excellent oral and written communication skills including technical writing/documentation; organizes and presents ideas in a convincing way.
  • Exceptional interpersonal and outgoing personality skills; able to collaborate effectively with cross-functional teams, including data scientists, lab researchers, developers, and business stakeholders. Effective communication skills are crucial to establish rapport, manage expectations, and facilitate collaboration.
  • Agile and Growth Mindset; must possess a willingness to learn innovative technologies, methodologies, and scientific concepts related to early drug discovery. Ability to adapt to evolving project needs, embrace new challenges, and stay updated with industry trends and best practices.
  • Proven experience in designing, developing, and deploying data products on AWS using native tools and services.
  • Demonstrated experience with AWS cloud and data management technologies including Amazon S3, AWS Lamda, RDS, Redshift, DMS, Spark, Glue Sql, Hive, CDP Impala, familiarization with Parquet, JSON format, Amazon API Gateway.
  • Strong understanding of cloud-native architecture principles, microservices, and serverless computing.
  • Knowledge of security best practices in AWS, including identity and access management, encryption, and network security.
  • Knowledge of search technology & analytic tools such as R and Python; databases such as Redshift, RDS, and Oracle; and reporting tools such as Tableau and Spotfire.
  • Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems.
  • Experience in writing reusable complex Python/PySpark scripts for ELT, Business Logic OR APIs
  • Mastery of relational, NoSQL or NewSQL database systems
  • Expertise in working with unstructured, structured and semi-structured data
  • Experience in streaming data from SaaS/PaaS applications
  • Experience with DataOps and related set of practices, processes and technologies
  • Experienced in Data Migration and Data Integration
  • Strong communication skills and the ability to work effectively within a collaborative and self-organized product team.

#LI-Hybrid

If you come across a role that intrigues you but doesn’t perfectly line up with your resume, we encourage you to apply anyway. You could be one step away from work that will transform your life and career.

Uniquely Interesting Work, Life-changing Careers
With a single vision as inspiring as “Transforming patients’ lives through science™ ”, every BMS employee plays an integral role in work that goes far beyond ordinary. Each of us is empowered to apply our individual talents and unique perspectives in an inclusive culture, promoting diversity in clinical trials, while our shared values of passion, innovation, urgency, accountability, inclusion and integrity bring out the highest potential of each of our colleagues.

On-site Protocol

BMS has a diverse occupancy structure that determines where an employee is required to conduct their work. This structure includes site-essential, site-by-design, field-based and remote-by-design jobs. The occupancy type that you are assigned is determined by the nature and responsibilities of your role:

Site-essential roles require 100% of shifts onsite at your assigned facility. Site-by-design roles may be eligible for a hybrid work model with at least 50% onsite at your assigned facility. For these roles, onsite presence is considered an essential job function and is critical to collaboration, innovation, productivity, and a positive Company culture. For field-based and remote-by-design roles the ability to physically travel to visit customers, patients or business partners and to attend meetings on behalf of BMS as directed is an essential job function.

BMS is dedicated to ensuring that people with disabilities can excel through a transparent recruitment process, reasonable workplace accommodations/adjustments and ongoing support in their roles. Applicants can request a reasonable workplace accommodation/adjustment prior to accepting a job offer. If you require reasonable accommodations/adjustments in completing this application, or in any part of the recruitment process, direct your inquiries to adastaffingsupport@bms.com. Visit careers.bms.com/eeo-accessibility to access our complete Equal Employment Opportunity statement.

BMS cares about your well-being and the well-being of our staff, customers, patients, and communities. As a result, the Company strongly recommends that all employees be fully vaccinated for Covid-19 and keep up to date with Covid-19 boosters.

BMS will consider for employment qualified applicants with arrest and conviction records, pursuant to applicable laws in your area.

If you live in or expect to work from Los Angeles County if hired for this position, please visit this page for important additional information: https://careers.bms.com/california-residents/

Any data processed in connection with role applications will be treated in accordance with applicable data privacy policies and regulations.

Apply now Apply later
  • Share this job via
  • 𝕏
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0
Category: Engineering Jobs

Tags: Agile APIs Architecture AWS Bioinformatics Computer Science Data analysis Data governance Data management DataOps Data pipelines Data visualization Distributed Systems Drug discovery ELT Engineering Excel JSON Machine Learning Microservices NoSQL Oracle Parquet Pipelines Privacy PySpark Python R R&D Redshift Research Security Spark Spotfire SQL Statistics Streaming Tableau

Perks/benefits: Career development Startup environment

Region: North America
Country: United States

More jobs like this