Product Engineer, Clinical Data Repository

BE009 Turnhoutseweg 30, Belgium

Johnson & Johnson

At Johnson & Johnson, we believe health is everything. Our strength in healthcare innovation empowers us to build a world where complex diseases are prevented, treated, and cured, where treatments are smarter and less invasive, and solutions are personal. Through our expertise in Innovative Medicine and MedTech, we are uniquely positioned to innovate across the full spectrum of healthcare solutions today to deliver the breakthroughs of tomorrow, and profoundly impact health for humanity. Learn more at https://www.jnj.com

Job Function:

Technology Product & Platform Management

Job Sub Function:

Technical Product Management

Job Category:

Scientific/Technology

All Job Posting Locations:

Beerse, Antwerp, Belgium

Job Description:

Johnson & Johnson Technology is currently recruiting for a Product Engineer, Clinical Data Repository. This position will be based in Raritan, NJ; Titusville, NJ; Horsham, PA; or Beerse, Belgium.

Please note that this role is available across multiple countries and may be posted under different requisition numbers to comply with local requirements. While you are welcome to apply to any or all of the postings, we recommend focusing on the country or countries that align with your preferred location(s):

Belgium - Requisition Number: R-017761

US - Requisition Number: R-016682

Within J&J Innovative Medicine, the R&D Business Technology team is the strategic information technology partner providing innovative technology solutions that enable Global Development to authoritatively deliver on our portfolio and help provide transformative medicines to patients around the world while proactively improving agility, embracing innovation, and crafting the organization we need to deliver in the future.

R&D Business Technology's Clinical Data Management & Analysis is currently seeking a Product Engineer, Clinical Data Repository with extensive Databricks experience. This role will combine leadership aspects, such as defining user-centric product visions, roadmaps, and prioritization, with collaboration with business partners, including the assessment and prioritization of new features and improvements. It will also include hands-on aspects: designing, developing, optimizing, and operating the platform using Databricks and related technologies, ensuring that our data architecture meets business needs and adheres to our engineering standards, particularly in the context of clinical data.

Key Responsibilities:

Technical Leadership & Expertise:

  • Define and lead all aspects of the overall data engineering architecture and roadmap using Databricks and modern data engineering practices.
  • Take accountability for the overall quality of the engineering team's work within the product, and ensure the team continually improves performance.
  • Mentor team members in solving technical challenges related to data pipelines and analytics through hands-on guidance and support.
  • Along with the Technical Product Owner, define and communicate to key partners the product vision and strategy, including new features and improvements within the Clinical Data Repository product.

Databricks Implementation:

  • Design and implement data pipelines using Databricks for ETL processes, leveraging Spark for large-scale data processing and transformation.
  • Develop Delta Lake tables and optimize data storage and query performance within the Databricks environment.
  • Create and maintain Databricks notebooks for data exploration, analytics, and machine learning model development, ensuring standard methodologies for documentation and collaboration.

Performance Optimization:

  • Drive monitoring and optimization of the performance of Databricks jobs, ensuring efficient execution of data pipelines.
  • Analyze and fine-tune data processing workflows to reduce runtime and enhance performance.
  • Develop strategies for optimizing Spark configurations and cluster resources for cost-effective and high-performance data processing.

Platform Operations:

  • Ensure the stability and reliability of the Clinical Data Repository platform, managing cluster configuration, scaling, and resource allocation to meet workload demands.
  • Fix and resolve operational issues within the Databricks environment, collaborating with DevOps teams for seamless integration and deployment.
  • Supervise monitoring systems to ensure stability, performance, and availability of data solutions, proactively addressing system issues and bottlenecks.
  • Handle technical debt and continuously seek opportunities for improvement in clinical data processes and architecture.
  • Assist in data migration strategies from legacy clinical systems, ensuring compliance with business rules and data governance standards.

Collaboration & Communication:

  • Work closely with clinical programmers, medical reviewers, data managers, central monitors, data scientists, and business customers to understand clinical data needs and collaborate on analytics projects.
  • Facilitate communication between technical and non-technical teams to ensure alignment on data solutions and project goals.

Development & Problem-Solving:

  • Effectively engage in the development of data pipelines and workflows within Databricks, supporting code reviews and technical discussions.
  • Drive testing and deployment of data solutions, ensuring automated testing measures are in place for quality assurance.
  • Scale proof-of-concept projects into production environments, ensuring performance, reliability, and maintainability.

Qualifications:

Required:

  • Bachelor’s degree or higher in Computer Science, Engineering, Mathematics, or a related field is required.
  • At least seven (7) years of relevant IT experience, with a strong focus on data engineering and clinical technology.
  • Proven experience with Databricks and Spark in building data pipelines and analytics solutions.
  • Extensive hands-on experience with cloud services such as AWS or Azure in relation to data solutions.
  • Strong SDLC foundations in Agile methodologies and experience in collaborative development environments.
  • Experience with programming languages such as Python, SQL, or Scala, including participation in code reviews.
  • Excellent analytical and problem-solving skills, with a history of delivering data solutions in enterprise settings.
  • Effective communication and critical thinking skills, creative and flexible problem solving, and process focus.
  • Good interpersonal skills; ability to convey technical information clearly to team members.
  • Proven written and verbal communication skills with the ability to present a strong rationale for key decisions to partners, including senior leaders.

Preferred:

  • Knowledge of CDISC standards (SDTM, ADaM) is strongly preferred.
  • In-depth knowledge of clinical data management, including familiarity with clinical data regulations and systems.
  • Experience in implementing AI/ML techniques within data engineering workflows is advantageous.

Other:

Up to 10% domestic travel may be required.

Tags: Agile Architecture AWS Azure CDISC Computer Science Databricks Data governance Data management Data pipelines DevOps Engineering ETL Machine Learning Mathematics ML models Pipelines Python R R&D Scala SDLC Spark SQL Testing

Perks/benefits: Career development

Region: Europe
Country: Belgium
