Sr Principal Data Software Engineer
Hyderabad (Office)
Novartis
Working together, we can reimagine medicine to improve and extend people’s lives.
Job Description Summary
The Data Engineer in Data Engineering architects and builds data-intensive products, platforms, pipelines, and systems, applying data engineering best practices and strengths in data ingestion, modeling, and extraction, and making heavy use of scalable data management systems.
Job Description
Where data is the primary challenge (the quantity of data, the complexity of data, or the speed at which it is changing), the role focuses on increasing business value, enhancing BR decision-making, and enabling a data-centric culture by:
designing and implementing data-intensive products, platforms, pipelines, and systems
sharing data-intensive expertise, tools, and techniques
implementing and improving the health, security, integrity, and quality of our interconnected data ecosystem
We are seeking a highly competent and versatile data engineer to join our team. The chosen candidate should possess the skills needed to prepare data for machine learning and statistical data science by cleaning, unifying, scaling, aligning and joining diverse and complex datasets. Your contributions as a data wrangler will be instrumental to our research, machine learning, and data science efforts.
Major Accountabilities:
Collaborate closely with data scientists and subject-matter experts to fulfill data needs.
Validate and ensure the accuracy and quality of data by cleaning, shaping, and sometimes analyzing, normalizing, and conforming it to existing models and vocabularies.
Identify and rectify data inconsistencies and irregularities.
Design data models and prepare data artifacts to effectively meet business needs.
Promote a culture of transparency and communication regarding data modifications, lineage, and definitions to all stakeholders.
Desirable Requirements:
Proven experience as a data wrangler, data analyst, or a similar role.
Demonstrable data processing expertise in Python, R, Bash, and other scripting tools.
Demonstrable data management expertise in relational, document, column and graph datastores.
Experience building ETL processes in high-performance environments like Databricks, AWS, Snowflake.
Experience with ML and data libraries such as scikit-learn, pandas, TensorFlow, and PyTorch.
Understanding of, and a working vocabulary for, common machine-learning concepts (training sets vs. test sets, over/under fitting, bias, annotations, feature extraction, RAG, LLMs, classifiers, and so on).
Excellent English-language oral and written communication skills.
Experience and familiarity with various data types, including images, tabular, unstructured, and text.
Habit of communicating pro-actively, asking questions, and seeking clarifications when necessary.
Essential Requirements:
BS in Computer Science, Informatics or similar, or equivalent practical experience
Fluent English
Why Novartis: Our purpose is to reimagine medicine to improve and extend people’s lives and our vision is to become the most valued and trusted medicines company in the world. How can we achieve this? With our people. It is our associates that drive us each day to reach our ambitions. Be a part of this mission and join us! Learn more here: https://www.novartis.com/about/strategy/people-and-culture
You’ll receive: You can find everything you need to know about our benefits and rewards in the Novartis Life Handbook. https://www.novartis.com/careers/benefits-rewards
Commitment to Diversity and Inclusion:
Novartis is committed to building an outstanding, inclusive work environment and diverse teams representative of the patients and communities we serve.
Join our Novartis Network: If this role is not suitable to your experience or career goals but you wish to stay connected to hear more about Novartis and our career opportunities, join the Novartis Network here: https://talentnetwork.novartis.com/network
Skills Desired
Algorithms, Computer Programming, Computer Science, Computer Vision, Data Science, People Management, Project Management, R&D (Research And Development)