IT Data Analyst
TARRYTOWN, United States
Regeneron
Discover how Regeneron (NASDAQ: REGN) harmonizes biology and technology to create life-changing medicines. Join our team and explore clinical trials.- In Research and Pre-clinical Development IT, we deliver complex enterprise systems to help Research Scientists at Regeneron solve their toughest business problems. We bring together multi-disciplinary, highly-skilled technical teams to deliver high-quality technical solutions and services that deliver business value.
The IT Data Analyst with rich experience building Data Centric Solutions is a key role on the team, enabling science by leading and contributing to full software lifecycle; architecture, design, implementation, testing and deployment of Scientific Data Applications that add demonstrable value and impact to the scientific operations of Research & Pre-clinical Development (R&pD) functions at Regeneron.
Our current Data Platform Tech stack includes Spotfire, Spark, Python, NiFi, AWS EMR, AWS MWAA, AWS Redshift, AWS RDS Postgres Aurora, Dremio. This IT Data Analyst will be an integral team member to leverage our data platform, working closely with other team members as well as our scientists to organize/manage/transform our biology data in a way to enable our scientists to gain scientific insights easier and faster. This person also can contribute to design and implement the data visualization platform to serve our research data/information/knowledge to our scientists in a user-friendly fashion.
Responsibilities:
The role you will play includes:
- Collaboration with Data SMEs and Scientists to improve data models, increasing data accessibility and promoting data-driven decision making
- Hands-on Technologist with skill to develop and maintain scalable data pipelines for new Data Product & support continuing increases in data volume and complexity
- Defining Data models & building Data product pipeline as script in PySpark
- Defining and driving adoption of best practices in code health, testing, and maintainability
- Performing data analysis required to troubleshoot data related issues, root cause analysis and assisting with resolution of data issues
Requirements:
Skills and expertise you will bring to the table include:
- Advanced degree in biology and/or advanced degree in Computer Science
- 1+ years of Data pipeline development experience with Spark (PySpark), HIVE & Airflow
- 2+ years of SQL experience & No-SQL experience is a plus
- 2 years of experience with schema design and dimensional data modeling
- 2+ years of Datawarehouse & BI Tools (Spotfire preferable) experience
- Deep biology knowledge and experience in managing biology data is a must
- Experience working on Big Data Platform (EMR & Hadoop preferable) of any size/scale
- Proven ability to adapt to new languages, frameworks, technologies as the need arises
- Solid understanding of the practical application of Agile development methods
- Ability to formulate and articulate the technical and political implications of creating and delivering transformative technical solutions in an innovative, high-growth, demanding business area
Does this sound like you? Apply now to take your first step towards living the Regeneron Way! We have an inclusive and diverse culture that provides comprehensive benefits, which often include (depending on location) health and wellness programs, fitness centers, equity awards, annual bonuses, and paid time off for eligible employees at all levels!
Regeneron is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion or belief (or lack thereof), sex, nationality, national or ethnic origin, civil status, age, citizenship status, membership of the Traveler community, sexual orientation, disability, genetic information, familial status, marital or registered civil partnership status, pregnancy or parental status, gender identity, gender reassignment, military or veteran status, or any other protected characteristic in accordance with applicable laws and regulations. The Company will also provide reasonable accommodation to the known disabilities or chronic illnesses of an otherwise qualified applicant for employment, unless the accommodation would impose undue hardship on the operation of the Company's business.
For roles in which the hired candidate will be working in the U.S., the salary ranges provided are shown in accordance with U.S. law and apply to U.S.-based positions. For roles which will be based in Japan and/or Canada, the salary ranges are shown in accordance with the applicable local law and currency. If you are outside the U.S, Japan or Canada, please speak with your recruiter about salaries and benefits in your location.
Please note that certain background checks will form part of the recruitment process. Background checks will be conducted in accordance with the law of the country where the position is based, including the type of background checks conducted. The purpose of carrying out such checks is for Regeneron to verify certain information regarding a candidate prior to the commencement of employment such as identity, right to work, educational qualifications etc.
Salary Range (annually)
$70,700.00 - $115,100.00Tags: Agile Airflow Architecture AWS Big Data Biology Computer Science Data analysis Data pipelines Data visualization Hadoop NiFi Pipelines PostgreSQL PySpark Python R Redshift Research Spark Spotfire SQL Testing
Perks/benefits: Equity / stock options Health care Startup environment Wellness
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.