Senior Data Engineer

US-ME-Westbrook ID, United States

IDEXX

Enhancing the health and well-being of pets, people, and livestock.

View all jobs at IDEXX

Apply now Apply later

IDEXX Laboratories, Inc. seeks a Senior Data Engineer in Westbrook, ME (telecommuting permitted) to develop and build robust, fault-tolerant data pipelines that collect, assemble, and potentially transform and aggregate unorganized data distributed into databases or data sources such as operational data stores, data integration hubs and data lakes or estuaries. The Senior Data Engineer compiles and installs database systems, writes queries, scales to multiple machines and puts disaster recovery systems into place; builds groundwork for data consumers (software or human) to easily retrieve needed data for evaluations and experiments; builds operational data use cases such as moving large volumes of data across applications via operational data stores, data hubs and data lakes, and builds private/segregated data pipelines between specific applications. Duties include designing and implementing scalable, reliable distributed data processing frameworks and analytical infrastructure using multiple technologies, including data sets or data warehouses, data virtualization and services and repositories of semi-structured data sets; designing metadata and schemas for assigned projects based on a logical model; creating scripts for physical data layout; writing scripts to load test data; validating schema design; developing and implementing node cluster models for unstructured data storage and metadata to meet performance and financial guidelines; designing advanced level Structured Query Language (SQL), data definition language (DDL) and Python scripts to assist in data validation or performance tuning steps; defining, designing, and implementing data management, storage, backup and recovery solutions that ensure high performance of the organization's enterprise data; designing automated software deployment functionality that allows efficient management of applications across distributed platforms; understanding structural requirements and defining standards for how data will be stored, consumed, integrated and managed; monitoring structural performance and utilization, identifying problems and implements solutions; leading the creation of standards, best practices and new processes for operational integration of new technology solutions; ensures environments are compliant with defined standards and operational procedures; and implementing measures to ensure data accuracy and accessibility, constantly monitoring and refining the performance of data management systems; and completing problem.

The minimum requirements for this position are a Bachelor’s degree in Computer Science, Computer Engineering, Information Systems, Information Systems Engineering or a related field and 5 years of experience that includes 3 years of related professional experience with object-oriented languages: Python, Java, and Scala, or alternatively, a Master’s degree in Computer Science, Computer Engineering, Information Systems, Information Systems Engineering or a related field 3 years of related professional experience with object-oriented languages: Python, Java, and Scala. All qualified candidates must have advanced SQL knowledge and experience working with relational databases, including Snowflake, Oracle, Redshift; AWS or Azure cloud platforms; data pipeline and workflow scheduling tools: Apache Airflow, Informatica; with ETL/ELT tools and data processing techniques; and in database design, development, and modeling. This position is eligible for IDEXX’s Employee Referral Program. Apply at https://careers.idexx.com/us/en/ or e-mail hireme@idexx.com.  

IDEXX values a diverse workforce and workplace and strongly encourages women, people of color, LGBTQ+ individuals, people with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply.
IDEXX is an equal opportunity employer. Applicants will not be discriminated against because of race, color, creed, sex, sexual orientation, gender identity or expression, age, religion, national origin, citizenship status, disability, ancestry, marital status, veteran status, medical condition, or any protected category prohibited by local, state, or federal laws.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  0  0
Category: Engineering Jobs

Tags: Airflow AWS Azure Computer Science Data management Data pipelines DDL ELT Engineering ETL Informatica Java Oracle Pipelines Python RDBMS Redshift Scala Snowflake SQL Unstructured data

Region: North America
Country: United States

More jobs like this