Senior Data Engineer (Remote, US)
United States
Sayari
Get instant access to public records, financial intelligence and structured business information on over 455 million companies worldwide.Our company culture is defined by a dedication to our mission of using open data to prevent illicit commercial and financial activity, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you.
POSITION DESCRIPTIONSayari provides instant access to structured business information from hundreds of millions of corporate, legal, and trade records for a variety of use cases. As a member ofSayari's data team you will work with our Product and Software Engineering to build the graph that underlies Sayari’s products.
Job Responsibilities
- Build and maintain ETL pipelines to process and export record data to Sayari Graph application
- Develop and improve entity resolution processes
- Implement logic to calculate and export risk information
- Work with product team and other development teams to collect and refine requirements
- Run and maintain regular data releases
Required Skills & Experience
- Expertise with Python and a JVM programming language (e.g., Scala)
- Expertise with SQL (e.g., Postgres) and NoSQL (e.g., Cassandra, Elasticsearch, Memgraph, etc.) databases
- 7+ years of experience designing, maintaining, and orchestrating ETL pipelines (e.g., Apache Spark, Apache Airflow) in cloud based environments (e.g., GCP, AWS, or Azure).
Desired Skills & Experience
- Experience with entity resolution, graph theory, and/or distributed computing
- Experience with Kubernetes
- Experience working as part of an agile development team using Scrum, Kanban, or similar
Benefits
- A collaborative and positive culture - your team will be as smart and driven as you
- Limitless growth and learning opportunities
- A strong commitment to diversity, equity, and inclusion
- Performance and incentive bonuses
- Outstanding competitive compensation and comprehensive family-friendly benefits, including full healthcare coverage plans, commuter benefits, 401K matching, generous vacation, and parental leave.
- Conference & Continuing Education Coverage
- Team building events & opportunities
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Airflow AWS Azure Cassandra Elasticsearch Engineering ETL GCP Kanban Kubernetes NoSQL Pipelines PostgreSQL Python Research Scala Scrum Spark SQL
Perks/benefits: Career development Competitive pay Equity / stock options Parental leave Startup environment Team events
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.