Data Engineer (Remote, US)
United States
Sayari
Get instant access to public records, financial intelligence and structured business information on over 455 million companies worldwide.
Our company culture is defined by a dedication to our mission of using open data to enhance visibility into global commercial and financial networks, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you.
POSITION DESCRIPTION
Sayari provides instant access to structured business information from hundreds of millions of corporate, legal, and trade records for a variety of use cases. As a member of Sayari's data team, you will work with our Product and Software Engineering teams to build the graph that underlies Sayari’s products.
Please note that we cannot provide H1B and/or Visa Sponsorship for this role at this time.
Job Responsibilities:
- Build and maintain ETL pipelines to process and export record data to the Sayari Graph application
- Develop and improve entity resolution processes
- Implement logic to calculate and export risk information
- Work with product team and other development teams to collect and refine requirements
- Run and maintain regular data releases
Required Skills & Experience:
- Expertise with Python or a JVM programming language (e.g., Java, Scala)
- Expertise with SQL (e.g., Postgres) databases
- 2+ years of experience designing, maintaining, and orchestrating ETL pipelines (e.g., Apache Spark, Apache Airflow) in cloud-based environments (e.g., GCP, AWS, or Azure)
Desired Skills & Experience:
- Experience with entity resolution, graph theory, and/or distributed computing
- Experience with Kubernetes
- Experience working as part of an agile development team using Scrum, Kanban, or similar
Tags: Agile, Airflow, AWS, Azure, Engineering, ETL, GCP, Java, Kanban, Kubernetes, Pipelines, PostgreSQL, Python, Scala, Scrum, Spark, SQL
Perks/benefits: 401(k) matching, Career development, Competitive pay, Equity / stock options, Flex vacation, Health care, Insurance, Medical leave, Parental leave, Startup environment, Transparency