Data Engineer
United States - Remote
K1X
The only AI-powered platform that streamlines alternative investment tax data. AI Automation Software for K-1s, K-3s and 990s.
Full-Time (Exempt)
Fully Remote Position
Preferred Locations: Central Time Zone or Eastern Time Zone
Who We Are:
We are K1X. Our technology is used by the nation’s largest institutional investors, funds, and accounting firms, who rely on our long-established solutions to create an all-digital K-1 experience. Our goal is to transform the K-1 industry by moving a traditionally PDF-based process to an all-digital experience via our software solutions. Join us at the start of something exciting!
What Are We Looking For?
We are seeking a highly skilled and experienced Staff Data Engineer to join our dynamic team. The ideal candidate will be comfortable working across various data engineering tasks, from building and optimizing data pipelines to designing and implementing scalable data infrastructure. As a Staff Data Engineer, you will play a crucial role in supporting our machine learning models and ensuring that our systems are robust, efficient, and scalable.
With K1X You Will:
- Design, build, and maintain scalable and efficient data pipelines to support machine learning models and analytics.
- Collaborate with data scientists, software engineers, and other stakeholders to understand data needs and deliver appropriate solutions.
- Implement best practices for data governance, data quality, and data lifecycle management.
- Mentor and guide junior team members, fostering a collaborative and innovative work environment.
Requirements
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field
- 6+ years of relevant industry experience as a data engineer, with a focus on unstructured and semi-structured data (e.g. financial documents)
- Experience with various data storage solutions (SQL, NoSQL, data lakes, data warehouses, ...) as well as cloud platform offerings and vendor solutions (for example, Azure Cosmos DB, GCP BigQuery, Databricks, ...)
- Excellent problem-solving skills with the ability to synthesize and communicate complex technical results to senior leaders and nontechnical audiences
- Proficiency in Python and familiarity with machine learning frameworks and libraries (e.g. scikit-learn, PyTorch)
Preferred Experience:
- Previous experience with applications of NLP to financial documents
- Familiarity with alternative investment accounting needs
- Prior experience managing a moderate-sized (300k+ documents) training corpus for language models
Benefits
- Unlimited Vacation Policy + Sick Time + Holidays
- Paid Parental Leave
- Fully Remote Opportunity
- Healthcare Benefits and 401(k)
- Growing Startup Culture