Data Specialist
Somerville, MA
Hatch IT
hatch I.T. is a tech recruiting partner for scaling startups and small businesses. We specialize in engineering, data, and product teams.
hatch I.T. is partnering with VIA to find a Data Specialist. See details below:
About the Role:
As a Data Specialist at VIA, you will play a pivotal role in the growth of their solutions. Your key responsibilities will include translating customer domain knowledge into data infrastructure requirements, designing and automating data ingestion and cleaning processes, transforming raw data into insightful visualizations, and ensuring the quality of our data-driven solutions. You will manage source documents to verify the accuracy of information, ensure data quality, and manage the incorporation of data into VIA’s work streams (database, modeling, and reporting decisions, to name a few). You will work on an Agile product delivery team that may include developers, data and modeling specialists, and client delivery professionals.
About VIA:
VIA is making an impact, and so can you. At VIA, their mission is to make communities cleaner, safer, and more equitable. They believe that by working across organizational boundaries, they can achieve greater collective good than they can individually. VIA overcomes digital barriers to collective action by providing the world’s most secure and simple data and identity protection solutions. They are trusted by the U.S. Department of Defense and Fortune 100 companies around the globe to solve their toughest data and identity protection challenges. Using our Web3, quantum-resistant, passwordless technologies (19 issued patents), VIA protects data against theft, manipulation, and misuse.
In this role, you will:
- Understand existing data ingestion and cleaning processes, and take responsibility for further refinement and implementation
- Explore raw customer data to get a better understanding of the different files, columns, and characteristics (e.g., column averages, expected ranges, trends, standard deviations, etc.)
- Understand customer needs and requirements, and make suggestions based on data
- Work with VIA’s client delivery team and the customer to validate assumptions and resolve issues with the data
- Own the ingestion of data into relational and/or non-relational databases standardized across customers, enabling data to connect to the rest of the data science pipeline
- Leverage VIA libraries to ingest raw data from customers
- Perform cleaning of raw data
- Design end-to-end data pipelines
- Coordinate with internal stakeholders and customers when information is missing or discrepancies are found
- Perform quality control, identifying errors or discrepancies in data and data products, including automated and manual testing
- Translate customer domain knowledge to create automated analyses and insights
- Document assumptions and actions made during the data cleaning and wrangling process to provide traceability
- Utilize data visualization tools to analyze data and craft compelling visual representations
- Contribute to the continual improvement of internal tools for data cleaning and data quality assessment by identifying key data-related challenges that are ideal candidates for automation
- Contribute to the delivery of data-based products to external customers, including data visualizations, data quality reports, and statistical analysis
What you will bring to this role:
- 2+ years of experience in a data-driven role or equivalent in data-related research projects
- Bachelor’s or Master's degree in science, mathematics, engineering, or a data-driven field
- Competence in at least two of the following technologies:
  - Database technologies (e.g., SQL, PostgreSQL)
  - Python, R, or equivalent programming with data science libraries (e.g., NumPy, pandas)
  - Data visualization tools (e.g., Plotly, seaborn, ggplot)
- Ability to break down complex problems into actionable steps
- A passion for data quality
- A self-starter attitude and demonstrated ability to learn new technologies quickly
- Experience in the following is a plus:
  - Data dashboarding tools (e.g., Streamlit, Tableau, Dash)
  - Data pipelining (e.g., ETL workflows and tools)
Tags: Agile Data pipelines Data quality Data visualization Engineering ETL Mathematics NumPy Pandas Pipelines Plotly PostgreSQL Python R RDBMS Research Seaborn SQL Statistics Streamlit Tableau Testing
Region: North America
Country: United States