Software Engineer - Big Data Ingestion and Processing
Herndon, VA
Redhorse
About the Organization
Now is a great time to join Redhorse Corporation. Redhorse specializes in developing and implementing creative strategies and solutions with private, state, and federal customers in the areas of cultural and environmental resources services, climate and energy change, information technology, and intelligence services. We are hiring creative, motivated, and talented people with a passion for doing what's right, what's smart, and what works.
About the Role
Redhorse is transforming how government agencies leverage data and technology. We are seeking a highly skilled Software Engineer to join our team supporting a critical intelligence mission. You will play a vital role in ingesting, processing, and analyzing massive datasets, directly impacting the Sponsor's ability to address pressing intelligence questions. You will work with cutting-edge technologies in a dynamic, collaborative environment, directly contributing to national security.
Key Responsibilities
- Load large datasets into the Sponsor’s on-premises and Cloud environments.
- Develop and maintain ingestion algorithms and schemas for large datasets.
- Analyze new large-volume datasets to optimize the data ingest processes.
- Support the creation of Apache NiFi schemas for new data loads.
- Develop software tools that efficiently preprocess, modify, aggregate, load, index, and archive large data collections into clusters in near real-time.
- Ensure proper access controls are implemented.
- Generate metrics to track data ingest statistics to maintain data integrity and provenance.
- Document the data-flows according to standards set by the Sponsor.
- Engage regularly with data scientists, analysts, and managers.
Required Experience/Clearance
- Demonstrated professional experience in Computer Science, Computer Engineering, Systems Engineering, or closely related discipline.
- Demonstrated professional experience with AWS cloud services, including long-term storage options, and cloud-based database services.
- Demonstrated experience working with Databricks.
- Demonstrated experience understanding SQL database structures and mapping them between different SQL databases.
- Demonstrated professional experience working with Apache NiFi.
- Demonstrated professional experience working with large data and high-performance compute clusters such as Hadoop or similar.
- Demonstrated experience with API development techniques.
- Demonstrated experience developing and deploying ETL processes for large data sets.
- Demonstrated experience creating operating system level scripts to perform ETL operations on SQL databases.
- Demonstrated professional experience with version control systems, preferably Git.
- Demonstrated experience testing the development of software solutions for the extraction, transformation, and loading of data using the most efficient tools and languages for the task, such as NiFi, Python, and SQL.
- Demonstrated experience implementing multiprocessing data-flows to parallelize ingest operations.
- Minimum 5-7 years of relevant experience.
Desired Experience
- Demonstrated experience with the Sponsor’s data environment.
- Demonstrated experience exhibiting strong coordination and collaboration skills.
- Demonstrated experience working with full-stack developers to deploy applications that leverage large data sets.
- Demonstrated experience communicating technical concepts to non-technical audiences.
Categories:
Big Data Jobs
Engineering Jobs
Tags: API Development APIs AWS Big Data Computer Science Databricks Engineering ETL Git Hadoop NiFi Python Security SQL Statistics Testing
Region:
North America
Country:
United States