Data Scientist
Stevensville, Maryland, United States
Full Time USD 150K - 167K
BlueHalo
BlueHalo is united by a mission to create & deploy purpose-built solutions to those who defend us at home & abroad where & when they need it.At BlueHalo, we don’t just witness the future of national security – we create it. We're on the lookout for a Data Scientist to embark on challenging, mission-critical projects in Stevensville, MD directly impacting the nation’s security and intelligence mission. In our team of problem solvers, innovators, technologists, and operators, you'll be at the forefront of driving meaningful change and making an enduring impact.
We are seeking a highly motivated Data Scientist with expertise in Python coding, big data analytics, and publicly available information (PAI) analysis. The ideal candidate will possess strong problem-solving skills, a willingness to learn new technologies, and the ability to develop innovative solutions for large-scale data challenges. This role requires experience with cloud-based platforms, network analysis, and data lifecycle strategy. The successful candidate will be responsible for designing and implementing data storage schemas, developing advanced analytics, and collaborating with various stakeholders to support mission-critical objectives.
You'd like to do this:
- Big Data Analytics & Processing.
- Develop and maintain large-scale data analytics solutions for complex datasets.
- Implement Spark-based analytics using PySpark or other supported languages (Java, Scala, R).
- Design data storage schemas to efficiently store and retrieve analytic results.
- Support quick turnaround data analytics such as rapid extraction, processing, and ad hoc analysis.
- Development & Coding o Write, update, and maintain clean, well-documented code for analytics solutions.
- Conduct code reviews and provide SME expertise to other developers.
- Utilize GitHub for version control and CI/CD pipeline integration.
- Data Strategy & Lifecycle Management.
- Provide subject matter expertise on data lifecycle strategy, from ingestion to archiving.
- Optimize analytics workflows for scalability and efficiency.
- Support cloud-based data solutions leveraging AWS, Azure, and Databricks.
- Collaboration & Coordination.
- Engage with customers and external partners to support analytics integration.
- Assist in coordinating efforts with other developers, contractors, and analysts.
- Provide training support to enhance personnel expertise in analytics tools and methodologies.
You’re required to have this:
- Proficiency in Python, with experience in Spark (PySpark preferred) for big data analytics.
- Experience with Apache NiFi for large-scale iterative data processing.
- Familiarity with cloud platforms such as AWS, Microsoft Azure, and Databricks.
- Ability to analyze network traffic using Wireshark.
- Knowledge of GitHub CI/CD pipelines for software deployment and integration.
- Strong analytical problem-solving skills and ability to develop creative solutions.
- Comfortable working with large-scale datasets and optimizing performance.
- Familiarity with Elasticsearch for data indexing and search.
- Experience working with large distributed data systems.
- Knowledge of cybersecurity concepts and data protection strategies.
- Understanding of data visualization techniques and reporting methodologies.
- Willingness to Learn: Ability to adapt to new technologies, tools, and problem sets.
- Ability to explore unconventional solutions and optimize large-scale analytics.
- Ability to work closely with other developers, analysts, and stakeholders.
- Experience handling massive datasets and developing scalable analytics workflows.
Salary Range: $150,000 - $167,000
The BlueHalo pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Determination of official compensation or salary relies on several factors including, but not limited to, level of position, job responsibilities, geographic location, scope of relevant work experience, educational background, certifications, contract-specific affordability, organizational requirements, alignment with local internal equity as well as alignment with market data.
Our compensation package also includes components designed to support employees’ total well-being, which should be considered when evaluating our competitive benefits package. These benefits include health insurance, life insurance, disability, company holiday and paid time off, parental leave, 401(k) company match and contributions, professional development/training reimbursements, and other work/life programs.
Tags: AWS Azure Big Data CI/CD Data Analytics Databricks Data strategy Data visualization Elasticsearch GitHub Java NiFi Pipelines PySpark Python R Scala Security Spark
Perks/benefits: 401(k) matching Competitive pay Equity / stock options Health care Insurance Parental leave
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.