Sr Data Scientist (Active TS clearance required)
Arlington, VA, US
Full Time | Senior-level / Expert | Clearance required | USD 133K - 248K *
A Square Group
Description
Locations: Falls Church / Arlington / Pentagon / Alexandria, VA (you will be assigned to one of these locations)
Company Description:
ASG is a minority woman-owned, physician-owned small business with over 14 years' experience in federal government contracting. ASG offers data collection, statistical analysis, health program evaluation, and technical and health program implementation support. ASG provides a broad range of healthcare technology-related services such as software development and integration, mobile apps, AI/ML, analytics, data science, big data, DevSecOps, digital transformation, cloud, and cybersecurity. ASG is CMMI Level 3 certified for Development and Services, and holds ISO certifications 9001:2015, 20000-1:2011, and 27000:2015.
Job Description:
ASG is seeking a Senior Data Scientist for the Department of Defense (DoD) who will design, develop, test, and support data science and informatics solutions for a variety of technical use cases. They will collaborate with cross-functional teams to integrate AI solutions into Search Portfolio products and optimize AI models for performance and scalability, utilizing cloud-based resources and distributed computing frameworks like Apache Spark/Databricks, as well as GPU-enabled Kubernetes clusters. The role involves managing the lifecycle of AI/ML components, staying updated on AI advancements, and applying analytical methodologies to resolve data challenges. The Senior Data Scientist will document findings, develop strategic data modeling processes, and maintain shared libraries and tools across teams, while contributing to rapid prototyping strategies.
What You Will Do:
- Design, configure, develop, test, and support informatics and data science solutions for a wide array of technical use cases.
- Collaborate with cross-functional teams, including data scientists and software engineers, to integrate AI solutions developed by other elements of CDAO or the DoD community into Search Portfolio products when appropriate.
- Optimize AI models for performance, scalability, and efficiency, leveraging cloud-based resources and distributed computing frameworks, specifically Apache Spark/Databricks. Adapt the code base to also run on GPU-enabled Kubernetes clusters.
- Stay updated on and contribute to the latest advancements in AI research, applying new findings to improve Search Portfolio products.
- Manage the lifecycle of AI/ML components used in Search Portfolio products from research and development to deployment and optimization.
- Apply analytical methodologies to diagnose data-related challenges, implement solutions, and evaluate performance.
- Document and present requirements, design alternatives, and findings to team members and clients.
- Develop strategic, baselined data modeling processes and accurately determine cause-and-effect relationships.
- Work with integrated development environments, data integration, data visualization, data mining, and analysis tools.
- Maintain and guide the development of common libraries and tools used by multiple teams.
- Help formulate a strategy for achieving rapid prototyping.
Requirements
What We Need:
- Bachelor’s degree plus 7-10 years of experience, or a Master’s degree plus 5 years of experience.
- Experience with ML fields, e.g., natural language processing, computer vision, statistical learning theory.
- Hands-on experience with Natural Language Processing (NLP), Large Language Models, text embedding, semantic query, use of generative AI for text, and retrieval augmented generation (RAG).
- Familiarity with data preprocessing, feature engineering, and model evaluation techniques essential for machine learning projects.
- Strong understanding of various machine learning algorithms, including supervised and unsupervised learning, reinforcement learning, and neural networks.
- Experience with version control systems like Git, enabling effective collaboration and code management.
- Experience in an ML engineer or data scientist role building ML models.
- Experience writing code in Python, R, Scala, Java, and/or C++, with documentation for reproducibility.
- Experience using Apache Spark/Databricks distributed compute environments for AI/ML workloads.
- Experience handling petabyte-scale datasets, diving into data to discover hidden patterns, using data visualization tools, writing SQL, and working with GPUs to develop models.
- Experience with cloud-based data persistence products, especially RDS PostgreSQL and PostgreSQL extensions such as pgvector.
- Experience persisting vectorized data from text embedding processes using Elastic and/or OpenSearch, in addition to vector-enabled RDBMSs such as pgvector-enhanced PostgreSQL.
- Experience writing and speaking about technical concepts to business, technical, and lay audiences and giving data-driven presentations.
Clearance Required:
- US Citizenship required.
- Possess a minimum of an active Top Secret (TS) security clearance with Sensitive Compartmented Information (SCI) eligibility.
Additional Information:
At ASG, we value diversity and always treat all employees and job applicants based on merit, qualifications, competence, and talent. We do not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, or status as a protected veteran.
Applicants in need of special assistance or accommodation during the interview process or in accessing our website may contact us by sending an email to careers@a2-g.com. We will treat your request as confidentially as possible. In your email, please include your name and preferred method of contact, and we will respond as soon as possible.
Perks:
At ASG, we want you to be well and thrive. Our benefits package includes:
- Healthcare Benefits
- Disability
- Life
- Paid Time Off
- 401k Matching
- Employee Referral Bonus
- Education Assistance
- Learning and Development resources
- EOE, including Disability/Veterans
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Computer Vision Databricks Data Mining Data visualization Engineering Feature engineering Generative AI Git GPU Healthcare technology Java Kubernetes LLMs Machine Learning ML models NLP OpenSearch PostgreSQL Prototyping Python R RAG RDBMS Reinforcement Learning Research Scala Security Spark SQL Statistics Unsupervised Learning
Perks/benefits: Career development Health care Salary bonus