Technical Data Scientist/ETL Engineer

Remote, United States

GovCIO

GovCIO combines extensive federal experience, cutting-edge IT solutions, and a customer-first approach to government IT modernization.

View all jobs at GovCIO

Apply now Apply later

Overview

GovCIO is currently hiring for an ETL Engineer (or Data Scientist) to join our ETL Team focused on designing, developing, and maintaining robust ETL pipelines and data infrastructure within AWS GovCloud environments. This position will be located in Hanover, MD and will be a fully remote position.

Responsibilities

Design, develop, and maintain robust ETL pipelines and data infrastructure within AWS GovCloud environments. You will work closely with cross-functional teams to ensure that data from a variety of sources is efficiently processed, analyzed, and visualized to monitor and optimize system performance. You will use a combination of AWS native services and open-source software to create scalable and high-performance data pipelines, manage complex data flows, and contribute to operational monitoring efforts across a large AWS environment.

 

Data Pipeline Development:

  • Design and implement Extract, Transform, Load (ETL) solutions to move, process, and store data from a wide range of sources, including AWS S3, CloudWatch, EventBridge, and other cloud-based services.
  • Leverage AWS services such as Lambda, Kinesis, and Data Prepper to create data pipelines that span multiple AWS accounts.
  • Use AWS CloudFormation to deploy and manage data pipeline infrastructure in an Infrastructure-as-Code (IaC) environment.
  • Deploy and manage CloudWatch alarms and synthetic canaries to ensure system health and proactively detect issues before they impact users.

Log Aggregation & Metrics Collection:

  • Deploy and configure Fluent-bit agents to collect system and application logs from hundreds of critical systems.
  • Develop custom Lua functions and regex parsers for transforming and routing logs as required.

 

Data Analysis, Visualization & Alerting:

  • Design and build data visualizations, dashboards, and alerts to monitor and detect anomalous system activity.
  • Design, implement, and maintain AWS CloudWatch alarms to monitor the health and performance of cloud applications, infrastructure, and services.
  • Configure notifications for alerts using Amazon SNS, and integrate with communication tools (e.g., Slack, email) to ensure timely awareness of critical issues.
  • Develop custom visualizations using Vega, create alerts using Query DSL and Painless scripting, and produce reports via OpenSearch SQL.

Cluster Management & Data Storage:

  • Manage and maintain an OpenSearch cluster to support large-scale data ingestion and querying.
  • Implement explicit mappings for fields being ingested into OpenSearch and manage index state management policies for optimal performance.
  • Use Active Directory Federated Services to implement access control and multi-tenancy within the OpenSearch environment.

Application Performance Monitoring:

  • Instrument diverse applications using OpenTelemetry for OpenSearch APM to ensure visibility into application performance.

Qualifications

HS Diploma with 9+ years' experience/professional

Clearance: Top Secret

 

Required Skills and Experience

  • Must have IAT level II/III certification  (i.e., CompTIA Security+(CE))
  • Must have experience and certifications in Linux and/or AWS GovCloud technologies

 

Preferred Skills and Experience

  • Strong experience with AWS services such as Lambda, Kinesis, CloudWatch, S3, EventBridge, and CloudFormation.
  • Proficiency in Python for developing and maintaining data pipelines.
  • Experience managing distributed NoSQL databases such OpenSearch, Elasticsearch, MongoDB or Splunk.
  • Experience deploying and configuring agents for log aggregation and system monitoring.
  • Knowledge of Infrastructure-as-Code (IaC) principles, particularly with AWS CloudFormation.
  • Effective written and oral communication skills
  • Certification(s) in relevant technologies

#NSS

#AR

Company Overview

GovCIO is a team of transformers--people who are passionate about transforming government IT. Every day, we make a positive impact by delivering innovative IT services and solutions that improve how government agencies operate and serve our citizens.But we can't do it alone. We need great people to help us do great things - for our customers, our culture, and our ability to attract other great people. We are changing the face of government IT and building a workforce that fuels this mission. Are you ready to be a transformer?We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, disability, or status as a protected veteran. EOE, including disability/vets.

 

Posted Pay Range

 

The posted pay range, if referenced, reflects the range expected for this position at the commencement of employment, however, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, education, experience, and internal equity. The total compensation package for this position may also include other compensation elements, to be discussed during the hiring process. If hired, employee will be in an “at-will position” and the GovCIO reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, GovCIO or individual department/team performance, and market factors.

Posted Salary Range

USD $155,000.00 - USD $172,000.00 /Yr.
Apply now Apply later
Job stats:  0  0  0

Tags: AWS CloudFormation Data analysis Data pipelines Elasticsearch ETL Kinesis Lambda Linux Lua MongoDB NoSQL OpenSearch Open Source Pipelines Python Security Splunk SQL Transformers

Perks/benefits: Equity / stock options

Regions: Remote/Anywhere North America
Country: United States

More jobs like this