ML Operations Engineer (TS/SCI with CI poly)

Herndon, VA, United States

Maxar Technologies

Integrated space infrastructure and Earth intelligence capabilities that make global change visible, information actionable and space accessible.

View all jobs at Maxar Technologies

Apply now Apply later

Please review the job details below.

Maxar is seeking a Machine Learning Operations Engineer to support the development and integration of various intelligence capabilities into a test and subsequently operational environment. Our program team is a multi-faceted software development and systems administration team working to build and maintain software applications backed by a self-managed high-performance compute (HPC) infrastructure on a private cloud system.  We are responsible for the system from the hardware to the user interface.

Principal Responsibilities: 

  • Architect, develop, and implement systems that enhance software development and machine learning processes 
  • Streamline the software development life cycle from requirements to monitoring in production 
  • Incorporate open-source tools and automation to reduce tedious tasks 
  • Implement continuous integration and delivery to limit manual testing and troubleshooting 
  • Enhance workflows and processes by building an enterprise-scale environment using DevOps methodologies 
  • Collaborate with software engineers to deploy machine learning models, ensuring optimal performance and resource utilization 
  • Architect and implement solutions to scale machine learning inference for large workloads 
  • Monitor and fine-tune model inference for optimal speed and resource utilization 
  • Implement automation tools and processes for model deployment, monitoring, and scaling 
  • Develop robust monitoring and logging solutions to track model performance and system health in real-time 
  • Maintain detailed documentation of machine learning operations processes and best practices 
  • Provide technical support for debugging and resolving issues related to model deployment and inference 

Minimum Requirements: 

  • Eight (8) years' experience in machine learning, data science, software engineering, data analytics, or DevOps 
  • Bachelor of Science (BS) Degree from an accredited university in a technical field is required. Five (5) additional years of experience in storage operations may be considered in lieu of degree. 
  • Experience building and deploying machine learning models 
  • Experience with Docker or Kubernetes for production-grade solutions 
  • Proficiency in programming languages for data science, including Python, Java, and C++ 
  • Experience architecting and deploying AI solutions with generative models, including large language models (LLMs) 
  • Familiarity with CI/CD pipelines, including Jenkins or Gitlab 
  • Experience with enterprise data platforms 
  • Knowledge of automation technologies, including Ansible and/or Terraform 
  • Must meet DoD 8570 IAT Level II requirements including one of the following: Security+ CE, CND, SSCP, GSEC, GICSP, CySA+, or CCNA Security 
  • Top Secret SCI with a CI Polygraph 

Desired Skills: 

  • Ability to understand ML code leveraging modern ML and data frameworks such as Pytorch and Tensorflow.  
  • Experience with data engineering with distributed data processing and distributed training 
  • Experience with MLOps frameworks like Kubeflow, MLFlow, Airflow, etc. 
  • Familiarity with containerization and orchestration tools such as Dockers and Kubernetes. 
  • Knowledge of A/B testing and benchmarking model performance in production. 
  • Experience with architecting and deploying machine learning cybersecurity tools such as Morpheus

#cjpost

#LI-RD

In support of pay transparency at Maxar, we disclose salary ranges on all U.S. job postings.  The successful candidate’s starting pay will fall within the salary range provided below and is determined based on job-related factors, including, but not limited to, the experience, qualifications, knowledge, skills, geographic work location, and market conditions. Candidates with the minimum necessary experience, qualifications, knowledge, and skillsets for the position should not expect to receive the upper end of the pay range.

● The base pay for this position within the Washington, DC metropolitan area is: $131,000.00 - $219,000.00 annually.

For all other states, we use geographic cost of labor as an input to develop market-driven ranges for our roles, and as such, each location where we hire may have a different range.

We offer a comprehensive package of benefits including paid time off, health and welfare insurance, and 401(k) to eligible employees. You can find more information on our benefits at: https://www.maxar.com/careers/benefits

The application window is three days from the date the job is posted and will remain posted until a qualified candidate has been identified for hire.  If the job is reposted regardless of reason, it will remain posted three days from the date the job is reposted and will remain reposted until a qualified candidate has been identified for hire. 

The date of posting can be found on Maxar’s Career page at the top of each job posting.

To apply, submit your application via Maxar’s Career page.

Maxar Technologies values diversity in the workplace and is an equal opportunity/affirmative action employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected veteran status, age, or any other characteristic protected by law.

Apply now Apply later
Job stats:  2  0  0

Tags: A/B testing Airflow Ansible CI/CD Data Analytics DevOps Docker Engineering Generative modeling GitLab HPC Java Jenkins Kubeflow Kubernetes LLMs Machine Learning MLFlow ML models MLOps Model deployment Model inference Open Source Pipelines Python PyTorch SDLC Security TensorFlow Terraform Testing

Perks/benefits: Career development Health care Insurance

Region: North America
Country: United States

More jobs like this