AI Operations & Infrastructure Engineer
Fort Meade, MD, United States, 20755
USD 184K-333K (estimate) | Senior-level | Full Time
Tasks
- Configure and administer logical and physical resources
- Configure and manage network topologies and out-of-band management
- Configure and optimize AI networking infrastructure
- Deploy and manage data processing units
- Diagnose and resolve networking issues
- Ensure efficient power and cooling for AI infrastructure
- Ensure secure, efficient, and scalable operation of AI infrastructure
- Implement and manage containerization technologies
- Implement workload management and scheduling
- Install and configure GPU drivers and software
- Lead deployment and validation of AI servers and systems
- Manage and maintain AI computing platforms
- Manage storage solutions for AI data
- Monitor and manage AI cluster health and resource utilization
- Monitor, document, and report cluster health and job performance
- Oversee AI software stack and tools
- Perform firmware upgrades and hardware validation
- Provide technical support for AI infrastructure teams
- Replace faulty components and optimize systems
- Troubleshoot hardware, software, storage, and performance faults
Perks/Benefits
- N/A
Skills/Tech-stack
Base Command Manager | Command Manager | Data Processing | Data Processing Unit (DPU) | Docker | Ethernet | InfiniBand | Kubernetes | NVIDIA Base Command Manager | NVIDIA GPU | Network Protocols | NVIDIA Base Command | Run | Slurm | Storage Administration
Education
N/A