Director of AI Infrastructure
Tasks
- Decide when to burst to cloud vs invest in on prem capacity
- Develop storage roadmap for throughput and durability
- Direct strategy for Beaker orchestration platform
- Manage GPU compute budget and resource economics
- Optimize job scheduling for hybrid cloud workloads
- Oversee on prem GPU cluster availability and performance
- Partner with hardware vendors to meet infrastructure demands
- Provide technical bridge to research teams
Perks/Benefits
- 401k plan
- Annual bonuses
- Commuting support
- Employee assistance program
- Fitness and Wellbeing Support
- Health savings account
- Long-term incentive plan
- Medical/Dental/Vision
- Paid Holidays
- Paid sick leave
- Paid vacation
- Personal days
Skills/Tech-stack
AWS | Beaker | Ceph | Containerd | Distributed Systems | Docker | GCP | Go | HPC | High Performance | High-Performance Computing | Hybrid Cloud | Infiniband | Kube scheduler | Kubernetes | Linux | Lustre | NCCL | NVIDIA GPU | Performance Computing | Python | Resource Management | Slurm | Weka
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Related jobs
-
AI Engineer USD 103K-140KAI Agents | AI Studio | Access Control | Anthropic Claude | AuthenticationBonus eligibleSenior-level Full TimeDenver, CO, United States6h ago
-
Director, AI Solutions Architect USD 139K-225KAPI Gateway | Agent Orchestration | BigQuery | CI/CD | Cloud MicroservicesAfter-hours support | Occasional overtime | Occasional travelSenior-level Full TimeOak Brook, IL, United States11h ago
-
Technical Architect – AI, ML & Generative AI USD 142K-240KAWS Bedrock | AWS SageMaker | Agentic AI | Apache Spark | Artificial Intelligence401k | Critical Illness Accident Hospital Indemnity Identity Theft Protection | Dental plans | Life and Accidental Death and Dismemberment | Long-term disabilitySenior-level Full TimeFrisco, United States14h ago
-
Entry-level InternshipChicago, IL, US18h ago
-
Solution Architect (AI & Data Applications) USD 180K-247KAutogen | CI/CD | Databricks | Docker | FastAPIMentoring system | Professional development | Supportive work environmentSenior-level Full TimeJersey City, NJ, United States18h ago
-
AWS | Artificial Intelligence | Azure AI | Data Analysis | DatabricksBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeChicago, IL, United States1d ago
-
Associate Director, Computational Biology USD 180K-240KAgent-based | Agent-based modeling | Bioinformatics Databases | Cell biology | D3JS401k | Dental insurance | ESPP | Employee wellness | Medical insuranceMid-level Full TimeSilver Spring, MD, United States1d ago
-
Mid-level ContractHarrisburg, PA1d ago
-
AI Engineering Sr Director or VP, Data Science USD 128K-175KAI Platform | AWS SageMaker | Agent systems | Agentic AI | Azure MLCollaborative culture | Growth opportunities | Impactful technical work | Professional developmentSenior-level Full TimeColumbia, MD, United States1d ago
-
Software Engineer, Hardware Health USD 250K-445KAutomated remediation | Distributed Systems | Fleet Lifecycle Management | Infiniband | Infrastructure PlatformsSenior-level Full TimeSan Francisco1d ago
-
AI Agents | Apache Spark | Data Ingestion | Data Modeling | Data Transformation401k match | Company provided disability insurance | Dental insurance | Flexible spending accounts | Health care and dependent care flexible spending accountsSenior-level Full TimeUnited States1d ago
-
AI Solutions Engineer, East USD 125K-175KAWS | Azure | Cloud platform | Dspy | Generative AI401k plan | Dental insurance | Medical insurance | Mental wellness support | Parental leaveMid-level Full TimeRemote (New York) R1d ago
-
Sr. AI Engineer USD 176K-240KAWS | Agentic Workflows | Autonomous Agents | Compliance | Context engineering401k plan with employer matching | Advancement opportunities | Employee development program stipend | Fertility/adoption assistance | Flexible PTOSenior-level Full TimeAtlanta, GA1d ago
-
Deployed Engineer (Seattle) USD 165K-280KAWS | Agent architecture | Azure | Containers | Failure handling401k plan | Dental insurance | Flexible vacation | Meals on in office days | Medical insuranceSenior-level Full TimeSeattle, WA1d ago
-
AI Developer II USD 96K-150KAPIs | Agentic AI | Authentication and Authorization | Azure AI | Blue PrismEntry-level Full TimeMaryville, TN, United States1d ago
-
AWS | Airflow | Artificial Intelligence | Azure | CDISCExecutive-level Full TimeNorth Chicago, IL, United States1d ago
-
Senior-level ContractATLANTA, GA1d ago
-
Director, Analytics Engineering USD 270K-330KAggregation | Airflow | BigQuery | DBT | Data Governance401k plan | Commuter benefits | Employee assistance program | Fitness benefits | Flexible time offExecutive-level Full TimeNew York, NY1d ago
-
Senior-level Full TimeErie, PA, United States1d ago
-
Director of Engineering, AI & Computer Vision USD 200K-231KAWS | Call Management | Cloud Architecture | Computer Vision | Data EngineeringExecutive-level Full TimeAlpharetta, GA1d ago
-
Senior Applied AI Engineer - Life Sciences & Healthcare USD 120K-200KAPI | AWS | CI/CD | Caching | Data IngestionSenior-level Full TimeAustin, TX, US1d ago
-
Algorithms | C++ | Compute Technologies | Data Structures | DebuggingSenior-level Full TimeSunnyvale, CA, USA1d ago
-
AI Platform Engineer USD 119K-258KAI orchestration | API Integration | Azure | Azure Data | Azure Data FactoryOccasional travel | Remote workSenior-level Full TimeBaltimore, Maryland, United States R1d ago
-
Director of Embedded Software USD 192K-280KBluetooth Low Energy | C# | C++ | Embedded Systems | Firmware DevelopmentExecutive-level Full TimeBoston MA1d ago
-
Director, Analytics USD 139K-160KArtificial Intelligence | Data Analysis | Data Manipulation | Data Visualization | Excel401k | Hybrid work | Medical, dental, and vision insurance | Paid time offExecutive-level Full TimeGA, United States1d ago