Staff Software Engineer - AI Research Infrastructure
New York City, New York; San Francisco, California
USD 199K-270K Senior-level Full Time
Tasks
- Build CI testing infrastructure for research code
- Build services for scheduling and orchestration
- Convert experimental workloads into robust repeatable pipelines
- Create abstractions for job submission and management
- Design infrastructure for large scale experiments
- Develop monitoring and observability for workloads
- Develop workflows that reduce iteration time
- Improve research developer productivity tooling
- Mentor engineers on compute infra and AI systems
Perks/Benefits
- N/A
Skills/Tech-stack
Backend Services | CI | Cluster management | Data Pipelines | Distributed Systems | Distributed Training | Fine Tuning | GPU Computing | High Performance | High-Performance Computing | Job Scheduling | Kubernetes | Model Evaluation | Model Parallelism | Monitoring | Observability | Performance Computing | Ray | Resource Management | Slurm | Testing
Education
Roles
Regions
Countries
States
Related jobs
-
Senior-level ContractAustin, United States3h ago
-
AI APIs | Backend Development | Data Engineering | Data Pipelines | Frontend DevelopmentMid-level ContractChandler, United States3h ago
-
Java Full Stack Developer-Software Engineer II USD 93K-155KAPI Design | AWS | Ansible | Artificial Intelligence | BenchmarkingMid-level Full TimeDallas, Texas, United States4h ago
-
Data Engineer II USD 101K-176KAWS | Agile | Airflow | Azure | Azure DevOps401k matching | Health & dental insurance | Onsite fitness center | Time off | Tuition reimbursementSenior-level Full TimeUS-Rhode Island-Providence4h ago
-
C++ | Cloud Native | Container Orchestration | Deep learning | Distributed SystemsCareer growth | Open Source contribution | World Class CollaborationEntry-level Full TimeSan Jose, California, United States4h ago
-
Partner Engineer, Generative AI USD 159K-223KAWS | Agent Orchestration | Azure | Bias Mitigation | C++Senior-level Full TimeMenlo Park, CA5h ago
-
Staff Research Engineer, MRS AI USD 146K-208KA/B | A/B Testing | Alignment techniques | B testing | BenchmarkingSenior-level Full TimeBellevue, WA5h ago
-
Senior Software Developer, Computer Vision, XR USD 100K-253KAr | Augmented Reality | C++ | Computer Vision | Data ProcessingSenior-level Full TimeSan Jose, CA, USA; Waterloo, ON, …6h ago
-
Research Engineer, Pretraining, DeepMind USD 174K-253KFine Tuning | Inference Optimization | JAX | Language Models | Large Language ModelsMid-level Full TimeNew York, NY, USA6h ago
-
Staff Datacloud Blackbelt Engineer, Data and AI USD 183K-266KAI/ML | AI/ML workflows | BigQuery | Cloud Architecture | Computer VisionSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA6h ago
-
2D Games | 3D Games | Agentic Workflows | Algorithms | AngularMid-level Full TimeMountain View, CA, USA6h ago
-
Senior Staff Software Engineer, AI/ML, Google Cloud USD 262K-365KAlgorithms | Data Processing | Data Structures | Debugging | Distributed SystemsSenior-level Full TimeSeattle, WA, USA6h ago
-
Senior Software Engineer, AI/ML, Google Cloud Platforms USD 174K-253KC++ | Code Reviews | Data Processing | Data Structures | Data structures algorithmsSenior-level Full TimeKirkland, WA, USA6h ago
-
Staff Software Engineer, Infrastructure, Google Cloud AI USD 207K-301KCompute Technologies | Cross-Functional Collaboration | Cross-functional | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeKirkland, WA, USA; Sunnyvale, CA, USA6h ago
-
Senior Software Engineer, AI/ML, Google Cloud USD 174K-253KC++ | Data Processing | Debugging | Distributed Computing | Information RetrievalSenior-level Full TimeSunnyvale, CA, USA6h ago
-
Senior Software Engineer, AI/ML GenAI, Google Cloud USD 174K-253KAlgorithms | C++ | Computer Vision | Data Processing | Data StructuresSenior-level Full TimeSunnyvale, CA, USA6h ago
-
C++ | Computer Vision | Data Processing | Debugging | Image classificationSenior-level Full TimeSan Diego, CA, USA6h ago
-
Technical Lead, Storage Distributed and Sovereign Cloud USD 207K-301KAI/ML | AI/ML Workloads | Access Control | Automated remediation | Block StorageSenior-level Full TimeRaleigh, NC, USA; Durham, NC, USA6h ago
-
Agent Construction | Agent Orchestration | Air Gapped Computing | Air-gapped | Data IngestionBonus | Equity | Security clearance travel availabilitySenior-level Full TimeWashington D.C., DC, USA; Maryland, USA6h ago
-
Staff Research Engineer, Applied AI, DeepMind USD 207K-301KAgent workflows | Algorithms | Data Structures | Dataset curation | Deep learningSenior-level Full TimeMountain View, CA, USA6h ago
-
Senior Data Scientist, Machine Learning USD 194K-218KAWS | Active Learning | Airflow | Amazon Redshift | Automated Labeling100% TelecommutingSenior-level Full TimeRedwood City, CA R15h ago
-
AI Engineer USD 125K-201KAWS | Agent Frameworks | Agent SDK | Agent coordination | Claude Agent SDKCollaboration with little supervision | Startup environment | Work on cutting-edge AIEntry-level Full TimePittsburgh, Pennsylvania, United States15h ago
-
Mid-level Full TimeSan Francisco16h ago
-
Data Engineer Data Pipelines and ETL USD 99K-147KAnomaly Detection | Apache Airflow | CDC | Cloud Composer | Data Governance401k plan | Disability benefits | Life insurance | Life insurance coverage | Medical/Dental/VisionMid-level Full TimeBurbank, CA, US, 9150517h ago
-
Sr. Software Engineer, Data Streaming Systems USD 130K-195KAutoscaling | Blocking I/O | CI/CD | Concurrency | Distributed Systems401k plan | Dental insurance | Disability benefits | Life insurance | Medical insuranceSenior-level Full TimeBurbank, CA, US, 9150517h ago