Staff Software Engineer - AI Research Infrastructure
Tasks
- Build services for scheduling and orchestration
- Build workflows to reduce iteration time
- Design infrastructure for large scale experiments
- Develop tools for experiment management
- Implement CI testing infrastructure for research code
- Mentor engineers on compute and AI systems
- Monitor and observe training and inference workloads
Perks/Benefits
- N/A
Skills/Tech-stack
Backend Services | CI testing | Cluster scheduling | Data Pipelines | Distributed Systems | Distributed Training | Fine Tuning | GPU Computing | Job orchestration | Kubernetes | Model Evaluation | Model Parallelism | Ray | Resource Management | Slurm
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Senior-level ContractAustin, United States3h ago
-
AI APIs | Backend Development | Data Engineering | Data Pipelines | Frontend DevelopmentMid-level ContractChandler, United States3h ago
-
Java Full Stack Developer-Software Engineer II USD 93K-155KAPI Design | AWS | Ansible | Artificial Intelligence | BenchmarkingMid-level Full TimeDallas, Texas, United States4h ago
-
C++ | Cloud Native | Container Orchestration | Deep learning | Distributed SystemsCareer growth | Open Source contribution | World Class CollaborationEntry-level Full TimeSan Jose, California, United States4h ago
-
Partner Engineer, Generative AI USD 159K-223KAWS | Agent Orchestration | Azure | Bias Mitigation | C++Senior-level Full TimeMenlo Park, CA5h ago
-
Staff Research Engineer, MRS AI USD 146K-208KA/B | A/B Testing | Alignment techniques | B testing | BenchmarkingSenior-level Full TimeBellevue, WA5h ago
-
Senior Software Developer, Computer Vision, XR USD 100K-253KAr | Augmented Reality | C++ | Computer Vision | Data ProcessingSenior-level Full TimeSan Jose, CA, USA; Waterloo, ON, …6h ago
-
Research Engineer, Pretraining, DeepMind USD 174K-253KFine Tuning | Inference Optimization | JAX | Language Models | Large Language ModelsMid-level Full TimeNew York, NY, USA6h ago
-
Staff Datacloud Blackbelt Engineer, Data and AI USD 183K-266KAI/ML | AI/ML workflows | BigQuery | Cloud Architecture | Computer VisionSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA6h ago
-
Senior Staff Software Engineer, AI/ML, Google Cloud USD 262K-365KAlgorithms | Data Processing | Data Structures | Debugging | Distributed SystemsSenior-level Full TimeSeattle, WA, USA6h ago
-
Senior Software Engineer, AI/ML, Google Cloud Platforms USD 174K-253KC++ | Code Reviews | Data Processing | Data Structures | Data structures algorithmsSenior-level Full TimeKirkland, WA, USA6h ago
-
Staff Software Engineer, Infrastructure, Google Cloud AI USD 207K-301KCompute Technologies | Cross-Functional Collaboration | Cross-functional | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeKirkland, WA, USA; Sunnyvale, CA, USA6h ago
-
Senior Software Engineer, AI/ML, Google Cloud USD 174K-253KC++ | Data Processing | Debugging | Distributed Computing | Information RetrievalSenior-level Full TimeSunnyvale, CA, USA6h ago
-
Senior Software Engineer, AI/ML GenAI, Google Cloud USD 174K-253KAlgorithms | C++ | Computer Vision | Data Processing | Data StructuresSenior-level Full TimeSunnyvale, CA, USA6h ago
-
C++ | Computer Vision | Data Processing | Debugging | Image classificationSenior-level Full TimeSan Diego, CA, USA6h ago
-
Technical Lead, Storage Distributed and Sovereign Cloud USD 207K-301KAI/ML | AI/ML Workloads | Access Control | Automated remediation | Block StorageSenior-level Full TimeRaleigh, NC, USA; Durham, NC, USA6h ago
-
Agent Construction | Agent Orchestration | Air Gapped Computing | Air-gapped | Data IngestionBonus | Equity | Security clearance travel availabilitySenior-level Full TimeWashington D.C., DC, USA; Maryland, USA6h ago
-
Staff Research Engineer, Applied AI, DeepMind USD 207K-301KAgent workflows | Algorithms | Data Structures | Dataset curation | Deep learningSenior-level Full TimeMountain View, CA, USA6h ago
-
Senior Data Scientist, Machine Learning USD 194K-218KAWS | Active Learning | Airflow | Amazon Redshift | Automated Labeling100% TelecommutingSenior-level Full TimeRedwood City, CA R15h ago
-
Mid-level Full TimeSan Francisco16h ago
-
Data Engineer Data Pipelines and ETL USD 99K-147KAnomaly Detection | Apache Airflow | CDC | Cloud Composer | Data Governance401k plan | Disability benefits | Life insurance | Life insurance coverage | Medical/Dental/VisionMid-level Full TimeBurbank, CA, US, 9150517h ago
-
Sr. Software Engineer, Data Streaming Systems USD 130K-195KAutoscaling | Blocking I/O | CI/CD | Concurrency | Distributed Systems401k plan | Dental insurance | Disability benefits | Life insurance | Medical insuranceSenior-level Full TimeBurbank, CA, US, 9150517h ago
-
Senior-level Full Time245 Summer St, Boston MA, United …17h ago
-
Senior-level Full Time1 Spartan Way, Merrimack NH, United …17h ago
-
Machine Learning Engineer USD 140K-190KApache Flink | Apache Kafka | Apache Spark | Bigtable | CI/CDMid-level Full TimeRemote - USA R18h ago