Staff Software Engineer - AI Research Infrastructure
Tasks
- Build services for scheduling and orchestration
- Build workflows to reduce iteration time
- Design infrastructure for large scale experiments
- Develop tools for experiment management
- Implement CI testing infrastructure for research code
- Mentor engineers on compute and AI systems
- Monitor and observe training and inference workloads
Perks/Benefits
- N/A
Skills/Tech-stack
Backend Services | CI testing | Cluster scheduling | Data Pipelines | Distributed Systems | Distributed Training | Fine Tuning | GPU Computing | Job orchestration | Kubernetes | Model Evaluation | Model Parallelism | Ray | Resource Management | Slurm
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
C++ | Cloud Native | Container Orchestration | Deep learning | Distributed SystemsCareer growth | Open Source contribution | World Class CollaborationEntry-level Full TimeSan Jose, California, United States3h ago
-
Partner Engineer, Generative AI USD 159K-223KAWS | Agent Orchestration | Azure | Bias Mitigation | C++Senior-level Full TimeMenlo Park, CA4h ago
-
Staff Research Engineer, MRS AI USD 146K-208KA/B | A/B Testing | Alignment techniques | B testing | BenchmarkingSenior-level Full TimeBellevue, WA4h ago
-
Senior Data Scientist, Machine Learning USD 194K-218KAWS | Active Learning | Airflow | Amazon Redshift | Automated Labeling100% TelecommutingSenior-level Full TimeRedwood City, CA R13h ago
-
Mid-level Full TimeSan Francisco15h ago
-
Data Engineer Data Pipelines and ETL USD 99K-147KAnomaly Detection | Apache Airflow | CDC | Cloud Composer | Data Governance401k plan | Disability benefits | Life insurance | Life insurance coverage | Medical/Dental/VisionMid-level Full TimeBurbank, CA, US, 9150515h ago
-
Sr. Software Engineer, Data Streaming Systems USD 130K-195KAutoscaling | Blocking I/O | CI/CD | Concurrency | Distributed Systems401k plan | Dental insurance | Disability benefits | Life insurance | Medical insuranceSenior-level Full TimeBurbank, CA, US, 9150515h ago
-
Machine Learning Engineer USD 140K-190KApache Flink | Apache Kafka | Apache Spark | Bigtable | CI/CDMid-level Full TimeRemote - USA R16h ago
-
Research Engineers, Data USD 150K-250KData Annotation | Data Drift | Data Modeling | Data Pipelines | Data Quality401k | Access to modern AI tools | Commuter benefits | In-office lunch | Medical, dental & vision coverageSenior-level Full TimeSan Francisco17h ago
-
Senior Data Engineer III USD 183K-205KAWS EMR | AWS S3 | Access Control | Amazon Redshift | Apache AirflowSenior-level Full TimeUnited States17h ago
-
Senior Embedded Software Engineer - Future Forward USD 153K-201KAgile | Authentication | Board Bring-up | Bring-up | C#Senior-level Full TimeSunnyvale, CA, United States R17h ago
-
Principal Applied Scientist USD 200K-250KA/B | A/B Testing | Agent Orchestration | B testing | Continuous LearningCareer growth | Hybrid work flexibility | Mentorship | Remote work option | Training opportunitiesSenior-level Full TimeBellevue17h ago
-
Senior-level Full TimeUnited States18h ago
-
Software Engineer, Storage USD 153K-196KAlertmanager | As-a-Service | Availability | C++ | CassandraEquity compensation | Onsite optionSenior-level Full TimeSan Mateo, CA, United States R18h ago
-
Member of Technical Staff (Storage) USD 185K-200KAI Assisted Development | C++ | Concurrency Control | Data replication | Distributed SystemsDental insurance | Flexible time off | Life and disability insurance | Medical insurance | Mental wellbeing benefitsSenior-level Full TimeNew York, NY R18h ago
-
Member of Technical Staff (Storage) USD 185K-200KAI Assisted Development | C# | C++ | Concurrency Control | Data replicationDental insurance | Flexible time off | Life and disability insurance | Medical insurance | Mental wellbeing benefitsSenior-level Full TimeNew York, NY R18h ago
-
Architecture Review | Assembly | C# | C++ | Code review401k retirement plan | Company shuttles | Dental insurance | Employee stock purchase plan | Life insuranceSenior-level Full TimeRedmond, WA18h ago
-
Assembly | C# | C++ | Convex Optimization | Distributed Systems401k retirement plan | Dental insurance | Disability insurance | Employee stock purchase plan | Life insuranceSenior-level Full TimePalo Alto, CA18h ago
-
Assembly | C# | C++ | Convex Optimization | Distributed Systems401k | Dental insurance | Employee stock purchase plan | Life insurance | Medical insuranceSenior-level Full TimePalo Alto, CA18h ago
-
Assembly | C# | C++ | Convex Optimization | Debugging401k | Company shuttle | Dental insurance | Disability insurance | Employee discountsSenior-level Full TimeRedmond, WA18h ago
-
Software Engineer - Sensor Systems, Robot Software USD 209K-291KBuild systems | C++ | Core dumps | Data Streaming | DebuggingHybrid work | Inclusive work environment | Work from home optionMid-level Full TimeSunnyvale19h ago
-
Senior Software Engineer - Data Platform USD 130K-220KAWS Lambda | AWS RDS | Airflow | Amundsen | Apache HiveHealth insurance | Parental leave | Professional development stipend | Remote workSenior-level Full TimeRemote - US R19h ago
-
Data Engineer, Palo Alto USD 110K-180KAWS | Data Modeling | Data Pipelines | DynamoDB | ETL401k matching | Dental insurance | Flexible spending accounts | Health insurance | HolidaysMid-level Full TimePalo Alto, California, United States19h ago
-
Robotics Systems Software Engineer USD 141K-224KAlgorithms | C++ | CI/CD | Collision detection | Computer VisionCompany holidays | Health insurance | Learning and development reimbursement | Life insurance | Long-term disabilityMid-level Full TimeTorrance, California, United States19h ago
-
Industrial Robotics Engineer USD 141K-224KAlgorithms | C++ | CI/CD | Collision detection | Computer VisionCompany holidays | Health insurance | Hybrid work | Learning and development reimbursement | Life insuranceMid-level Full TimeTorrance, California, United States19h ago