Software Engineer, RL Training Infra
Tasks
- Build RL reliability and efficiency tools
- Collaborate across research infrastructure and partner teams
- Convert operational issues into reusable systems and processes
- Debug training systems and distributed infrastructure
- Improve scaling and orchestration
- Maintain RL training runs fast and reliable
- Reduce inference latency and cost
- Resolve numerical issues and hardware failures
- Support research infra for multi agent capabilities and memory
Perks/Benefits
- N/A
Skills/Tech-stack
Agent systems | Async systems | Debugging | Distributed Systems | Hardware Reliability | Inference | Latency optimization | Machine Learning | Model Evaluation | Multi-Agent | Multi-Agent Systems | Orchestration | Performance optimization | Reinforcement Learning | Scaling | Training Infrastructure
Education
N/A
Roles
Regions
Countries
States
Related jobs
-
AI Engineer USD 114K-190KAI orchestration | GPU Inference | GPU inference optimization | Inference Optimization | KubernetesDisability insurance | Health insurance | Holiday pay | Learning and development | Life insuranceSenior-level Full TimeUSA-DC-Washington1h ago
-
Benchmarking | Code review | Data Pipelines | Distributed Systems | EvaluationSenior-level Full TimeMenlo Park, CA2h ago
-
APIs | Agent systems | Cloud platform | CrewAI | Data PipelinesSenior-level Full TimeSan Francisco, CA, USA; Atlanta, GA, …3h ago
-
C++ | Code review | Compute Technologies | Data Analysis | Data StructuresSenior-level Full TimeSunnyvale, CA, USA3h ago
-
Senior Software Engineer, AI/ML, AI and Infrastructure USD 174K-252KC++ | Data Processing | Data Storage | Data Structures | Data structures algorithmsSenior-level Full TimeMountain View, CA, USA; Kirkland, WA, …3h ago
-
Software Engineer III, AI/ML, Google Workspace USD 147K-211KC++ | Data Processing | Debugging | Distributed Computing | Information RetrievalSenior-level Full TimeKirkland, WA, USA3h ago
-
Entry-level Full TimePalo Alto, CA, US, 9430412h ago
-
AWS | Agile | CI/CD | Code review | Distributed Systems401k match | Commuter benefits | Disability insurance | Electric Car Charging Station | Employee assistance programSenior-level Full TimeSeattle, USA13h ago
-
AWS | Agile | CI/CD | Code review | Data Processing401k match | Commuter benefits | Electric Car Charging Station | Employee assistance program | Flexible spending accountsSenior-level Full TimeSeattle, USA13h ago
-
Junior Software Engineer USD 72K-110KDebugging | Problem Solving | Production Code | Python | Software ArchitectureEntry-level Full TimeUnited States or Canada13h ago
-
API Design | Bare Metal | C++ | Embedded Linux | GRPCHealth benefits | Recovery Benefits | Security clearance sponsorship | Travel opportunitiesSenior-level Full TimeCosta Mesa, California, United States13h ago
-
Agent systems | Automated benchmarking | Chain-of-Thought | DPO | Dataset curationMid-level Full TimePalo Alto, California, USA14h ago
-
Senior Machine Learning Engineer USD 198K-287KData Engineering | Fine Tuning | Foundation Models | GenAI | Incident ResponseOn-call rotationSenior-level Full TimeRemote - US R15h ago
-
Robotics Software Engineer, Behaviors USD 146K-194KArduPilot | Autonomy | Behavior Trees | C++ | Computer VisionMid-level Full TimeCosta Mesa, California, United States15h ago
-
Software Engineer, Robot Interfaces USD 140K-200KAI Planning | Audio signal processing | Cloud Computing | Computer Vision | Deployment AutomationMid-level Full TimeRedwood City, CA16h ago
-
Research Scientist, Open Ecosystem USD 167K-260KDeep learning | Efficient algorithms | Experimental Methodology | Generative AI | Language ModelsFamily leave | Paid vacation | Sick leave | Work-life balanceMid-level Full TimeSeattle, WA16h ago
-
Sr. Solutions Engineer - Oil, Gas, Energy USD 152K-209KAWS | Account Management | Artificial Intelligence | Azure | Big DataSenior-level Full TimeHouston, Texas16h ago
-
Senior-level Full TimeRemote, US R16h ago
-
Senior Backend Engineer, ML Inference Systems USD 135K-237KCI/CD | Distributed Systems | Docker | GCP | GolangCommute subsidy | Employee resource groups | Employee stock ownership | Generous vacation and personal days | Global employee assistance programSenior-level Full TimeMountain View, CA, USA16h ago
-
Data Analyst with AI USD 100K-158KArtificial Intelligence | Business Intelligence | Cloud Platforms | Dashboarding | Data GovernanceContract role | Hybrid work scheduleMid-level Contract Full TimeHouston, TX, United States16h ago
-
Sr. Staff Machine Learning Engineer, Content Ecosystem USD 227K-469KCausal Inference | Data Quality | Experimentation | Game theory | Language ModelsSenior-level Full TimeSan Francisco, CA, US; Remote, US R16h ago
-
Deep learning | Language Models | Language Processing | Large Language Models | Learning algorithmsHealth and wellness programs | Time away from workSenior-level Full TimeSunnyvale, CA, United States17h ago
-
AI Engineer I - Hybrid USD 125K-135KAI Services | API Development | Agentic Workflows | Azure | Azure AIHealth insurance | Hybrid work | Paid time off | Remote work options | Retirement planSenior-level Full TimeWindsor, Colorado, United States R17h ago
-
Senior Machine Learning Engineer, Vector Bidding Science USD 148K-229KA/B | A/B Testing | B testing | BigQuery | Control TheoryCommute subsidy | Disability insurance | Employee assistance program | Employee resource groups | Employee stock ownershipSenior-level Full TimeRemote, Washington, USA R17h ago
-
Senior Machine Learning Engineer, Vector Bidding Science USD 148K-258KA/B | A/B Testing | B testing | BigQuery | Control TheoryCommute subsidy | Comprehensive health insurance | Disability insurance | Employee assistance program | Employee resource groupsSenior-level Full TimeMountain View, CA, USA17h ago