Applied Research - RL & Agents
Tasks
- Architect and maintain distributed training and inference pipelines
- Build evaluations to measure reasoning robustness and agent behavior
- Design and implement reinforcement learning and post training methods
- Design and iterate AI agents for real workloads
- Develop AI agent infrastructure for reliable scalable operation
- Develop observability and monitoring for production reliability and performance
- Experiment with post training recipes for downstream performance
- Prototype agents and evaluation harnesses for real world tasks
- Prototype multi-agent and memory-augmented systems
- Translate ambiguous objectives into technical requirements
Perks/Benefits
- Conference attendance
- Flexible work
- Professional development budget
- Relocation support
- Team offsites
- Visa sponsorship
Skills/Tech-stack
Accelerate | Agent Frameworks | Distributed Training | Distributed inference | Docker | Dspy | Evaluation | Grafana | Kubernetes | Langgraph | Language Models | Large Language Models | Machine Learning | Model Alignment | Next.js | Observability | Prometheus | PyTorch | RLHF | Ray | React | Reinforcement Learning | Terraform | Torch | Tracing | TypeScript | VLLM
Education
N/A
Regions
Countries
States
Related jobs
-
Mid-level Full TimeSan Diego, California, United States3h ago
-
Research Scientist - Technologies of Data Management, LLM and AI Agents - Global Tech Research Program - 2027 Start (PhD) USD 202K-368KAIOps | Artificial Intelligence | CPU Scheduling | Cause analysis | Cloud NativeConference publishing opportunity | Publication supportEntry-level Full TimeSeattle, Washington, United States3h ago
-
Research Scientist - Technologies of Data Management, LLM and AI Agents - Global Tech Research Program - 2027 Start (PhD) USD 212K-387KAIOps | CPU Scheduling | Cause analysis | Cloud Computing | Data centerPublication opportunitiesEntry-level Full TimeSan Jose, California, United States3h ago
-
Data parallelism | Deep learning | Distributed Training | Model Acceleration | Model BenchmarkingSenior-level Full TimeSan Jose, California, United States4h ago
-
Computational optimization | Data parallelism | Deep learning | Distributed Training | Generative AIMid-level Full TimeSan Jose, California, United States4h ago
-
Communication optimization | Data parallelism | Deep learning | Distributed Training | Generative AISenior-level Full TimeSeattle, Washington, United States4h ago
-
A/B | A/B Testing | B testing | Data Analysis | Data ModelingSenior-level Full TimeSan Jose, California, United States4h ago
-
Computer Vision | Information Retrieval | Language Processing | Machine Learning | Natural LanguageSenior-level Full TimeSan Jose, California, United States4h ago
-
A/B | A/B Testing | B testing | Dashboards | Data analyticsSenior-level Full TimeSeattle, Washington, United States4h ago
-
Applied Scientist - Monetization Technology - Global Tech Research Program - 2027 Start (PhD) USD 113K-250KCausal Inference | Causal modeling | Deep learning | Fine Tuning | Generative AIEntry-level Full TimeSan Jose, California, United States4h ago
-
Computer Vision | Deep learning | Language Processing | Machine Learning | Natural LanguageSenior-level Full TimeSan Jose, California, United States4h ago
-
Computer Vision | Language Processing | Machine Learning | Natural Language | Natural Language ProcessingSenior-level Full TimeSan Jose, California, United States4h ago
-
Benchmarking | CUDA | Data parallelism | Distributed Training | Model ParallelismSenior-level Full TimeSan Jose, California, United States4h ago
-
Senior Machine Learning E-commerce Feed Recommendation USD 187K-337KData Analysis | Data Pipelines | Feature Engineering | Machine Learning | Model OptimizationSenior-level Full TimeSeattle, Washington, United States4h ago
-
Click Through Rate | Click Through Rate Prediction | Cold Start | Conversion Rate | Conversion Rate PredictionSenior-level Full TimeSeattle, Washington, United States4h ago
-
Algorithm Design | Click Through Rate | Click Through Rate Prediction | Cold Start | Conversion RateSenior-level Full TimeSan Jose, California, United States4h ago
-
Candidate Generation | Click Through Rate | Click Through Rate Modeling | Cold Start | Conversion RateSenior-level Full TimeSan Jose, California, United States4h ago
-
Research Engineer - Language - MRS AI USD 117K-173KComputer Graphics | Computer Vision | Data Analysis | Deep learning | Generative AIEntry-level Full TimeMenlo Park, CA4h ago
-
Software Engineer, YouTube Ads, Machine Learning USD 147K-211KData Processing | Debugging | Distributed Computing | Language Processing | Machine LearningBonus | Career development | Equity | Health insurance | Paid time offMid-level Full TimeMountain View, CA, USA5h ago
-
Senior Software Engineer, Machine Learning, Vertex AI USD 174K-252KCloud Computing | Data Privacy | Data Processing | Debugging | Fine TuningSenior-level Full TimeSunnyvale, CA, USA5h ago
-
Research Data Scientist, YouTube Creator USD 147K-211KData Analysis | Data Visualization | Database querying | Experimental Design | Machine LearningBonus | Equity | Health insurance | Paid time off | Retirement planMid-level Full TimeSan Bruno, CA, USA; Mountain View, …5h ago
-
Software Engineer, AI/ML, Search USD 174K-252KC++ | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingMid-level Full TimeMountain View, CA, USA5h ago
-
Autotuning | Benchmarking | C++ | CUDA | Code generationSenior-level Full TimeSunnyvale, CA, USA5h ago
-
Software Engineer, Infrastructure USD 150K-252KArtificial Intelligence | Language Processing | Machine Learning | Natural Language | Natural Language ProcessingEntry-level Full TimeSan Francisco Bay Area10h ago
-
Infrastructure Software Engineer, Energy Storage USD 180K-237KDeployment Automation | Distributed Systems | Kubernetes | Network Security | Operating SystemsMid-level Full TimeSan Francisco, California, United States11h ago