RL Deep Learning Engineer
Tasks
- Build RL environment infrastructure for long horizon legal reasoning
- Build contamination free evaluation pipelines
- Build retrieval reasoning and drafting evaluation tools
- Create benchmark and RL training tasks from legal filings
- Design evaluation harnesses and scoring systems
- Develop sandboxed execution environments
- Develop scalable data pipelines for legal reasoning environments
- Integrate partner model APIs and evaluation harnesses
- Process large scale document datasets
- Support concurrent agent evaluations
- Write production quality Python systems
Perks/Benefits
Skills/Tech-stack
API Integration | Benchmarking | Data Pipelines | Debugging | Deep learning | Document processing | Evaluation | Information Retrieval | Language Models | Large Language Models | Machine Learning | Python | Reinforcement Learning | Sandboxing | Scoring | Search | Systems Design | TypeScript
Education
N/A
Regions
Countries
States
Cities
Related jobs
-
Senior Data Engineer USD 38K-40KApache Airflow | Artificial Intelligence | Change Data Capture | Cloud Computing | Cloud platformHybrid work schedule | MentorshipSenior-level Full Time1300 Gezon Pkwy SW, Wyoming MI, … R17h ago
-
Senior AI Engineer USD 139K-218KAPIs | APIs integration | Access Control | Agent Orchestration | Agentic architectureAsynchronous work | High-performance culture | Remote workSenior-level Full TimeRemote, US R18h ago
-
Machine Learning Engineer II USD 142K-210KAirflow | Anthropic | Artificial Intelligence | CatBoost | Document processingEmployee stock purchase plan | Flexible spending wallets | Health care coverage | Paid time off | Remote-firstMid-level Full TimeRemote US R1d ago
-
Langchain | MLOps | Machine Learning | Matplotlib | NumPyMid-level FreelanceUnited States - Remote R1d ago
-
AI Native Software Engineer USD 130K-220KAgent Orchestration | Agent systems | Autogen | CI/CD | ContainersSenior-level Full TimeRemote (United States) R1d ago
-
Implementation Engineer USD 110K-170KData Engineering | Data Pipelines | Docker | ETL | KubernetesDental insurance | Flexible health stipends | Flexible time off | Health insurance | Learning opportunitiesMid-level Full TimeUnited States R1d ago
-
A/B | A/B Testing | AWS | Apache Spark | B testingSenior-level Full TimeWork from Home, United States, United … R1d ago
-
Software & Analytics Engineer USD 80K-120KAutomated testing | CI/CD | Dash | Data Pipelines | Data TransformationSenior-level Full TimeUnited States - Remote R1d ago
-
Principal AI/ML Researcher / Engineer Reasoning, Planning, and Decision-making systems USD 296K-370KAgent systems | Artificial Intelligence | Belief State Tracking | Caching | Causal modelingSenior-level Full TimeUnited States R1d ago
-
Media Software Engineer, Speech (All Levels) USD 120K-180KAndroid | Artificial Intelligence | Audio Processing | C# | C++401k retirement savings plan | Company holidays | Complimentary lunch and snacks | Fertility support | Medical, dental, and vision insuranceEntry-level Full TimeSunnyvale R2d ago
-
Mid-level Full TimeUnited States R2d ago
-
Deep learning | LLMs | Langchain | MLOps | Machine LearningFlexible schedule | Part-time availability | Project based workMid-level FreelanceUnited States - Remote R2d ago
-
Freelance Machine Learning Engineer USD 180KLLMs | Langchain | MLOps | NumPy | PandasFlexible project-based engagement | Part-time project workMid-level FreelanceUnited States - Remote R2d ago
-
Langchain | Language Models | Large Language Models | MLOps | Machine LearningPart-time projects | Project based workMid-level FreelanceTexas, United States - Remote R2d ago
-
Langchain | Language Models | Large Language Models | MLOps | Model DeploymentFlexible hours | Part time freelance projects | Project based workMid-level FreelanceNew York, United States - Remote R2d ago
-
Freelance Machine Learning Engineer USD 180KGenAI | Langchain | Language Models | Large Language Models | MLOpsPart-time project workMid-level FreelanceTexas, United States - Remote R2d ago
-
Freelance Machine Learning Engineer USD 180KLangchain | Language Models | Large Language Models | MLOps | NumPyFlexible schedule | Part-time opportunities | Project based workMid-level FreelanceNew York, United States - Remote R2d ago
-
Principal Machine Learning Engineer USD 180K-368KAWS Lambda | Amazon SQS | Amazon SageMaker | Automated testing | Backend EngineeringHybrid workSenior-level Full TimeRemote, USA R3d ago
-
Senior Machine Learning Engineer USD 198K-287KData Engineering | Fine Tuning | Foundation Models | GenAI | Incident ResponseOn-call rotationSenior-level Full TimeRemote - US R3d ago
-
Senior-level Full TimeRemote, US R3d ago
-
Sr. Staff Machine Learning Engineer, Content Ecosystem USD 227K-469KCausal Inference | Data Quality | Experimentation | Game theory | Language ModelsSenior-level Full TimeSan Francisco, CA, US; Remote, US R3d ago
-
Senior Data Platform Engineer USD 133K-197KAWS | Amazon IAM | Amazon Redshift | Ansible | Apache IcebergDental benefits | Free 1Password account | Generous paid time off | Health benefits | Maternity and Parental Leave Top-UpSenior-level Full TimeRemote (United States | Canada) R3d ago
-
AI Engineer I - Hybrid USD 125K-135KAI Services | API Development | Agentic Workflows | Azure | Azure AIHealth insurance | Hybrid work | Paid time off | Remote work options | Retirement planSenior-level Full TimeWindsor, Colorado, United States R3d ago
-
Senior Machine Learning Engineer, Vector Bidding Science USD 148K-229KA/B | A/B Testing | B testing | BigQuery | Control TheoryCommute subsidy | Disability insurance | Employee assistance program | Employee resource groups | Employee stock ownershipSenior-level Full TimeRemote, Washington, USA R3d ago
-
Staff Machine Learning Engineer, Vector Bidding Science USD 172K-266KA/B | A/B Testing | B testing | BigQuery | Control TheoryCommute subsidy | Employee resource groups | Employee stock ownership | Generous vacation | Global employee assistance programSenior-level Full TimeRemote, Washington, USA R3d ago