Senior ML Engineer – Distributed RL & Post-Training Infrastructure
Tasks
- Architect distributed reinforcement learning infrastructure
- Balance inference loads across a distributed compute network
- Build fault-tolerant model evaluation systems
- Build high-throughput model evaluation and ranking infrastructure
- Create real-time leaderboards and contribution tracking
- Design automated validation and improvement detection
- Develop copy-detection and decoy-detection systems
- Develop post-training pipelines for downloading, fine-tuning, and resubmitting models
- Implement PPO, GRPO, and other RL techniques for coding and reasoning
- Implement anti-gaming and Sybil-resistance mechanisms
- Implement multi-objective optimization
- Monitor and improve system performance
- Optimize distributed caching, model diffing, and monitoring
- Scale the system for high-volume model submissions
- Use cryptographic proofs to verify model ownership and integrity
Perks/Benefits
- N/A
Skills/Tech-stack
Automated Testing | Cryptography | Direct Preference Optimization | Distributed Systems | Docker | Graph Databases | Group Relative Policy Optimization | Inference Load Balancing | JAX | Kubernetes | Load Balancing | Model Evaluation | Model Serving | Monitoring | Multi-Objective Optimization | Objective Optimization | Policy Optimization | Preference Optimization | Proximal Policy Optimization | PyTorch | Python | Reinforcement Learning | Sybil Resistance | Time Series | Time-Series Databases
Education
N/A