AI Research Engineer (Multi-Modal Reinforcement Learning)
Tasks
- Analyze model behavior across modalities
- Conduct research on multi-modal reinforcement learning
- Create simulation environments and datasets
- Design and execute evaluation protocols
- Design scalable RL infrastructure
- Develop reward modeling strategies
- Explore next-generation reinforcement learning paradigms
- Publish research in top AI conferences
Perks/Benefits
- Access To Large Scale Experimentation Infrastructure
- Fully remote
- Global First Work Environment
- Research publication support
Skills/Tech-stack
Autoregressive models | Deep learning | Diffusion Models | Distributed Training | Evaluation | Machine Learning | Multi-Modal | Multi-modal AI | Optimization | Policy Optimization | PyTorch | Reinforcement Learning | Reward Modeling | Simulation
Education
Related jobs
-
Benchmarking | Bottleneck analysis | Cloud Computing | Edge Computing | Inference OptimizationCollaborative international team | Flexible location options | Fully remote | High technical ownership | Professional development opportunitiesMid-level Full TimeSouth Africa R2d ago
-
AI machine learning | Account Management | Business case | Business case development | CRMCareer advancement | Cross Functional Team Culture | Fully remoteExecutive-level Full TimeSouth Africa R3d ago
-
Mid-level Full TimeCape Town, South Africa R3d ago
-
API Security | Access Control | Airflow | Amazon Redshift | AuthenticationFlexible working hours | Remote workSenior-level Full TimeSouth Africa - Remote R3d ago
-
Machine Learning Architect ZAR 840K-1200KAI Agents | AWS | Apache Airflow | Apache Spark | Automated retrainingSenior-level Full TimeSouth Africa - Remote R9d ago
-
AWS | Azure | Embeddings | GCP | Hugging FaceFlexible work environment | Fully remote | Growth opportunities | Remote distributed team experienceSenior-level Full TimeSouth Africa - Remote R11d ago
-
Machine Learning Engineer ZAR 420K-600KAirflow | Amazon SageMaker | CI/CD | Docker | Feature EngineeringAgile environment | Collaborative culture | Feedback and empathy | Freedom and responsibility | Self-organizationMid-level Full TimeSouth Africa - Remote R15d ago
-
Ai Developer ZAR 240K-360KComputer Vision | Data Analysis | Data Preprocessing | Deep learning | Feature EngineeringFlexible working hours | Remote workMid-level Full TimeSouth Africa - Remote R30d ago