AI Research Engineer - Reinforcement Learning
Tasks
- Build simulation environments
- Create training datasets
- Define success metrics for deployed systems
- Design reinforcement learning algorithms
- Develop reinforcement learning pipelines
- Document experimental findings
- Improve policy performance
- Integrate reinforcement learning agents into production systems
- Monitor deployed reinforcement learning systems
- Optimize exploration strategies
- Run large-scale reinforcement learning experiments
Perks/Benefits
Skills/Tech-stack
Actor-critic | Deep reinforcement learning | Exploration/exploitation | GRPO | Machine Learning | Multi-modal AI | Natural Language Processing | Online Reinforcement Learning | Policy Optimization | Policy gradients | PyTorch | Reinforcement Learning | Sample efficiency
Education
Associate Degree | Bachelor of Engineering | Bachelor of Science | Master of Science | Doctor of Philosophy (PhD)