Find jobs in AI/ML, Data Science and Big Data
35 results
for Policy Optimization
(Skill/Tech stack)
-
AI Research Manager - Machine Learning USD 190K-287KCausal Inference | Contrastive Learning | Deep learning | Distributed Systems | Embeddings401k | Dental and vision insurance | Extended maternity leave | Extended paternity leave | Flexible spending accountMid-level Full TimeUSA, Palo Alto4d ago
-
具身智能-强化学习(灵巧操作方向) 实习生 CNY 25K-37KActor-critic | Diffusion Models | Distributed Training | Embodied intelligence | Flow matchingEntry-level Full Time Internship深圳7d ago
-
Applied Machine Learning Scientist - Vice President USD 107K-160KA/B | A/B Testing | AdaLoRA | Agent Orchestration | B testingBackup childcare | Financial coaching | Health and wellness programs | Health insurance | Mental health supportExecutive-level Full TimePalo Alto, CA, United States7d ago
-
Mid-level Internship上海7d ago
-
Senior Staff AI Engineer USD 180K-240KA3C | Actor-critic | Adaptive computation | Benchmarks | C plus plusSenior-level Full TimeLos Altos, California,7d ago
-
Staff Data Science Researcher ILS 285K-366KA/B | A/B Testing | AI Agents | AWS Bedrock | Agent systemsFlexible schedule | Hybrid work model | Mentorship culture | Remote work daysSenior-level Full TimeIsrael - Raanana R9d ago
-
Mid-level Full Time北京 R10d ago
-
AI Engineer - Reinforcement Learning EUR 56K-85KData Pipelines | Evaluation Frameworks | Fine Tuning | Human-in-the-loop | Human-in-the-loop machine learningSenior-level Full TimeParis, France12d ago
-
Reinforcement Learning AI Engineer USD 99K-225KArtificial neural networks | C++ | CUDA | Containerization | Distributed TrainingDependent care | Disability insurance | Health insurance | Life insurance | Paid leaveMid-level Full TimeUSA, CO, Colorado Springs (745 Space …14d ago
-
AI Feedback | Agentic Systems | Direct Preference Optimization | Distributed Training | EvaluationSenior-level Full TimeAMER - United States - California … R15d ago
-
Applied Machine Learning Engineer USD 110K-165KA3C | Apache Kafka | C plus plus | C# | CUDA401k plan | Education assistance | Flexible work schedules | Health care and wellness plans | Paid HolidaysSenior-level Full TimeColorado Springs, United States15d ago
-
Audio Processing | Autoregression | Autoregressive models | Computer Vision | Deep learningRemote workSenior-level Full TimeRemote job R21d ago
-
Research Scientist, Safety Post Training USD 216K-270KAdversarial evaluation | Direct Preference Optimization | Generative AI | Group Relative Policy Optimization | Human FeedbackCommuter stipend | Comprehensive health insurance | Dental insurance | Learning and development stipend | Paid time offSenior-level Full TimeSan Francisco, CA; New York, NY22d ago
-
Applied AI Engineer USD 99K-225KAWS | AgentOps | Azure | ChromaDB | Continued Pretraining401k retirement plan | Bike storage | Commuter benefits | Dependent care FSA | Desk setup stipendMid-level Full TimeWashington DC23d ago
-
AI Scientist GBP 46K-46KAzure | Azure OpenAI | Azure OpenAI Services | Databricks | Dataset PreparationMid-level Full TimeLondon, United Kingdom26d ago
-
Principal Machine Learning Engineer, Short-form USD 233K-350KCloud platform | Data Modeling | Feedback Loop Mitigation | Feedback loop | GCP Pipelines401k plan | Dental insurance | Disability insurance | Life insurance | Medical insuranceSenior-level Full TimeNew York, NY, US, 1003626d ago
-
Senior Machine Learning Engineer, RL / Locomotion USD 220K-336KActor-critic | Domain Randomization | GPU Computing | Isaac Lab | Isaac-GymHealth benefits | Recovery BenefitsSenior-level Full TimeCosta Mesa, California, United States26d ago
-
Research Engineer, Applied AI Engineering USD 250K-555KAds Ranking | Algorithms | Data Pipelines | Data Structures | Deep learningMid-level Full TimeSan Francisco27d ago
-
Head of World Models (Universal Robots, India) INR 3000K-6000KAI orchestration | Actor-critic | Agent Frameworks | Autogen | DPOExecutive-level Full TimeBangalore, IN28d ago
-
Analytics Team Lead USD 109K-230KA/B | A/B Testing | B testing | Crime analysis | Data AnalysisBonus eligibility | Remote workSenior-level Full TimeHome based-Florida, United States R30d ago
-
Applied Scientist, Trustworthy Shopping Experience (TSE) INR 2000K-4000KAgentic AI | Computer Vision | Cross-modal alignment | Data Warehousing | Deep learningSenior-level Full TimeBengaluru, Karnataka, IND1mo ago
-
Deep learning | GPU Computing | Language Models | Language Processing | Large Language ModelsEntry-level Full Time InternshipUS, CA, Santa Clara, United States1mo ago
-
Senior AI Engineer - VLA Foundation Model CHF 128K-192KAutonomy | Diffusion Models | Edge Computing | Generative AI | Imitation LearningIn person Work Mode | Mentorship experienceSenior-level Full TimeZürich1mo ago
-
Agentic Systems | Deep learning | Diffusion Models | Fine Tuning | Generative AI401k eligibility | Annual bonus | Dental insurance | Medical insurance | Paid time offSenior-level Full TimeLos Altos, CA1mo ago
-
Applied Scientist, Customer Behavior Analytics USD 142K-193KCounterfactual analysis | Deep learning | Econometrics | Generative Models | Language ModelsMid-level Full TimeSeattle, Washington, USA1mo ago
-
Senior Principal Machine Learning Engineer (Fulfilment) SGD 182K-240KDecision Processes | DeepSpeed | Direct Preference Optimization | Distributed Training | Dynamic ModelsBirthday leave | Confidential Assistance Programme | FlexWork | Medical insurance | Parental leaveExecutive-level Full TimeSingapore, Singapore1mo ago
-
Adversarial Networks | Computer Vision | Cross-modal alignment | GRPO | Generative Adversarial NetworksEntry-level InternshipSeattle, Washington, United States1mo ago
-
Asynchronous programming | Concurrency | Deep learning | Distributed Systems | JAXCompany-provided equipment | Flexible hours | Fully remote work | Health insurance allowance | Home-office allowanceMid-level Full TimeRemote (EMEA/East Coast) R1mo ago
-
Data Analysis | Dataset Processing | Direct Preference Optimization | Evaluation Pipelines | Fine TuningEntry-level InternshipSan Jose, California, United States1mo ago
-
Senior Director, AI Model LifeCycle USD 301K-355KCheckpointing | Dataset versioning | Experiment tracking | Failure recovery | Fine Tuning401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA contributionsSenior-level Full TimeSan Francisco, CA - US1mo ago
-
Actor-critic | Air Traffic Management | Air traffic | Machine Learning | OptimizationFlexible working space | Informal corporate culture | Thesis assignment allowanceEntry-level InternshipAmsterdam, Noord-Holland, Nederland R1mo ago
-
Tech Lead, Robotic AI Model USD 150K-180KAction Chunking | Action Tokenization | Behavior Cloning | DPO | DeepSpeedSenior-level Full TimeEl Segundo, California, United States1mo ago
-
Senior AI Engineer Specialist INR 2500K-3500KAgentic AI | Apache Spark | Direct Preference Optimization | Distributed Computing | Embedding architecturesSenior-level Full TimeIND - Bengaluru - Esko-Graphics India …1mo ago
-
Robotics & Reinforcement Learning Engineer EUR 60K-84KActor-critic | Actuator modeling | Behavior Cloning | C++ | Control SystemsAnnual leave | Early Friday finish | Flexible working hours | Free coffee and tea | Permanent full-time contractSenior-level Contract Full TimeBarcelona, CT, Spain1mo ago
-
AI Engineer - Imitation Learning (Senior) CHF 128K-192KAutonomy | C++ | Diffusion Model | Diffusion Policy | Generative AIIn-person collaborationSenior-level Full TimeZürich1mo ago