Find jobs in AI/ML, Data Science and Big Data
34 results for Policy Optimization (Skill/Tech stack)
-
Adversarial Networks | Computer Vision | Cross-modal alignment | GRPO | Generative Adversarial Networks
Entry-level Internship
Seattle, Washington, United States
1d ago
-
Asynchronous programming | Concurrency | Deep learning | Distributed Systems | JAX
Company-provided equipment | Flexible hours | Fully remote work | Health insurance allowance | Home-office allowance
Mid-level Full Time
Remote (EMEA/East Coast) R
2d ago
-
Data Analysis | Dataset Processing | Direct Preference Optimization | Evaluation Pipelines | Fine Tuning
Entry-level Internship
San Jose, California, United States
5d ago
-
Senior Director, AI Model LifeCycle USD 301K-355K
Checkpointing | Dataset versioning | Experiment tracking | Failure recovery | Fine Tuning
401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA contributions
Senior-level Full Time
San Francisco, CA - US
5d ago
-
Actor-critic | Air Traffic Management | Air traffic | Machine Learning | Optimization
Flexible working space | Informal corporate culture | Thesis assignment allowance
Entry-level Internship
Amsterdam, Noord-Holland, Nederland R
6d ago
-
Reinforcement Learning AI Engineer USD 99K-225K
Artificial neural networks | C++ | CUDA | Cloud Computing | Distributed Training
Dependent care support | Disability insurance | Health insurance | Life insurance | Paid leave
Mid-level Full Time
USA, CO, Colorado Springs (745 Space …
7d ago
-
Tech Lead, Robotic AI Model USD 150K-180K
Action Chunking | Action Tokenization | Behavior Cloning | DPO | DeepSpeed
Senior-level Full Time
El Segundo, California, United States
8d ago
-
Senior Product Manager, LLM Post-Training & Evaluation USD 160K-170K
AI Feedback | API Design | Agentic Evaluation | Benchmarking | Context evaluation
Senior-level Full Time
Remote Work (USA), United States R
8d ago
-
Senior AI Engineer Specialist INR 2500K-3500K
Agentic AI | Apache Spark | Direct Preference Optimization | Distributed Computing | Embedding architectures
Senior-level Full Time
IND - Bengaluru - Esko-Graphics India …
10d ago
-
Applied Scientist, Amazon Customer Service USD 142K-222K
Agentic AI | Artificial Intelligence | Dataset curation | Direct Preference Optimization | Embedding Models
Mid-level Full Time
Santa Clara, California, USA
14d ago
-
Robotics & Reinforcement Learning Engineer EUR 60K-84K
Actor-critic | Actuator modeling | Behavior Cloning | C++ | Control Systems
Annual leave | Early Friday finish | Flexible working hours | Free coffee and tea | Permanent full-time contract
Senior-level Contract Full Time
Barcelona, CT, Spain
17d ago
-
AI Engineer - Imitation Learning (Senior) CHF 128K-192K
Autonomy | C++ | Diffusion Model | Diffusion Policy | Generative AI
In-person collaboration
Senior-level Full Time
Zürich
17d ago
-
Helix AI Engineer, Reinforcement Learning USD 150K-350K
Credit Assignment | Distributed Training | Experiment Management | Exploration | Model-based reinforcement learning
In-office collaboration
Senior-level Full Time
San Jose, CA
21d ago
-
AI Engineer - Reinforcement Learning EUR 60K-84K
Data Pipelines | Evaluation Frameworks | Fine Tuning | Human-in-the-loop | Language model fine-tuning
Senior-level Full Time
Paris, France
21d ago
-
Staff Software Engineer, Generative AI, Core ML USD 207K-300K
AI Feedback | Computer Vision | Data Processing | Deep learning | Digital Twin
Senior-level Full Time
Mountain View, CA, USA
22d ago
-
Machine Learning Engineer (Post-Training) EUR 57K-84K
AWS | Data Pipelines | Data-parallel | DeepSpeed | Direct Preference Optimization
Senior-level Full Time
Paris, France
22d ago
-
Entry-level Full Time Internship
Singapore
23d ago
-
Senior-level Full Time
Beijing, Shanghai
27d ago
-
DDP | Deep learning | Direct Preference Optimization | Distributed Training | Docker
Senior-level Full Time
Pangyo (Software Dream Center), South Korea
29d ago
-
Data Science Intern GBP 25K-25K
Agile | Constrained optimization | Continuous Delivery | Experiment tracking | Gymnasium
Annual bonus | Charitable Causes Initiatives | Health insurance | Pension | Retention Bank
Entry-level Internship
London, GB
29d ago
-
Large Model Application Algorithm Engineer/Expert CNY 240K-480K
C++ | Computer Vision | Deep learning | Direct Preference Optimization | Human Computer Dialogue
Senior-level Full Time
Shanghai, Beijing
29d ago
-
Senior Applied AI Manager USD 170K-234K
Agent systems | Agentic Systems | Curriculum learning | Data Deduplication | Data mixing
Senior-level Full Time
San Mateo, CA
30d ago
-
Tech Lead Manager- MLRE, ML Systems USD 264K-331K
CUDA | Distributed Systems | Flash Attention | GRPO | Human Feedback
Commuter stipend | Generous PTO | Health, dental and vision coverage | Learning and development stipend | Retirement benefits
Senior-level Full Time
San Francisco, CA; New York, NY
30d ago
-
Fine Tuning | Hugging Face | JAX | Language Processing | Llama CPP
Generous parental leave policy | Health insurance | Meal vouchers | Private pension plan | Sport allowance
Mid-level Full Time
Paris
1mo ago
-
Agent RL Infra Engineer USD 224K-356K
AI Feedback | Active Learning | Cluster management | Continuous Learning | Data Curation
Senior-level Full Time
US, CA, Santa Clara, United States
1mo ago
-
Sr Machine Learning Engineer USD 159K-236K
AWS | Alignment Tuning | Anomaly Detection | Azure | BERT
Financial security support | Flexible hybrid work model | Healthcare coverage | Mental health resources | Paid time off
Senior-level Full Time
USA - California - San Jose …
1mo ago
-
AI Engineer - Reinforcement Learning (Senior) CHF 128K-192K
Artificial neural networks | Autonomy | C plus plus | Computer Vision | Deep learning
In-person collaboration
Senior-level Full Time
Zürich
1mo ago
-
Applied Reinforcement Learning Engineer USD 150K-160K
Actor-critic | Agent systems | BCQ | Behavioral cloning | CQL
Equal opportunity employer | Hybrid remote work | Research publications opportunity
Mid-level Full Time
Remote Work (USA), United States R
1mo ago
-
Data Science Researcher ILS 341K-443K
A I | A I Safety | A/B | A/B Testing | AWS Bedrock
Career growth opportunities | Flexible schedule | Hybrid work model | Mentoring | Remote work flexibility
Senior-level Full Time
Israel - Raanana
1mo ago
-
Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback
401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributions
Senior-level Full Time
San Francisco, CA - US
1mo ago
-
Staff Software Engineer, Model LifeCycle USD 208K-253K
API Design | Checkpointing | Distributed Training | Failure recovery | Fine Tuning
401k match | Cell phone stipend | Commuter benefits | Dental insurance | Employer HSA contributions
Senior-level Full Time
San Francisco, CA - US
1mo ago
-
Staff AI Engineer, Model Post-Training and Alignment USD 196K-268K
Benchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Generalized Reward Policy Optimization
Company events | Comprehensive healthcare | Education subsidy | Learning and development programs | Meal allowances
Senior-level Full Time
APAC
1mo ago
-
Researcher, Loss of Control USD 295K-445K
Algorithms | Data Structures | Deep learning | Evaluation | Fine Tuning
Senior-level Full Time
San Francisco
1mo ago
-
Senior AI Research Scientist (6240) USD 170K-270K
Adversarial Learning | Attention Networks | Dash | Data Preprocessing | Data Wrangling
Hybrid work schedule | Professional development programs | Travel for training and team building
Senior-level Full Time
San Jose, CA, US
1mo ago