Find jobs in AI/ML, Data Science and Big Data
150 results for RLHF (Skill/Tech stack)
-
Mid-level Full Time · Bengaluru, Karnataka, India · 12h ago
-
Sr. Staff Data Scientist- Eng · USD 145K-209K · Agent systems | Agentic AI | BigQuery | Classification | Data Governance · Senior-level Full Time · Lowell, MA, United States R · 2d ago
-
Data Scientist (Remote) · USD 140K-215K · Context Management | DPO | DeepSpeed | Experiment tracking | Experimental Design · Employee networks | Great Place to Work certification | Paid adoption leave | Paid parental leave | Professional development · Mid-level Full Time · USA VA Remote, United States R · 2d ago
-
LLM Engineer · USD 100K-150K · Adapter methods | DPO | Deep reinforcement learning | Distributed Training | Efficient Attention · Benefits | Career growth | Mentorship | Remote work · Mid-level Full Time · United States - Remote R · 2d ago
-
LLM Engineer · USD 100K-150K · DPO | Deep learning | Distributed Training | Efficient Attention | Efficient Fine Tuning · Remote work · Mid-level Full Time · United States - Remote R · 2d ago
-
AI Software Engineer - Fixed Income Technology · USD 110K-150K · AI Foundry | API | AWS | AWS Bedrock | Azure · Discretionary bonus | Life insurance | Medical/Dental/Vision insurance | Paid time off | Retirement contributions · Senior-level Full Time · IL01 - Chicago, United States · 2d ago
-
LLM Engineer · USD 100K · Adapter methods | DPO | DeepSpeed ZeRO | Distributed Training | Efficient Attention · Career growth | Mentorship | Remote work · Mid-level Full Time · United States - Remote R · 2d ago
-
LLM Engineer · USD 100K-150K · Adapter Layers | Attention Optimization | Benchmarking | DPO | Evaluation methodology · Career growth | Full-time employment benefits | Remote work | Technical mentorship · Mid-level Full Time · United States - Remote R · 2d ago
-
Mid-level Full Time · United States - Remote R · 2d ago
-
LLM Engineer · USD 100K-150K · Adapter | DPO | Dataset Distillation | Deep learning | Efficient Attention · Career growth | Remote work · Mid-level Full Time · United States - Remote R · 2d ago
-
LLM Engineer · USD 100K-150K · Adapters | Attention Optimization | DPO | Data Generation | Dataset Distillation · Career growth | Remote work · Mid-level Full Time · United States - Remote R · 2d ago
-
Large Model Algorithm Engineer (Open-Domain Dialogue) · CNY 180K-300K · A/B | A/B Testing | Agentic reinforcement learning | B testing | DeepSpeed · Mid-level Internship · Shanghai, Beijing · 4d ago
-
LLM Post-Training / Agentic Algorithm Engineer · CNY 180K-360K · Distributed Training | Function Calling | GRPO | Human Feedback | JSON · Entry-level Full Time · Shanghai, Beijing · 5d ago
-
Agent Post-Training, Artifacts Research · USD 295K-445K · Data Pipelines | Evals | Experimentation | Grader Systems | Language Processing · Mid-level Full Time · San Francisco · 5d ago
-
Agent Post-Training, Computer Use Research · USD 295K-445K · Data pipeline | Evaluation | Experimentation | Grader Development | Machine Learning · Senior-level Full Time · San Francisco · 5d ago
-
Agent Post-Training, Connectors Research · USD 295K-445K · Data Pipelines | Deep learning | Experimentation | Language Models | Language Processing · Senior-level Full Time · San Francisco · 5d ago
-
Agent Post-Training, Context Research · USD 295K-445K · Data Pipelines | Deep learning | Experimentation | Grading | Language Models · Mid-level Full Time · San Francisco · 5d ago
-
Senior Data Scientist II · INR 3000K-4800K · Agent systems | Agentic AI | Anomaly Detection | CI/CD | Classification · Senior-level Full Time · Bangalore, Karnataka, India · 5d ago
-
GenAI Project Lead in Washington, DC · USD 116K-243K · AI guardrails | App Service | Azure | Azure App | Azure App Service · Senior-level Full Time · Washington, DC · 6d ago
-
Applied AI ML Lead · INR 3264K-5000K · AI Libraries | API Development | AWS | Agentic AI | Agentic AI libraries · Senior-level Full Time · Bengaluru, Karnataka, India · 6d ago
-
Agent Post-Training Research · USD 295K-445K · AI Feedback | Agent systems | Calibrated Reasoning | Data Pipelines | Deep learning · Mid-level Full Time · San Francisco · 6d ago
-
Mid-level Full Time · United States - Remote R · 6d ago
-
Senior AI Scientist · USD 180K-260K · Ablation Studies | Adapters | Calibration | Continued Pretraining | DPO · 401k contributions | Company covered preventive health screenings | Continuing medical education CME CEU support | Fertility and family planning | Fitness perks · Senior-level Full Time · United States - Remote R · 6d ago
-
LLM Engineer · USD 100K-150K · Adapters | Attention Optimization | DPO | Distributed Training | Evaluation benchmarks · Mid-level Full Time · United States - Remote R · 6d ago
-
Top Talent - Multimodal Interaction Algorithm Engineer - X-Lab · CNY 240K-480K · Attention | Benchmarking | Computer Vision | Deep learning | Hard Negative Mining · Senior-level Full Time · Shanghai, Shenzhen · 6d ago
-
Abstractive summarization | Active Learning | DPO | Data Privacy | Deep learning · In-office work options | Professional development | Remote work flexibility · Senior-level Full Time · Mountain View, CALIFORNIA, United States · 6d ago
-
Senior-level Full Time · Chicago, Illinois, USA R · 7d ago
-
Sr. GenAI Specialist SA, Solutions Architecture · TWD 1000K-1400K · A/B | A/B Testing | Agent Orchestration | Agentic Workflows | Amazon Bedrock · Senior-level Full Time · Taipei City, TWN · 7d ago
-
Senior Solutions Architect, Generative AI Research · USD 184K-287K · Agent evaluation | Autogen | Batching | Benchmarking | Checkpointing · Senior-level Full Time · US, FL, Remote R · 7d ago
-
Sr. Lead AI Engineer · USD 149K-193K · Cloud Computing | DPO | Distributed Training | Experiment tracking | GPU Training · Senior-level Full Time · Foster City, CA, United States · 7d ago
-
Attention Mechanism | DPO | Data Generation | Data Preprocessing | Deep learning · Entry-level Internship · California, Sunnyvale · 8d ago
-
Senior MLOps & Generative AI Engineer - Remote · USD 91K-152K · AWS | Alerting | Azure | CI/CD | Deep learning · Adoption reimbursement | Emergency backup care | Fertility and surrogacy reimbursement | Long-term disability | Medical/Dental/Vision · Senior-level Full Time · Corp Facilities MPB - 350 Centre … R · 8d ago
-
Lead Architect · INR 3500K-5500K · Accelerator operations | Adversarial Testing | Agent Orchestration | Agentic Retrieval Augmented Generation | CI/CD · Hybrid work option | Work from office at least 3 days per week · Senior-level Full Time · DGS India - Bengaluru - Manyata … · 8d ago
-
Entry-level Internship · Shenzhen, Shanghai · 9d ago
-
LLM Algorithm Intern (Embodied Brain Track) · CNY 25K-37K · Agentic RL | LLM Agent | Machine Learning | PyTorch | RLHF · Conference participation | Internship experience | Research mentorship · Entry-level Internship · Shenzhen · 9d ago
-
Large Model Infra R&D Intern (Agentic RL Track) · CNY 25K-37K · Asynchronous programming | Concurrency | Distributed Systems | Docker | GRPO · Entry-level Internship · Shenzhen · 9d ago
-
Senior-level Full Time · India · 9d ago
-
LLM Ops Engineer · AUD 130K-150K · A B Deployment | A/B | AWS | Alerting | Autoscaling · Equity compensation | Flexible hybrid working | Personal development budget · Senior-level Full Time · Melbourne · 9d ago
-
Senior Applied Scientist, Alexa AI · USD 167K-227K · Agentic Architectures | Automated Training | Automated training pipelines | C++ | DPO · Senior-level Full Time · Turin, Piedmont, ITA · 9d ago
-
Attention Mechanisms | DPO | Data Preprocessing | Distributed Training | GPU Training · Entry-level Internship · Sunnyvale, California, United States Interns/Temp Intern … · 9d ago
-
Machine Learning Engineer, Specialist · INR 3000K-5000K · Agent systems | Attention Mechanism | Containerization | DPO | Deep learning · Financial wellness programs | Hybrid work model | Personal wellness resources | Physical wellness benefits | Wellness support · Senior-level Full Time · Hyderabad, India · 10d ago
-
Senior-level Full Time · Timisoara, RO · 12d ago
-
Artificial Intelligence | Authentication | Benchmarking | Checkpointing | Cluster Orchestration · Flexible working models | Health and wellbeing benefits | Learning and development opportunities · Senior-level Full Time · Potsdam, DE, 14469 · 12d ago
-
AI Research Scientist (Europe/UK - Remote) · USD 186K-258K · Fine Tuning | Human Feedback | JAX | LLM Fine-tuning | Language Processing · Dental insurance | Discretionary vacation | Equity shares | Flexible working hours | Health insurance · Mid-level Full Time · Europe R · 13d ago
-
Algorithms | COT | Data Structures | Deep learning | Dense Vector Databases · Senior-level Full Time · Toronto · 13d ago
-
Generative AI - Group Manager - Senior Vice President · USD 176K-265K · AI compliance | AI guardrails | AWQ | AWS | Autogen · 401k | Accident and disability insurance | Life insurance | Medical, dental & vision coverage | Paid Holidays · Senior-level Full Time · 480 WASHINGTON BOULEVARD JERSEY CITY, United … · 13d ago
-
Sr GenAI Infra Specialist SA, AWS WWSO Startup · USD 153K-228K · AWS Inferentia | AWS Trainium | Amazon Web Services | Batching | CUDA · Senior-level Full Time · New York, New York, USA · 13d ago
-
Generative AI - Group Manager - Senior Vice President · USD 176K-265K · AI Applications | AI compliance | AWQ | AWS | Adapter-Tuning · 401 K | Disability insurance | Life insurance | Medical, dental, and vision coverage | Paid Holidays · Senior-level Full Time · Location(s): Jersey City, New Jersey, United … · 13d ago
-
Staff Agentic ML Engineer - Photoshop · USD 190K-345K · DPO | Data Curation | Evaluation Frameworks | Fine Tuning | LLM Fine-tuning · Senior-level Full Time · San Jose, United States R · 14d ago
-
Agent systems | DPO | Distributed Training | Fine Tuning | JAX · Entry-level Full Time · US-WA-Bellevue · 14d ago