LLM Engineer (Reinforcement Learning)

Pangyo (Software Dream Center), South Korea

Senior-level Full Time

@ 4...

Apply Save

Found 1d ago

Tasks

Design self refine training structure
Develop foundation models integrated with external knowledge and APIs
Enhance generation accuracy and stability
Improve LLM training efficiency
Optimize direct alignment training with PPO GRPO DPO
Prevent reward hacking
Train models that select external tools based on instruction types

Perks/Benefits

Skills/Tech-stack

Apply Save

Language: ko | Views: 1 | Clicks: 0 | Saves: 0

Related jobs

Deep Learning Engineer (음성 인식 및 wake up word 기능 개발)

Android | Attention Mechanisms | C# | C++ | CI/CD

Senior-level Full Time

Pangyo (Software Dream Center), South Korea

1d ago
Machine Learning Engineer, R&D

API Development | C++ | Computer Graphics | Diffusion Models | Docker

Equipment support | Flexible work schedule | Health checkups | Meals and snacks | Paid leave

Entry-level Full Time

Seoul

3d ago
Machine Learning Engineer (전문연구요원, alternative military service), R&D

3D Deep Learning | API Development | C plus plus | Computer Graphics | Computer Vision

Equipment support | Flexible work schedule | Health checkups | Learning and development support | Meal and snack support

Entry-level Full Time

Seoul

4d ago
Machine Learning Researcher (NLP)

Attention Mechanisms | Cloud Platforms | Data Processing | Deep learning | Dialogue Systems

Relocation assistance not available

Senior-level Full Time

KOR - Seoul, South Korea, Korea, …

6d ago
AI/MLOps Engineer

AWS | Argo CD | Azure | CI/CD | Docker

Mid-level Full Time

KOR - Seoul, South Korea, Korea, …

6d ago
ML Researcher (Computer Vision - Sensor Fusion)

3D Reconstruction | CI/CD | CMM | Camera Calibration | Cloud processing

Relocation assistance not available

Mid-level Full Time

KOR - Seoul, South Korea, Korea, …

6d ago
AI/MLOps Engineer

AWS | Argo CD | Azure | CI/CD | Data Science

Mid-level Full Time

KOR - Seoul, South Korea, Korea, …

6d ago
ML Researcher (Computer Vision - Sensor Fusion)

3D Reconstruction | CI/CD | CMM | Camera | Cloud processing

Relocation assistance not provided

Mid-level Full Time

KOR - Seoul, South Korea, Korea, …

6d ago
Machine Learning Researcher (NLP)

Attention Mechanisms | Cloud Platforms | Data Processing | Deep learning | Dialogue Systems

Relocation assistance not provided

Senior-level Full Time

KOR - Seoul, South Korea, Korea, …

6d ago
Machine Learning Engineer, Infrastructure

Data Pipelines | Distributed Serving | Distributed Training | GPU Computing | Kubernetes

Corporate card | English education support | Equipment stipend | Health check | Home Office Equipment Refresh

Senior-level Full Time

Seoul, South Korea

7d ago
Senior AI Engineer (Korea)

Agent Orchestration | Embedding Models | Evaluation | LLM APIs | Observability

Equity | Flexible time off | Flexible work schedules | Health and wellness benefits | In-person offsites

Senior-level Full Time

Seoul, South Korea

7d ago
AI Engineer (Korea)

Artificial Intelligence | Evaluation | Feedback loops | LLM APIs | Language Models

Flexible time off | Flexible work schedules | Health and wellness benefits | In-person offsites | Technology reimbursements

Senior-level Full Time

Seoul, South Korea

7d ago
AI Application Engineer

AWS | Argo CD | Azure | BentoML | CI/CD

Relocation assistance not included

Mid-level Full Time

KOR - Seoul, South Korea, Korea, …

7d ago
AI Application Engineer

Argo CD | BentoML | CI/CD | Computer Architecture | Docker Compose

Mid-level Full Time

KOR - Seoul, South Korea, Korea, …

7d ago
Machine Learning Engineer, Embedding

Batch Processing | Computer Vision | Contrastive Learning | Distributed Training | Embedding Models

English education | Equipment stipend | Health checkup | Hybrid work | Snacks and coffee

Senior-level Full Time

Seoul, South Korea

8d ago
Sr. Staff Machine Learning Engineer (Eats Search & Discovery)

Data Mining | Data Pipelines | Deep learning | Experimentation | Feature Engineering

Senior-level Full Time

Seoul, South Korea

9d ago
Staff Machine Learning Engineer (Eats Search & Discovery)

Data Pipelines | Deep learning | Experimentation | Feature Engineering | Integration Testing

Senior-level Full Time

Seoul, South Korea

9d ago
Staff Machine Learning Engineer (Eats Search & Discovery)

Data Mining | Data Pipelines | Deep learning | Experimentation | Feature Engineering

Senior-level Full Time

Seoul, South Korea

9d ago
QA Engineer (AI Applications)

API Testing | Boundary-value analysis | CI/CD | Concurrency Testing | Cypress

Mid-level Full Time

Korea, Republic of

9d ago
Full Stack Gen AI Engineer

API Gateway | AWS | Agile | AppSync | Bitbucket

Senior-level Full Time

Korea, Republic of

9d ago
Gen AI Machine Learning Engineer

AI Agent framework | AI gateway | AWS Bedrock | Adversarial Red Teaming | Agent Framework

Mid-level Full Time

Korea, Republic of

9d ago
Data Engineer KRW 26740K-26740K

AWS | Apache Airflow | Apache Flink | Apache Kafka | Apache Spark

Equipment support | Flexible work schedule | Health checkups | Learning and development support | Meal and snacks support

Mid-level Full Time

Seoul

10d ago
Applied AI Engineer

API Integration | Agent Development | Agent Frameworks | Evaluation Frameworks | LLM Deployment

Conference speaking opportunities | Flexible working hours | Generous vacation and parental leave | Hybrid work policy | Visa sponsorship

Mid-level Full Time

Seoul, South Korea

12d ago
Solution Engineer

AWS Glue | AWS S3 | AWS SageMaker | Amazon Web Services | Apache Spark

Senior-level Full Time

KR-Seoul

13d ago
[AI Research Div.] [전문연구요원] Research Scientist/Engineer - Vision-Language-Action (VLA) for Robotics (2년 이상)

Computer Vision | Diffusion Models | Isaac Sim | Language Models | Large Language Models

Senior-level Full Time

Seoul

13d ago

LLM Engineer (Reinforcement Learning)

Tasks

Perks/Benefits

Skills/Tech-stack

Education

Roles

Regions

Countries

Related jobs