Staff Machine Learning Engineer - TrainingOps
Tasks
- Automate end to end training lifecycle
- Build scalable end to end training pipelines
- Drive training infrastructure design
- Improve reliability reproducibility and efficiency
- Lead evaluation pipeline development
- Mentor engineers with design reviews
- Own data curation workflows and training systems
Perks/Benefits
- Annual health check
- English education support
- Equipment stipend
- Equipment upgrade
- Hybrid work
- Meal and transportation card
- Snacks and drinks
- Winter break
Skills/Tech-stack
Data Curation | Distributed Training | Machine Learning | Model Evaluation | Multimodal Learning | Performance optimization | Reliability Engineering | Reproducibility | Training Infrastructure | Training Pipeline
Education
Related jobs
-
Batching | Caching | Deep learning | Distributed Training | GPU ComputingEnglish education program support | Equipment stipend | Health checkup support | Hybrid work | Meals and transportation cardSenior-level Full TimeSeoul, South Korea15h ago
-
Batching | Caching | Computer Vision | Distributed Training | GPU ComputingAnnual health checkup | English education program | Equipment refresh every 3 years | Hybrid work model | MacBook equipment providedSenior-level Full TimeSeoul, South Korea15h ago
-
Batching | Caching | Computer Vision | Data Curation | Data workflowsCompany card | English education | Equipment stipend | Health checkup | Hybrid workSenior-level Full TimeSeoul, South Korea15h ago
-
ANN | Apache Airflow | Async Programming | BM25 | Distributed SystemsSenior-level Full TimeSeoul, South Korea2d ago
-
A/B | A/B Testing | B testing | Cloud Computing | Data AnalysisEntry-level Full TimeSeoul, South Korea3d ago
-
API Integration | AWS | Agentic Systems | Azure | GCPAdditional paid holidays | Commuting cost support | Flexible work hours | Free parking | Group insurance supportMid-level Full TimeSeoul, South Korea3d ago
-
AWS | AWS Batch | AWS Glue | AWS Lambda | Active DirectorySenior-level Full TimeKR, Gyeonggi-do, Hwaseong, Korea, Republic of3d ago
-
AI | Analytics | Data Governance | Data Management | Machine LearningMid-level Full TimeKR-AIA Tower, Korea, Republic of7d ago
-
Apache Spark | Azure | Cause analysis | Data Analysis | Data PipelinesSenior-level Full TimeHwasung Campus Building A, Korea, Korea, …7d ago
-
DDP | Deep learning | Distributed Training | Docker | Efficient Fine TuningSenior-level Full TimePangyo (Software Dream Center), South Korea10d ago
-
3D Reconstruction | C# | C++ | CUDA | Computer VisionSenior-level Full TimePangyo (Software Dream Center), South Korea10d ago
-
Android | Attention Mechanisms | C# | C++ | CI/CDSenior-level Full TimePangyo (Software Dream Center), South Korea12d ago
-
DDP | Deep learning | Direct Preference Optimization | Distributed Training | DockerSenior-level Full TimePangyo (Software Dream Center), South Korea12d ago
-
Data Pipelines | Distributed Serving | Distributed Training | GPU Computing | KubernetesCorporate card | English education support | Equipment stipend | Health check | Home Office Equipment RefreshSenior-level Full TimeSeoul, South Korea18d ago
-
Agent Orchestration | Embedding Models | Evaluation | LLM APIs | ObservabilityEquity | Flexible time off | Flexible work schedules | Health and wellness benefits | In-person offsitesSenior-level Full TimeSeoul, South Korea18d ago
-
Batch Processing | Computer Vision | Contrastive Learning | Distributed Training | Embedding ModelsEnglish education | Equipment stipend | Health checkup | Hybrid work | Snacks and coffeeSenior-level Full TimeSeoul, South Korea19d ago
-
Data Mining | Data Pipelines | Deep learning | Experimentation | Feature EngineeringSenior-level Full TimeSeoul, South Korea20d ago
-
Data Pipelines | Deep learning | Experimentation | Feature Engineering | Integration TestingSenior-level Full TimeSeoul, South Korea20d ago
-
Data Mining | Data Pipelines | Deep learning | Experimentation | Feature EngineeringSenior-level Full TimeSeoul, South Korea20d ago
-
AI Agent framework | AI gateway | AWS Bedrock | Adversarial Red Teaming | Agent FrameworkMid-level Full TimeKorea, Republic of20d ago
-
Agent Orchestration | Agent systems | Continual pretraining | Deep learning | Distributed TrainingCareer growth opportunities | Flexible innovation culture | GPU cluster accessSenior-level Full TimeSeoul, South Korea25d ago
-
API Development | Algorithms | Cloud Computing | Data Structures | Distributed ComputingMid-level Full TimeYeoksam, Seoul28d ago
-
Apache Beam | BigQuery | Bigtable | Computer Vision | DataflowMid-level Full TimeSeoul, Korea1mo ago
-
Deep learning | Diffusion Model | Machine Learning | Newton | PyTorchFlexible work arrangements | Learning and development opportunitiesSenior-level Full TimeSeoul1mo ago
-
Data Preprocessing | Distributed Training | Model Optimization | Model Training | PyTorchSenior-level Full Time서울 강남구 논현로 508, GS강남타워1mo ago