Staff AI Engineer, Model Post-Training and Alignment
Tasks
- Apply reinforcement learning based optimization
- Build data augmentation pipeline
- Collaborate to productionize training and deployment workflows
- Deploy models using low latency serving frameworks
- Design DPO training
- Design GRPO training
- Design RLAIF closed loop systems
- Develop domain specific data curation strategy
- Evaluate model performance with automated benchmarks
- Evaluate using human AI feedback loops
- Execute supervised fine tuning
- Implement preference optimization
- Lead post training pipeline for large language models
- Optimize inference efficiency
- Train reward models
Perks/Benefits
- Company events
- Comprehensive healthcare
- Education subsidy
- Learning and development programs
- Meal allowances
- Team building programs
- Wellness allowances
Skills/Tech-stack
Benchmarking | Deep learning | Direct Preference Optimization | Fine Tuning | Generalized Reward Policy Optimization | Language Models | Language Processing | Large Language Models | Low Latency | Low Latency Inference | Machine Learning | Model Deployment | Natural Language | Natural Language Processing | Policy Optimization | Preference Learning | Preference optimization | Reinforcement Learning | Reinforcement Learning from AI Feedback | Reward Modeling | SGLang | Supervised Fine Tuning | VLLM
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Roles
AI | AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer
Related jobs
-
Research Intern (AI Agent) CNY 25K-37KAgent systems | Embodied AI | Language Models | Large Language Models | Memory-augmented systemsEntry-level Full Time Internship深圳15h ago
-
具身智能算法实习生 (Manipulation) CNY 25K-37KCLIP | Computer Vision | Deep learning | Diffusion Model | Fine TuningEntry-level Internship深圳15h ago
-
校招-Ai研究科学家-大语言模型/视觉语言模型算法与后训练(博士优先) CNY 500K-500KAdapters | Direct Preference Optimization | Fine Tuning | Flax | Function designNone Full Time上海15h ago
-
Data Science Internship SGD 57K-57KAnti-Money Laundering | Data Engineering | Data Pipelines | Data cleaning | Fraud DetectionFlexible remote work options | Focus Fridays | Generous PTO | Hybrid work options | Learning and development opportunitiesEntry-level InternshipSingapore - Singapore18h ago
-
Lead Consultant - RAG Engineer INR 2535K-4225KAWS | AWS Step Functions | Airflow | Amazon SageMaker | Cloud NativeContinuous learning opportunities | Hands-on experience | MentorshipSenior-level Full TimeIndia-Gurugram20h ago
-
Capacity Planning | Computer Vision | Deep reinforcement learning | LLM | Language ModelsMid-level Full TimeSingapore, Singapore20h ago
-
Azure | Csharp | Data Ingestion | Data cleaning | Machine LearningCommunity forums | Laptop provided | Medical insurance | Mentorship | No weekend workSenior-level Full TimeIndia R20h ago
-
Algorithm Implementation | Automatic Parameter Optimization | Capacity Scheduling | Data Modeling | Dynamic Route PlanningMid-level Full TimeSingapore, Singapore20h ago
-
Senior-level Full TimeGurugram, Haryana, India21h ago
-
Software Engineer - Product (Technical Leadership) INR 2829K-5000KAI | Data Analysis | Java | JavaScript | Machine LearningSenior-level Full TimeBangalore, India21h ago
-
Senior-level Full TimeBangalore, India | Mumbai, India21h ago
-
Senior Machine Learning Engineer, Compliance SGD 106K-106KAML | Anomaly Detection | Automation workflows | Batch Data Processing | Batch dataCandidate Support Programs | Company events | Education subsidy | Healthcare schemes | L and D programsSenior-level Full TimeSingapore, Singapore21h ago
-
Data Science – (Gen AI Developer) - Associate INR 2000K-2400KAutogen | CI/CD | Data Analysis | Data Visualization | ETLSenior-level Full TimeBengaluru, Karnataka, India22h ago
-
Computer Vision | Diffusion Models | Isaac Sim | Language Models | Large Language ModelsSenior-level Full TimeSeoul1d ago
-
[BD] AI Intern JPY 2400K-2400KData Quality | Decision Trees | Deep learning | Language Processing | Machine LearningAccident insurance | Birthday leave | International projects | Meal support | Paid leaveEntry-level Full Time InternshipHanoi, Vietnam1d ago
-
Mid-level Full TimeMumbai, Maharashtra, India1d ago
-
AI Specialist THB 216K-480KAPI Integration | Azure OpenAI | Azure OpenAI Service | Copilot Studio | CrewAIAccess to Microsoft tools and community | Birthday gift | Flexible work location | Flexible working hours | Generous paid time offMid-level Full TimeBangkok, Bangkok, Thailand1d ago
-
Entry-level Internship Part TimeChina1d ago
-
Mid-level Full TimeGurgaon, Haryana, India1d ago
-
Senior-level Full TimeNoida1d ago
-
Data Science Lead AUD 130K-160KArtificial Intelligence | BigQuery | Cloud Computing | DBT | Data EngineeringCertification support | Conference access | Employee assistance program | Health and wellness programs | Learning budgetSenior-level Full TimeAU - HQ - NSW1d ago
-
Software Engineer II - AIML Platform Reliability INR 1800K-3000KAWS | Automation | Azure | Client Libraries | Continuous DeliveryMid-level Full TimeBengaluru, Karnataka, India1d ago
-
Machine Learning Engineer AUD 110K-130K3D Geometry | 3D Two D Projection | Camera Geometry | Cloud processing | Computer VisionEntry-level Full TimeFyshwick, Australia1d ago
-
Senior Software Engineer INR 3200K-4225KAccess Control | Airflow | Artificial Intelligence | Astronomer | CI/CDSenior-level Full TimePune, India R1d ago
-
Cloud Computing | Computer Vision | Data Engineering | Data Interpretation | Data PipelinesMid-level Full TimeKOR - Seoul, South Korea, Korea, …1d ago