Tech Lead, LLM & Generative AI (Full Remote - Gibraltar)
Tasks
- Architect LLM systems
- Build data sourcing labeling and cleaning pipeline
- Collaborate with engineering and DevOps teams
- Design and train moderation classifiers
- Drive RLHF DPO preference optimization
- Drive supervised fine tuning
- Implement context aware safety alignment strategies
- Implement memory and RAG retrieval
- Lead model training and deployment
- Optimize context windows and inference latency
Perks/Benefits
- AI tools access
- Annual in-person meetup
- Co-working space budget
- Equipment provided
- Fully remote
- Health and wellness support
- Learning budget
- Paid time off
Skills/Tech-stack
Classifier Training | Context window | Context window optimization | Data cleaning | Data labeling | Direct Preference Optimization | Evaluation | Fine Tuning | Huggingface | Human Feedback | Language Models | Large Language Models | Latency optimization | Learning from Human Feedback | Machine Learning | Moderation systems | Preference optimization | PyTorch | Python | RAG | Reinforcement Learning | Reinforcement Learning from Human Feedback | Retrieval-Augmented Generation | Sampling | Supervised Fine Tuning | Throughput Optimization | VLLM
Education
N/A
Roles
Engineer | Lead | Learning Engineer | Machine Learning Engineer | Tech Lead
Related jobs
- No jobs found.