Machine Learning Engineering Intern
Menlo Park
Lamini
Lamini is the enterprise LLM platform for existing software teams to quickly develop and control their own LLMs. Lamini has built-in best practices for specializing LLMs on billions of proprietary documents to improve performance, reduce...As a Machine Learning Intern, you will work closely with our AI researchers and engineers to optimize data pipelines for LLM training. You will gain hands-on experience in large-scale data processing, model fine-tuning, and infrastructure optimization, playing a key role in improving the efficiency and scalability of our AI systems.
Key responsibilities include:
- Design, develop, and optimize data pipelines for large-scale LLM training.
- Process and clean large datasets for pretraining and fine-tuning language models.
- Assist in implementing distributed training strategies for LLMs.
- Experiment with data augmentation and preprocessing techniques to enhance model performance.
- Work with frameworks like PyTorch, TensorFlow, and Hugging Face Transformers.
- Monitor and debug training runs, optimizing computational efficiency.
- Collaborate with cross-functional teams to integrate data pipelines into production workflows.
Requirements:
- Currently pursuing a Bachelor's, Master's, or PhD in Computer Science, Machine Learning, Data Science, or a related field.
- Strong programming skills in Python.
- Familiarity with deep learning frameworks such as PyTorch or TensorFlow.
- Understanding of NLP concepts and transformer-based architectures (e.g., GPT, BERT, LLaMA).
- Experience with data preprocessing, ETL pipelines, or large-scale dataset management.
- Basic knowledge of cloud platforms (AWS, GCP, or Azure) and distributed computing.
Preferred Qualifications:
- Experience working with LLM fine-tuning, LoRA, or adapter-based training.
- Hands-on experience with ML model training and hyperparameter tuning.
- Familiarity with tools like Hugging Face Datasets, Ray, or Apache Spark.
- Exposure to containerization (Docker, Kubernetes) and ML model deployment.
Why Join Us?
- Work on cutting-edge AI projects in a fast-paced and innovative environment.
- Gain hands-on experience with real-world LLM training and optimization.
- Collaborate with experts in AI research and machine learning engineering.
- Potential opportunity for full-time conversion based on performance.
At Lamini AI, we are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants without regard to race, color, religion, sex, pregnancy (including childbirth, lactation and related medical conditions), national origin, age, physical and mental disability, marital status, sexual orientation, gender identity, gender expression, genetic information (including characteristics and testing), military and veteran status, and any other characteristic protected by applicable law. Lamini AI believes that diversity and inclusion among our employees is critical to our success as a company, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. Selection for employment is decided on the basis of qualifications, merit, and business need.
Tags: Architecture AWS Azure BERT Computer Science Data pipelines Deep Learning Docker Engineering ETL GCP GPT Kubernetes LLaMA LLMs LoRA Machine Learning Model deployment Model training NLP PhD Pipelines Python PyTorch Research Security Spark TensorFlow Testing Transformers
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.