Machine Learning Engineering Intern

Menlo Park

Applications have closed

Lamini

Lamini is the enterprise LLM platform for existing software teams to quickly develop and control their own LLMs. Lamini has built-in best practices for specializing LLMs on billions of proprietary documents to improve performance, reduce...

Posted 2 months ago

Lamini enables every enterprise to safely, quickly, and cost-effectively build their own Expert AI. Our customers own their own models, trained on their data. Lamini optimizes for Expert AI workloads with minimal hallucination, enterprise-grade security, and enterprise flexibility, running on any infrastructure. Our team is made up of highly committed engineers, researchers, and tech industry veterans excited by our mission and technology. We’re backed by leading VCs as well as computing and technology companies.
As a Machine Learning Engineering Intern, you will work closely with our AI researchers and engineers to optimize data pipelines for LLM training. You will gain hands-on experience in large-scale data processing, model fine-tuning, and infrastructure optimization, playing a key role in improving the efficiency and scalability of our AI systems.

Key responsibilities include:

Design, develop, and optimize data pipelines for large-scale LLM training.
Process and clean large datasets for pretraining and fine-tuning language models.
Assist in implementing distributed training strategies for LLMs.
Experiment with data augmentation and preprocessing techniques to enhance model performance.
Work with frameworks like PyTorch, TensorFlow, and Hugging Face Transformers.
Monitor and debug training runs, optimizing computational efficiency.
Collaborate with cross-functional teams to integrate data pipelines into production workflows.

Requirements:

Currently pursuing a Bachelor's, Master's, or PhD in Computer Science, Machine Learning, Data Science, or a related field.
Strong programming skills in Python.
Familiarity with deep learning frameworks such as PyTorch or TensorFlow.
Understanding of NLP concepts and transformer-based architectures (e.g., GPT, BERT, LLaMA).
Experience with data preprocessing, ETL pipelines, or large-scale dataset management.
Basic knowledge of cloud platforms (AWS, GCP, or Azure) and distributed computing.

Preferred Qualifications:

Experience working with LLM fine-tuning, LoRA, or adapter-based training.
Hands-on experience with ML model training and hyperparameter tuning.
Familiarity with tools like Hugging Face Datasets, Ray, or Apache Spark.
Exposure to containerization (Docker, Kubernetes) and ML model deployment.

Why Join Us?

Work on cutting-edge AI projects in a fast-paced and innovative environment.
Gain hands-on experience with real-world LLM training and optimization.
Collaborate with experts in AI research and machine learning engineering.
Potential opportunity for full-time conversion based on performance.

This is a 2-4+ month full-time paid internship. Please apply with a resume, projects, publications, and cover letter.
At Lamini AI, we are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants without regard to race, color, religion, sex, pregnancy (including childbirth, lactation and related medical conditions), national origin, age, physical and mental disability, marital status, sexual orientation, gender identity, gender expression, genetic information (including characteristics and testing), military and veteran status, and any other characteristic protected by applicable law. Lamini AI believes that diversity and inclusion among our employees are critical to our success as a company, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. Selection for employment is decided on the basis of qualifications, merit, and business need.

Categories: Engineering Jobs, Machine Learning Jobs

Tags: Architecture, AWS, Azure, BERT, Computer Science, Data pipelines, Deep Learning, Docker, Engineering, ETL, GCP, GPT, Kubernetes, LLaMA, LLMs, LoRA, Machine Learning, Model deployment, Model training, NLP, PhD, Pipelines, Python, PyTorch, Research, Security, Spark, TensorFlow, Testing, Transformers