Machine Learning Engineer
Mountain View, CA, US
CARIAD
At CARIAD, we believe that software has the potential to reshape the automotive experience for everyone, everywhere. Find out how!We are CARIAD, an automotive software development team with the Volkswagen Group. Our mission is to make the automotive experience safer, more sustainable, more comfortable, more digital, and more fun. To achieve that we are building the leading tech stack for the automotive industry and creating a unified software platform for over 10 million new vehicles per year. We’re looking for talented, digital minds like you to help us create code that moves the world. Together with you, we’ll build outstanding digital experiences and products for all Volkswagen Group brands that will transform mobility. Join us as we shape the future of the car and everyone around it.
Role Summary:
The Machine Learning Engineer II will design, implement, and test AI Application for CARIAD’s software development platform.
This role will support the software and integration team by building AI-driven applications that automate test case generation and analyze code and testing logs for error prediction and self-diagnosis. Additionally, the role will focus on improving QA processes through task automation, identifying patterns in testing failures, and delivering proactive solutions to enhance the overall engineering workflow and software quality.
Role Responsibilities:
AI Application Development
- Research and develop AI application to improve engineering efficiency and productivity
- Design and create RAG and RL application pipelines
- Define and create vector datastores for different application use
- Design and develop application frontend and backend services
Prompt Engineering
- Develop application specific prompt tuning pipeline
- Create and design prompt engineering architecture for different applications
- Develop scripts to process data and generate ground truth dataset for fine-tuning
Model Fine-tuning
- Fine tune base models with different quantization techniques for application specific usage
- Optimize models for performance and size
- Evaluate and improve the efficacy of the fine-tuned model
- Define and create multi-model and multi-adapter inferencing workflow
Model Evaluation & Deployment
- Define and validate model evaluation requirement
- Define and execute stress testing for inferencing scalability
- Generate and publish evaluation result
- Define and create model deployment pipeline
Collaboration and Communication
- Work with infrastructure team for model and application deployment
- Collaborate effectively with cross-functional teams including software, testing, and program management teams.
- Communicate project status, technical challenges, and solutions in a clear and concise manner.
General Skills:
- Analytical and debugging skills.
- Effective written and oral communication skills.
- Experience in project and technical leadership.
- Ability to lead technical meetings and provide feedback.
- Collaborate and work with multiple teams across geographies and time zones.
Required Specialized Skills:
- Experience with LLM models Llama 3, Gemini, ChatGPT-4o, Mistral, Claude 2, Grok-1
- Experience fine-tuning LLM models with PEFT, Quantization, Tokenization for LoRA/QLoRA
- Strong understanding on embeddings, vectorization, custom tokens
- Experience generating model dataset using parquet, defining schema of the dataset
- Proficient in Python, AI-related training and inferencing tools such as PyTorch, vLLM, Ray, Gradio, FastAPI, Streamlit
- Experience on prompting techniques such as Iterative refinement, Feedback loops, Zero-shot, Few-shot, CoT
- Experience with different model evaluation tasks such as HellaSwag, TruthfulQA, MMLU
- Good understanding of natural language processing, machine learning, AI generated content development
- Experience using LLM tools Ollama, ComfyUI
- Experience developing RAG, RL applications
- Experience with developing ML pipelines and MLOps for deployment
Desired Skills:
- Experience with Vision AI models GAN, Stable Diffusion
- Experience with Image generation pipeline with segmentation, image detection, image masking and processing
Workplace Flexibility:
- Calls, (virtual) meetings & workshops (overlapping with German/US business hours as needed) to align with leadership, development teams and partners.
- Occasional international and domestic travel to provide on-site support and planning/integration workshops with our internal and external stakeholders.
Years of Relevant Experience:
- 2+ year’s experience with Machine Learning and Deep Learning Models
- 2+ year’s experience in Python and PyTorch
Required Education:
- MS in Computer Science, Data Science, or Computer Engineering
Desired Education:
- PhD in Computer Science, Data Science, or Computer Engineering
Compensation
Salary range is dependent on factors such as geographical differentials, credentials or certifications, industry-based experience, qualification and training. In the city of Mountain View, California, the salary range for this position is $101,800 - 147,700.
CARIAD, Inc. provides performance based merits and annual bonus along with a competitive benefits package. Benefits include medical, dental, vision, 401k with employer match and defined contribution plan, short and long term disability, basic life and AD&D insurance, employee assistance program, tuition reimbursement and student loan repayment plans, maternity and non-primary caregiver leave, adoption assistance, employee referral program and vacation and paid holidays. We also offer a unique vehicle lease program that covers registration and insurance fees.
CARIAD is an Equal Opportunity Employer. We welcome and encourage applicants from all backgrounds, and do not discriminate based on race, sex, age, disability, sexual orientation, national origin, religion, color, gender identity/expression, marital status, veteran status, or any other characteristics protected by applicable laws.
Tags: AI content Architecture ChatGPT Claude Claude 2 ComfyUI Computer Science Deep Learning Engineering FastAPI Gemini GPT Gradio LLaMA LLMs LoRA Machine Learning MLOps Model deployment NLP Parquet PhD Pipelines Prompt engineering Python PyTorch RAG Research Stable Diffusion Streamlit Testing vLLM
Perks/benefits: 401(k) matching Career development Competitive pay Health care Insurance Medical leave Salary bonus
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.