Data Scientist- VLM (Vision Language Model)
US-WI-Waukesha
Capgemini
A global leader in consulting, technology services and digital transformation, we offer an array of integrated services combining technology with deep sector expertise.Description
About the job you’re considering
We are seeking a highly skilled and detail-oriented Vision-Language Models (VLM) Data Scientist/ Vision Data Analyst to join our team. The ideal candidate will have a strong background in computer vision, natural language processing, data analysis, and machine learning.This role involves developing and deploying multimodal AI solutions that integrate vision and language capabilities, analyzing visual data to extract meaningful insights, and collaborating with cross-functional teams to improve our products and services.
Your role
· VLM Development & Deployment: Design, train, and deploy efficient Vision-Language Models (e.g., VILA) for multimodal applications. Explore cost-effective methods such as knowledge distillation, modal-adaptive pruning, and LoRA fine-tuning to optimize training and inference.
· Multimodal AI Solutions: Develop solutions that integrate vision and language capabilities for applications like image-text matching, visual question answering (VQA), and document data extraction. Leverage interleaved image-text datasets and advanced techniques (e.g., cross-attention layers) to enhance model performance.
· Healthcare Domain Expertise: Apply VLMs to healthcare-specific use cases such as medical imaging analysis, position detection, motion detection, and measurements. Ensure compliance with healthcare standards while handling sensitive data.
· Efficiency Optimization: Evaluate trade-offs between model size, performance, and cost using techniques like elastic visual encoders or lightweight architectures. Benchmark different VLMs (e.g., GPT-4V, Claude 3.5) for accuracy, speed, and cost-effectiveness on specific tasks.
· Data Analysis: Analyze large sets of visual data to identify patterns, trends, and anomalies.
Algorithm Development: Develop and implement computer vision algorithms to process and interpret visual data.
· Machine Learning: Apply machine learning techniques to improve the accuracy and efficiency of vision-based systems.
· Reporting: Create detailed reports and visualizations to communicate findings to both technical and non-technical audiences.
Your skills and experience
· Education: Master's or Ph.D. in Computer Science, Data Science, Machine Learning, Electrical Engineering, or a related field.
· Experience: 3+ years of experience in machine learning or data science roles with a focus on vision-language models and computer vision. Proven expertise in deploying production-grade multimodal AI solutions.
· Technical Skills: Proficiency in Python and ML frameworks (e.g., PyTorch, TensorFlow). Hands-on experience with VLMs such as VILA, or VSS. Strong understanding of image processing techniques and tools.
· Analytical Skills: Excellent problem-solving skills and the ability to analyze complex data sets.
Communication: Strong written and verbal communication skills. Ability to present complex information clearly and concisely.
· Teamwork: Ability to work effectively in a collaborative team environment.
· Experience with cloud computing platforms such as AWS or Azure.
· Familiarity with data visualization tools like Tableau or Power BI. Knowledge of statistical analysis and data mining techniques.
Life at Capgemini
Capgemini supports all aspects of your well-being throughout the changing stages of your life and career. For eligible employees, we offer:
- Flexible work
- Healthcare including dental, vision, mental health, and well-being programs
- Financial well-being programs such as 401(k) and Employee Share Ownership Plan
- Paid time off and paid holidays
- Paid parental leave
- Family building benefits like adoption assistance, surrogacy, and cryopreservation
- Social well-being benefits like subsidized back-up child/elder care and tutoring
- Mentoring, coaching and learning programs
- Employee Resource Groups
- Disaster Relief
About Capgemini Engineering
World leader in engineering and R&D services, Capgemini Engineering combines its broad industry knowledge and cutting-edge technologies in digital and software to support the convergence of the physical and digital worlds. Coupled with the capabilities of the rest of the Group, it helps clients to accelerate their journey towards Intelligent Industry. Capgemini Engineering has 65,000 engineer and scientist team members in over 30 countries across sectors including Aeronautics, Space, Defense, Naval, Automotive, Rail, Infrastructure & Transportation, Energy, Utilities &
Chemicals, Life Sciences, Communications, Semiconductor & Electronics, Industrial & Consumer, Software & Internet.
Capgemini Engineering is an integral part of the Capgemini Group, a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem. The Group reported 2024 global revenues of €22.1 billion.
Get the future you want | www.capgemini.com
Disclaimer
Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.
This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.
Capgemini is committed to providing reasonable accommodations during our recruitment process. If you need assistance or accommodation, please reach out to your recruiting contact.
Please be aware that Capgemini may capture your image (video or screenshot) during the interview process and that image may be used for verification, including during the hiring and onboarding process.
Click the following link for more information on your rights as an Applicant http://www.capgemini.com/resources/equal-employment-opportunity-is-the-law
Applicants for employment in the US must have valid work authorization that does not now and/or will not in the future require sponsorship of a visa for employment authorization in the US by Capgemini.
Job
: Programmer/AnalystSchedule
: Full-timePrimary Location
: US-WI-WaukeshaOrganization
: ERD PPL US* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture AWS Azure Claude Computer Science Computer Vision Data analysis Data Mining Data visualization Engineering Generative AI GPT Industrial LoRA Machine Learning NLP Power BI Python PyTorch R R&D Statistics Tableau TensorFlow
Perks/benefits: Career development Flex hours Flex vacation Health care Medical leave Parental leave
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.