VLM-based Scene Understanding Research – Intern
Sunnyvale, CA, United States
Internship Entry-level / Junior USD 82K - 136K
Bosch Group
Moving stories and inspiring interviews. Experience the meaning of "invented for life" by Bosch completely new. Visit our international website.Company Description
The Bosch Research and Technology Center North America with offices in Sunnyvale, California, Pittsburgh, Pennsylvania, and Cambridge, Massachusetts is a part of the global Bosch Group (www.bosch.com), a company with over 70 billion euro revenue, 400,000 employees worldwide, a very diverse product portfolio, and a history spanning over 125 years. The Research and Technology Center North America (RTC-NA) is dedicated to providing technologies and system solutions for various Bosch business fields, primarily in the field of artificial intelligence, energy technologies, internet technologies, circuit design, semiconductors and wireless, as well as advanced MEMS design.
As a part of the global research, our AI research in Silicon Valley focuses on Foundation Models, Big Data Visual Analytics, Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Data Science, AI System Engineering, Time-series Analysis. We develop scalable, intelligent, and trustworthy AIoT solutions for Bosch products and services in application areas such as automated driving, advanced driver assistance systems (ADAS), robotics, smart manufacturing, enterprise AI, health care, smart home and building solutions.
Originating from the AI research in Silicon Valley, our Intelligent Autonomous Systems group is responsible for enabling future autonomous Bosch products by pushing the boundaries of robotics, automated driving and automation through key innovations that encompass system architecture and AI components. These include methods for localization, motion planning, high level task planning and decision making as well as systems for making these technologies work on real products by building frameworks that take advantage of technologies in the field of reliable distributed computing. We work with internal partners of different Bosch business units to transfer our solutions into future products. We also actively collaborate with leading groups in academia and industry to promote research ideas and publish research findings in internationally renowned conferences and journals such as CVPR, ICRA, IROS, RSS, NeurIPS and CoRL.
Job Description
- Build up an advanced cloud-based system for sparse semantic scene understanding using VLMs.
- Architect a cloud-retrieval pipeline consisting of scene representation storage, localization and data retrieval, as well VLM-based map creation.
- Implement the system using off-the-shelf building blocks for localization, mapping and communication, extending them where needed.
- Summarize research findings in high-quality paper and/or patent submissions.
Qualifications
Basic Qualifications
- Ph.D. student or highly qualified M.S. student in Computer Science, Machine Learning, Robotics, or related fields (Must be a current student or recent graduate – less than 1 year)
- Hands-on experience in setting up and running computer vision technologies, such as YOLO, SAM, etc.
- Hands-on experience in setting up and running VLMs or LLMs
- Experience with localization and mapping algorithms, such as SLAM or place recognition
- Knowledgeable on the state-of-the-art in VLM/LLM ideas and software
- Solid C++ and Python programming skills
- Hands-on experience working with robotic middlewares such as ROS2 or FogROS2
Preferred Qualifications
- Publication record in top venues (ICRA, IROS, RSS, CoRL, NeurIPs etc.)
- Experience in embedded systems or distributed systems is a plus
- Experience in cloud computation and Cloud computing platforms (i.e. AWS, Azure, Google Cloud) is a plus
- Able to work independently, has strong research and problem-solving skills
- Good communication and teamwork skills
Additional Information
By choice, we are committed to a diverse workforce - EOE/Protected Veteran/Disabled.
BOSCH is a proud supporter of STEM (Science, Technology, Engineering & Mathematics)
- FIRST Robotics (For Inspiration and Recognition of Science and Technology)
- AWIM (A World In Motion)
The U.S. base salary range for this intern position is $41.00-$68.00 hourly. Within the range, individual pay is determined based on several factors, including, but not limited to, type of degree, work experience and job knowledge, complexity of the role, type of position, job location, etc. Your Hiring Manager can share more details about the specific salary range for this position during the interview process.
For more information on our culture and benefits, please visit:
Tags: Architecture AWS Azure Big Data Circuit Design Computer Science Computer Vision Distributed Systems Engineering GCP Google Cloud LLMs Machine Learning Mathematics NeurIPS NLP Python Research Robotics SLAM STEM YOLO
Perks/benefits: Career development Conferences Health care
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.