Computer Vision and Generative Models, Research Scientist, Vision AI

Tokyo

Woven by Toyota

Woven by Toyota will help Toyota to develop next-generation cars and to realize a mobility society in which everyone can move freely, happily and safely.

View all jobs at Woven by Toyota

Apply now Apply later

About Woven by ToyotaWoven by Toyota, a part of the Toyota Group, is challenging the current state of mobility through human-centric innovation and empowering mobility transformation. Through our AD/ADAS technology, our automotive software development platform Arene OS, our mobility test course Toyota Woven City, and Toyota’s growth fund, Woven Capital, we are pioneering the movement of people, goods, information, and energy, weaving a future of enhanced safety, connectivity and well-being for all.
=========================================================================
TEAMToyota is redefining what it means to move. We're challenging the current state of mobility by enhancing the movement of people, goods, information and energy. Centered around three core concepts - A Living Laboratory™, Human-Centered, and Ever Evolving City™ - Woven City serves as a test course for mobility to fulfill our purpose of well-being for all.
We do this by bringing together a diverse community of people with a shared passion for the future of mobility to co-create, develop and refine innovative products and services. This cross-section of social infrastructure, mobility, and people provides a unique opportunity for inventors, residents and visitors to interact seamlessly with new technologies throughout daily life in an environment that emulates a real city.
For more information about Woven City, please visit: https://www.woven-city.global/
WHO ARE WE LOOKING FOR?We are seeking highly self-motivated persons with expertise in machine learning and computer vision, focused on the state-of-the-art in Multi-Modal Foundation Models. You are expected to conduct cutting-edge research on vision-language models (VLMs) and large language models (LLMs), especially in connecting with multi-modal domains such as vision, sensor, and text. You will join our Tokyo-based team to build next-generation technology in computer vision-related products used in the woven city as human-centric services.

RESPONSIBILITIES

  • Research and development of new technologies for multi-modal foundation models and generative AI-related areas including vision, language, and sensors in the real-world domain 
  • Conduct large-scale training and evaluation on multiple benchmarks
  • Develop the novel algorithm for prototyping and present research findings both internally and externally
  • Collaborate with other researchers, engineers, and business team members across the company and overseas institutes to proceed with state-of-the-art research and products. for the next generation of AI services

MINIMUM QUALIFICATIONS

  • Ph.D. degree in computer science, a related field, or equivalent practical experience
  • Strong publication record in top-tier computer vision, and machine learning conferences or journals, e.g., CVPR, ICCV, ECCV, TPAMI, etc.
  • R&D experience in machine learning, and generative models regarding computer vision or multi-modal domains
  • Experienced in ML frameworks such as PyTorch or TensorFlow, etc.
  • 3+ years of experience in software development or R&D
  • Familiar with modern development tools and cloud-based environments (GIT, Docker, AWS, GCP, etc.)
  • Strong communication skills, both verbal and written in English

NICE TO HAVES

  • 5+ years of experience in software development or R&D
  • Experience in training and deployment of large language models or large vision-language models
  • Experience in pre- and post-train under supervised, self-supervised, reinforcement learning, and graph learning
  • Experience in conducting large-scale dataset creation
  • Experience in large-scale distributed training environments and frameworks
  • Experience in RAG and large-scale model deployment
=========================================================================Important Points・All interviews will be arranged via Google Meet, unless otherwise stated.・The same job descriptions are available in both English and Japanese; therefore, we kindly ask that you apply to only one version.・We kindly request that you submit your resume in English, if possible. However, Japanese resumes are also acceptable. Please note that, depending on the English proficiency requirements of the role, we may request an English version of your resume later in the process.
WHAT WE OFFER・Competitive Salary - Based on experience・Work Hours - Flexible working time・Paid Holiday - 20 days per year (prorated)・Sick Leave - 6 days per year (prorated)・Holiday - Sat & Sun, Japanese National Holidays, and other days defined by our company・Japanese Social Insurance - Health Insurance, Pension, Workers’ Comp, and Unemployment Insurance, Long-term care insurance・Housing Allowance・Retirement Benefits・Rental Cars Support・In-house Training Program (software study/language study)
Our Commitment・We are an equal opportunity employer and value diversity.・Any information we receive from you will be used only in the hiring and onboarding process. Please see our privacy notice for more details.
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: AWS Computer Science Computer Vision Docker GCP Generative AI Generative modeling Git LLMs Machine Learning Model deployment Privacy Prototyping PyTorch R RAG R&D Reinforcement Learning Research TensorFlow

Perks/benefits: Career development Competitive pay Conferences Flex hours Health care

Region: Asia/Pacific
Country: Japan

More jobs like this