Research Scientist, Visual Language Models

Zürich, Switzerland

Google

Google’s mission is to organize the world's information and make it universally accessible and useful.

View company page

Minimum qualifications:

  • PhD degree in Computer Science, a related field, or equivalent practical experience.
  • Experience with Generative AI and use cases.
  • Experience with Machine Learning, Deep Learning, Reinforcement Learning, Computer Vision.
  • One or more scientific publication submission(s) for conferences, journals, or public repositories.

Preferred qualifications:

  • Candidates will typically have 2 years of coding experience.
  • Typically 1 year of experience owning and initiating research agendas.
  • Familiarity with tensorflow programming.
  • History of contribution to research communities and/or efforts.

About the job

As an organization, Google maintains a portfolio of research projects driven by fundamental research, new product innovation, product contribution and infrastructure goals, while providing individuals and teams the freedom to emphasize specific types of work. As a Research Scientist, you'll setup large-scale tests and deploy promising ideas quickly and broadly, managing deadlines and deliverables while applying the latest theories to develop new and improved products, processes, or technologies. From creating experiments and prototyping implementations to designing new architectures, our research scientists work on real-world problems that span the breadth of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and software performance analysis, improving compilers for mobile platforms, as well as core search and much more.

As a Research Scientist, you'll also actively contribute to the wider research community by sharing and publishing your findings, with ideas inspired by internal projects as well as from collaborations with research programs at partner universities and technical institutes all over the world.

The AR Semantic Perception team develops novel core tech around computer vision and ML, in particular scene and object understanding, 3D computer vision, high-level text understanding, 3D reconstruction and SLAM, object detection and tracking. The activity spans the whole range from research to product. Our team is composed of Research Scientists and Software Engineers with a focus on 3D scene reconstruction, object and text understanding, and Generative AI. The team serves multiple perception-related products at Google, with a particular focus on AR products and applications.

The AR Semantic Perception team develops novel core tech around computer vision and ML, in particular scene and object understanding, 3D computer vision, high-level text understanding, 3D reconstruction and SLAM, object detection and tracking. The activity spans the whole range from research to product. Our team is composed of Research Scientists and Software Engineers with a focus on 3D scene reconstruction, object and text understanding, and Generative AI. The team serves multiple perception-related products at Google, with a particular focus on AR products and applications.

The AR Semantic Perception team develops novel core tech around computer vision and ML, in particular scene and object understanding, 3D computer vision, high-level text understanding, 3D reconstruction and SLAM, object detection and tracking. The activity spans the whole range from research to product. Our team is composed of Research Scientists and Software Engineers with a focus on 3D scene reconstruction, object and text understanding, and Generative AI. The team serves multiple perception-related products at Google, with a particular focus on AR products and applications.

The AR Semantic Perception team develops novel core tech around computer vision and ML, in particular scene and object understanding, 3D computer vision, high-level text understanding, 3D reconstruction and SLAM, object detection and tracking. The activity spans the whole range from research to product. Our team is composed of Research Scientists and Software Engineers with a focus on 3D scene reconstruction, object and text understanding, and Generative AI. The team serves multiple perception-related products at Google, with a particular focus on AR products and applications.

Responsibilities

  • Develop innovative computer vision and machine learning technology for applications in the field of augmented reality and beyond.
  • Lead productionization of innovative computer vision and ML technology in the area of visual language models for scene, object and text understanding, including development and optimization of the ML model, integration in a mobile framework, hillclimb and evaluation.
  • Establish and maintain relationships with main stakeholders and keep recurring updates on the advancements of the project, making sure that their expectations are aligned to the research and engineering work.
Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: 3D Reconstruction Architecture Computer Science Computer Vision Data Mining Deep Learning Engineering Generative AI Machine Learning NLP PhD Prototyping Reinforcement Learning Research SLAM TensorFlow

Perks/benefits: Conferences

Region: Europe
Country: Switzerland
Job stats:  4  0  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.