Research Intern - Multimodal Foundation Models

Palo Alto, California, United States

About the Role:
OPPO US Research Center is seeking self-motivated Research Interns to join our team for a 3-month full-time internship. Interns will work on projects at the intersection of personal interest and our business objectives, contributing to the development of unified multimodal models under the guidance of experienced mentors and professionals.

 

Key Responsibilities:

  • Collaborate with mentors to define and execute a research project aligned with both personal interests and organizational goals.
  • Develop and train unified foundation models targeting high performance in multimodal generation and understanding tasks.
  • Engage with internal and external teams across different time zones to foster collaborative research efforts.
  • Work alongside research scientists to transition research outcomes into production-ready features and products.

Requirements

Minimum Qualifications:

  • Currently pursuing a degree in Computer Science, Computer Engineering, or a related field.
  • Solid foundation in diffusion models for image generation, and in image editing with both generative models and traditional methods.
  • Strong interest in exploring and iterating on new techniques for multimodal models.
  • Proficiency in Python and PyTorch.
  • Familiarity with distributed model training and inference acceleration techniques.
  • Availability for a full-time internship in Palo Alto for 3 months.

Preferred Qualifications:

  • Experience with one or more of the following: multimodal models, diffusion models for video generation, large language models, foundation model training, and data processing pipelines for foundation models.
  • Strong publication record in top-tier conferences (e.g., CVPR) and/or journals.
  • Currently pursuing a Ph.D. in Computer Vision or Machine Learning, preferably in the second year or beyond.

Benefits

OPPO is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status. We also consider qualified applicants regardless of criminal history, consistent with legal requirements.

The US base salary range for this full-time position is $30-$60/hour. Our salary ranges are determined by role, level, and location.

Perks/benefits: Career development, conferences
