Vision Language Data Scientist

Bengaluru

Fractal

Fractal Analytics helps global Fortune 100 companies power every human decision in the enterprise by bringing analytics and AI to the decision.


It's fun to work in a company where people truly BELIEVE in what they are doing!

We're committed to bringing passion and customer focus to the business.

Company Overview:

Fractal is a strategic AI partner to Fortune 500 companies with a vision to power every human decision in the enterprise. Fractal is building a world where individual choices, freedom, and diversity are the greatest assets. An ecosystem where human imagination is at the heart of every decision. Where no possibility is written off, only challenged to get better. We believe that a true Fractalite is the one who empowers imagination with intelligence. Fractal has been featured as a Great Place to Work by The Economic Times in partnership with the Great Place to Work® Institute and recognized as a ‘Cool Vendor’ and a ‘Vendor to Watch’ by Gartner.

Job Location: Bangalore

Job Description:

We are seeking highly skilled and motivated vision / vision-language data scientists to join our dynamic AI team.

The ideal candidate should have experience in:

  • Vision Transformers, BEiT, MAE, DINO, SAM, DINOv2, Visual ChatGPT, SegGPT, CLIP, BLIP, FILIP, GLIP, FLIP, VQA, Stable Diffusion, Visual Prompt Tuning
  • Image classification, image retrieval, object detection and tracking, optical flow, instance segmentation, pose estimation, monocular depth estimation, image captioning, GANs
  • Computer vision more broadly, with strong fundamentals and proven hands-on exposure, along with a fundamental understanding of NLP

The candidate will be responsible for developing state-of-the-art models and techniques for client projects in retail, surveillance, manufacturing, process automation and content generation, and for deploying them in production environments that meet the desired SLA.

Expectations & Responsibilities:

Take a proactive approach and adapt quickly to ongoing tasks.

Develop contrastive, self-supervised, semi-supervised and natural-language-based learning techniques to boost the performance of computer vision tasks in retail, surveillance, CPG, manufacturing and automation, and to extend their functionality.

Apply zero-shot and few-shot learning to address in-house use cases effectively.

Collaborate on vision-language integration, employing cutting-edge vision/graphics and language-based prompt tuning, and demonstrate proficiency in open-vocabulary classification, object detection, segmentation and image retrieval tasks.

Keep up with state-of-the-art research on unimodal generative techniques in computer vision, encompassing adversarial networks, cross-attention mechanisms, and approaches based on latent diffusion modelling.

Demonstrate expertise in fundamental computer vision tasks, including object detection, segmentation, pose estimation, image captioning, and image/video understanding.

Hands-on experience with PyTorch and TensorFlow for prototyping and deploying models.

Collaborate with cross-functional teams to integrate R&D outcomes into practical solutions.

Qualifications & Expertise:

Education:

Qualifications and academic background are not a barrier for the right candidate with proven skills.

A Master's or Ph.D. (completed or degree awaited) in applications of Computer Vision or Artificial Intelligence, together with the aforementioned skills and a strong inclination towards industrial research applications, will be preferred.

Work Experience:

Prior experience working with the aforementioned technologies is indispensable. Previous work experience in the CV/NLP and DL domains is preferred but not mandatory.

Vision Expertise: Demonstrated expertise in fundamental computer vision tasks, such as object detection, segmentation, pose estimation, image captioning and image/video understanding.

Technical Skills:

Proficiency in Python and C++ programming is essential. Strong knowledge of PyTorch is required for model development. Exposure to LangChain will be preferred.

Problem-Solving: Strong analytical and problem-solving skills to tackle complex challenges in AI research and development.

Communication: Excellent communication skills, both written and verbal, to articulate research findings, collaborate effectively with cross-functional teams, and prepare proposals for clients.

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Not the right fit? Let us know you're interested in a future opportunity by clicking Introduce Yourself in the top-right corner of the page, or create an account to set up email alerts as new job postings that match your interests become available!


Tags: ChatGPT Classification Computer Vision GPT Industrial LangChain ML models NLP Prototyping Python PyTorch R R&D Research Stable Diffusion TensorFlow Transformers

Perks/benefits: Career development

Region: Asia/Pacific
Country: India
