Computational Linguistics Expert
Delhi
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Wadhwani AI
Wadhwani AI is an independent nonprofit institute developing AI-based solutions for underserved communities in developing countries.Location: Delhi,None,None
SUMMARY
The Computational Linguistics Expert will play a pivotal role in advancing Bhashini’s mission to bridge linguistic barriers across India. This role involves developing and refining AI-driven language technologies, collaborating with diverse stakeholders, and contributing to the creation of multilingual digital solutions. The ideal candidate will possess a deep understanding of computational linguistics, natural language processing (NLP), and the unique challenges associated with Indian languages.
Location- Delhi
1 Year Contractual Role
ABOUT US - https://www.wadhwaniai.org/
Wadhwani AI is a nonprofit institute building and deploying applied AI solutions to solve critical issues in public health, agriculture, education, and urban development in underserved communities in the global south. We collaborate with governments, social sector organizations, academic and research institutions, and domain experts to identify real-world problems and develop practical AI solutions to tackle these issues to make a substantial positive impact.
We have over 30 AI projects supported by leading philanthropies such as the Bill & Melinda Gates Foundation, USAID, and Google.org. With a team of over 200 professionals, our expertise encompasses AI/ML research and innovation, software engineering, domain knowledge, design, and user research.
In the Press:
G20 India’s Presidency: AI Healthcare, Agriculture, & Education Solutions Showcased Globally.
Wadhwani AI Takes an Impact-First Approach to Applying Artificial Intelligence - data.org
Cultures page of Wadhwani AI - https://www.wadhwaniai.org/culture/
ABOUT BHASHINI
Bhashini, an initiative under the National Language Translation Mission by MeitY, aims to make digital services accessible in local languages using AI and NLP technologies. By providing translation services for 22 scheduled Indian languages, Bhashini seeks to empower citizens, enhance digital governance, and foster social inclusion. The platform encompasses a comprehensive framework, including a data repository, a model repository, and the Universal Language Contribution API (ULCA), facilitating the development of AI technologies such as machine translation, automatic speech recognition (ASR), text-to-speech (TTS), and optical character recognition (OCR) across various Indian languages.
ROLES AND RESPONSIBILITIES
Language Technology Development: Design, develop, and optimize NLP models tailored for Indian languages, focusing on tasks such as machine translation, ASR, TTS, and OCR and approaches towards fine tuning of the models.
Data Annotation and Curation: Lead efforts in collecting, annotating, and curating linguistic data, ensuring high-quality datasets for training and evaluation.
Stakeholder Collaboration: Engage with startups, academic institutions, state governments, and line ministries to identify new use cases and drive the adoption of Bhashini’s language technologies.
Research and Innovation: Stay abreast of the latest advancements in computational linguistics and AI, integrating cutting-edge techniques into Bhashini’s solutions.
Proposal Development: Contribute to the drafting of proposals for new projects and partnerships with donors, multilateral organizations, and other stakeholders.
Documentation and Reporting: Prepare comprehensive documentation, including technical reports, research papers, and user manuals, to support the dissemination and adoption of developed technologies.
REQUIREMENTS
Educational Background: Master’s or Ph.D. in Computational Linguistics, Computer Science, Artificial Intelligence, or a related field.
Experience: Minimum of 6 years of experience in NLP, with a focus on Indian languages and multilingual systems.
Technical Skills: Proficiency in programming languages such as Python, familiarity with NLP libraries (e.g., TensorFlow, PyTorch, Hugging Face), and experience with large language models.
Linguistic Expertise: Deep understanding of the linguistic nuances of Indian languages, including syntax, semantics, and phonetics.
Project Management: Demonstrated ability to manage projects, coordinate with diverse stakeholders, and deliver results within stipulated timelines.
Communication Skills: Strong written and verbal communication skills, with the ability to convey complex technical concepts to non-technical audiences.
DESIRABLE QUALIFICATIONS
Experience with crowdsourcing platforms and initiatives, such as Bhasha Daan.
Familiarity with the challenges of low-resource languages and strategies to address them.
Prior involvement in government or public sector projects related to language technology.
Publications in reputed journals or conferences in the field of computational linguistics or NLP.
We are committed to promoting diversity and the principle of equal employment opportunity for all our employees and encourage qualified candidates to apply irrespective of religion or belief, ethnic or social background, gender, gender identity, and disability. If you have any questions, please email us at careers@wadhwaniai.org.
Apply to this job* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs ASR Computer Science Engineering Linguistics LLMs Machine Learning NLP Nonprofit OCR Python PyTorch Research TensorFlow
Perks/benefits: Conferences
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.