MLDL - Bilingual, Bedrock

US, MA, Virtual Location - Massachuset

Applications have closed

Amazon.com

Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa...

View all jobs at Amazon.com

Find more jobs like this

Posted 8 months ago

The Bedrock team is a team of data linguists who primarily support the training of different models in the AWS generative AI platform. We work with different model types, such as text-to-text, text-to-image, text-to-speech, and others, generating data for ML model training, as well as toxic content evaluation, and categorization. Some of the aspects of ML development that the Bedrock team works with include Responsible AI, Reinforcement Learning from Human Feedback, Supervised Fine Tuning, and Human Content Evaluation. Our team represents a great array of experience in the field of linguistics, including sociolinguistics, computational linguistics, conversation analysis, syntax-semantics, linguistic typology, ESL and foreign languages, as well as translation.

Key job responsibilities
* Build a thorough understanding of data collection and annotation guidelines and various annotation tools.
* Annotate, generate and QA data, identifying linguistic categories based on detailed annotation and adhering to guidelines.
* Use generative AI to facilitate workflows or automate repetitive tasks
* Monitor AI outputs for biases or ethical issues and adjusting inputs to mitigate these risks.
* Perform annotation related tasks; you participate in data generation, collection and quality assurance tasks
* Collaborate with other ML Data Linguists to resolve data ambiguities and annotation disagreements.
* Dive deep into the data to perform qualitative error trend analysis, and devise action plan to improve data quality.
* Provide feedback to Language Engineers and Scientists on tool improvements and annotation processes.
* Diving deep into issues and implement solutions independently
* Contribute to process improvements to reduce handling time and improve resource output.
* Develop a variety of language artifacts crucial for model development such as datasets for training and evaluation.
* Support and consult in pre-screening interviews for Data Associates.
* Collaborate with LEs, scientists, and Ops Manager to innovate processes, tracker automations, and workflows.
* Assist LEs in communication with vendor to provide detailed feedback to annotators.

Basic Qualifications

* Bachelor's degree in Linguistics, Philosophy, Cognitive Science, a foreign language, or Literature.
* Ability to identifying linguistic ambiguity, and other inaccuracies in linguistic data, as well as identify basic parts of speech, and produce reports of analyzed data.
* At least 6 months of experience with natural language data labeling, data annotation, linguistic annotation or other forms of data markup, and/or teaching experience, as well as experience leading a team of peers.
* Knowledge of different domains such as Finance, Health Care, and/or Insurance.
* Ability to generate innovative and diverse inputs to explore various aspects of an AI model's capabilities
* Familiarity with json, yaml, xml or other forms of text markup.
* Ability to navigate a Unix terminal and use common command line tools
* Knowledge of Python, Java or any other scripting language.
* Strong organizational and leadership skills and detail-oriented.
* Ability to communicate well and actively listen with other data associates on a team.
* Ability to deliver high quality results under tight deadlines.
* Comfortable working in a fast paced, collaborative work environment.
* Willingness to support several projects at one time, and to accept re-prioritization as necessary.

Preferred Qualifications

* Master's degree in a relevant field, such as Linguistics, Communications, a foreign language, computational linguistics or other language, data or tech related disciplines is a plus.
* Proficient in another foreign language.
* Familiarity with common text processing tools.
* Passion for language, linguistics, human language technology and AI.
* Ability to work in different operating systems (Windows, MacOS, or Linux).
* Strong understanding of NLP concepts and techniques

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

Find more jobs like this

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 0 0 0

Categories: Big Data Jobs Machine Learning Jobs

Tags: AWS Data quality Finance Generative AI Java JSON Linguistics Linux Machine Learning ML models Model training NLP Python Reinforcement Learning Responsible AI RLHF Teaching XML