Data Scientist
New York, NY, US, 10281
Associated Press
Discover a world of unbiased journalism, in-depth analysis, and real-time updates from every corner of the globe.The Associated Press is an independent global news organization dedicated to factual reporting. Founded in 1846, AP today remains the most trusted source of fast, accurate, unbiased news in all formats and the essential provider of the technology and services vital to the news business. More than half the world's population sees AP journalism every day.
Duties: Perform data analysis, evaluate commercial and open-source models, and help design data science and data engineering solutions supporting AI initiatives, news search and discovery, content enrichment and metadata generation. Partner with search team to define, fine-tune and test NLP and machine learning capabilities to improve customer search experience. Set up and execute annotation tasks to build quality data training sets. Use knowledge of current best practices and available datasets to research and find best matching data to supplement training tasks. Fine-tune existing statistical and machine learning models used in production. Measure performance improvements and present findings to stakeholders. Combine, clean and pre-process Google Analytics and other data from various sources to ensure accuracy and usability for analysis or modeling. Analyze news metadata to find and propose fixes to support content search and discovery. Help stakeholders understand and evaluate opportunities to use GenAI in their platforms.
Requirements: Master’s degree or foreign equivalent in Computer Science, Data Science or a related field and 2 years of experience in the position offered or related. Must have 1 year of experience with the following: using Python, Pandas and NumPy to perform data analysis for data science work including ML modeling, fine tuning, clustering, statistical modeling, topic modeling, and/or search and retrieval systems; implementing, demonstrating, and presenting PoCs developed using data science methods; using data analysis and data science tools and methods, including LLMs and ETL; accessing and working with data in standard data formats, including xml and json; working with text content and metadata using NLP techniques. Must have 1 year of experience in an editorial or reporting role at a journalism or media organization. Salary: $112,000 - $125,000.20/year.
Application deadline is April 24, 2025 at 11:59PM EST.
AP seeks to build an inclusive organization grounded in respect for differences. We support all aspects of diversity and provide equal employment opportunities to all employees and applicants without regard to race, color, religion, sex, marital status, national origin, age, sexual orientation, gender identity, disability, status as a veteran, or other characteristic protected by law.
Tags: Clustering Computer Science Data analysis Engineering ETL Generative AI JSON LLMs Machine Learning ML models NLP NumPy Open Source Pandas Python Research Statistical modeling Statistics Topic modeling XML
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.