Lead - GenAI Testing and Evaluation Framework - VP – Citi Wealth
388 GREENWICH STREET - TOWER, United States
Full Time Senior-level / Expert USD 142K - 213K
Citi
Citi is a leading global bank for institutions with cross-border needs, a global provider in wealth management and a U.S. personal bank.
Position Overview:
We are seeking an innovative and detail-oriented professional to lead the development and management of the Generative AI (GenAI) testing and evaluation framework. This role focuses on creating patterns, methodologies, and iterative structures to optimize the performance and effectiveness of GenAI models, with a particular emphasis on prompt engineering and evaluation. The ideal candidate will have a strong background in GenAI, a deep understanding of natural language processing, and a passion for refining AI solutions through rigorous testing and iteration.
Key Responsibilities:
Framework Development:
Design and implement a comprehensive testing and evaluation framework for GenAI model outputs.
Develop standards and patterns for assessing the quality and "goodness" of prompts across diverse use cases.
Create iterative processes for testing and refining prompts to optimize model outputs.
Prompt Engineering and Evaluation:
Establish criteria for evaluating prompt performance, including accuracy, completeness, relevance, coherence, and alignment with desired outcomes.
Experiment with prompt structures to identify optimal configurations for various business applications.
Develop and document best practices for prompt design and refinement.
Collaboration and Integration:
Work closely with tech partners, engineers, and product teams to ensure testing frameworks integrate seamlessly into the development lifecycle.
Partner with stakeholders to understand business requirements and tailor testing methodologies to address specific needs.
Provide actionable insights and recommendations to improve model performance based on evaluation results.
Tooling and Automation:
Identify and implement tools for automating the testing and evaluation process.
Develop dashboards and reporting mechanisms to monitor prompt and model performance metrics.
Stay updated on emerging tools and techniques in AI testing and integrate them into the framework.
Continuous Improvement:
Establish feedback loops to iteratively improve testing methodologies and evaluation standards.
Establish a process for ongoing monitoring of prompts once they are in production.
Monitor industry trends and advancements in Generative AI to ensure the framework remains cutting-edge.
Advocate for a culture of experimentation and continuous learning within the organization.
Qualifications and Experience:
Expertise in Generative AI and natural language processing (NLP) models.
Strong proficiency in prompt engineering and familiarity with frameworks for AI evaluation.
Hands-on experience with AI tools, libraries, and cloud platforms.
Key Attributes for Success:
Analytical Mindset:
Strong problem-solving skills and ability to derive actionable insights from complex data.
Attention to detail with a focus on precision and accuracy in evaluation.
Technical Expertise:
Deep understanding of AI/ML testing methodologies and best practices.
Proficiency in programming languages such as Python and experience with relevant libraries (e.g., PyTorch, TensorFlow).
Innovative Thinking:
Passion for exploring new methodologies to improve AI evaluation frameworks.
Creativity in designing experiments and testing approaches.
Collaboration and Communication:
Excellent communication skills to convey technical concepts to diverse audiences.
Ability to work collaboratively across cross-functional teams and influence stakeholders.
Adaptability:
Comfortable working in a fast-paced, dynamic environment.
Willingness to learn and adapt to new tools, technologies, and methodologies.
Why Join Us:
Lead the development of a transformative AI testing framework in a forward-thinking organization.
Work with cutting-edge technologies and a team of passionate innovators.
Contribute to impactful projects that shape the future of Generative AI.
Enjoy competitive compensation and benefits with opportunities for professional growth.
If you are driven to refine and optimize AI solutions through innovative testing frameworks and have the expertise to lead this effort, we encourage you to apply.
------------------------------------------------------
Job Family Group:
Decision Management
------------------------------------------------------
Job Family:
Business Analysis
------------------------------------------------------
Time Type:
Full time
------------------------------------------------------
Primary Location:
New York, New York, United States
------------------------------------------------------
Primary Location Full Time Salary Range:
$142,320.00 - $213,480.00
In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards. Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs. Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays. For additional information regarding Citi employee benefits, please visit citibenefits.com. Available offerings may vary by jurisdiction, job level, and date of hire.
------------------------------------------------------
Anticipated Posting Close Date:
Jan 30, 2025
------------------------------------------------------
Citi is an equal opportunity and affirmative action employer.
Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View the "EEO is the Law" poster. View the EEO is the Law Supplement.
View the EEO Policy Statement.
View the Pay Transparency Posting.
Tags: Engineering Generative AI Machine Learning NLP Prompt engineering Python PyTorch TensorFlow Testing
Perks/benefits: Career development Competitive pay Health care Insurance Medical leave Transparency