ML Engineer — LLM Evaluation
San Francisco, CA
Dynamo AI
Dynamo AI offers end-to-end AI Performance, Security, and Compliance solutions for delivering Enterprise-grade Generative AI.
At Dynamo AI, we believe that LLMs must be developed with safety, privacy, and real-world responsibility in mind. Our ML team comes from a culture of academic research driven to democratize AI advancements responsibly. By operating at the intersection of ML research and industry applications, our team empowers Fortune 500 companies’ adoption of frontier research for their next generation of LLM products. Join us if you:• Wish to work on the premier platform for private and personalized LLMs. We provide the fastest end to end solution to deploy research in the real world with our fast-paced team of ML Ph.D.’s and builders, free of Big Tech / academic bureaucracy and constraints.• Are excited at the idea of democratizing state-of-the-art research on safe and responsible AI.• Are motivated to work at a 2023 CB Insights Top 100 AI Startup and see your impact on end customers in the timeframe of weeks not years.• Care about building a platform to empower fair, unbiased, and responsible development of LLMs and don’t accept the status quo of sacrificing user privacy for the sake of ML advancement.
Salary for this position may vary based on several factors, including the candidate's experience, expertise, and the geographic location of the role. Compensation is determined to ensure competitiveness and equity, reflecting the cost of living in different regions and the specific skills and qualifications of the candidate.
Responsibilities
- Own LLM evaluation processes and methods with a focus on generating benchmarks representative of real-world usage and safety vulnerabilities.
- Generate high quality synthetic data, curate labels, and conduct rigorous benchmarking.
- Deliver robust, scalable, and reproducible production code.
- Push the envelope by developing methods for benchmarking that revamps how we assess the best LLMs for harmlessness and helpfulness. Your research will directly empower our customers to more feasibly deploy safe and responsible LLMs.
- Co-author papers, patents, and presentations with our research team by integrating other members’ work with your vertical.
Qualifications
- Domain knowledge in LLM evaluation and data curation techniques.
- Extensive experience in designing and implementing LLM benchmarking, extending previous methods. Comfortability with leading end-to-end projects.
- Adaptability and flexibility. In both the academic and startup world, a new finding in the community may necessitate an abrupt shift in focus. You must be able to learn, implement, and extend state-of-the-art research.
- Preferred: past research or projects in benchmarking LLMs.
Salary for this position may vary based on several factors, including the candidate's experience, expertise, and the geographic location of the role. Compensation is determined to ensure competitiveness and equity, reflecting the cost of living in different regions and the specific skills and qualifications of the candidate.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
0
0
0
Categories:
Engineering Jobs
Machine Learning Jobs
Tags: LLMs Machine Learning Privacy Research Responsible AI
Perks/benefits: Equity / stock options Startup environment
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Principal Data Scientist jobsPrincipal Data Engineer jobsData Scientist II jobsStaff Data Scientist jobsBI Developer jobsData Manager jobsJunior Data Analyst jobsResearch Scientist jobsData Science Manager jobsBusiness Data Analyst jobsLead Data Analyst jobsSenior AI Engineer jobsData Engineer III jobsData Science Intern jobsSr. Data Scientist jobsData Specialist jobsSoftware Engineer II jobsSoftware Engineer, Machine Learning jobsJunior Data Engineer jobsData Analyst II jobsSenior Data Scientist, Performance Marketing jobsBI Analyst jobsData Analyst Intern jobsSr Data Engineer jobsSenior Artificial Intelligence/Machine Learning Engineer - Remote, Latin America jobs
Economics jobsSnowflake jobsLinux jobsHadoop jobsComputer Vision jobsOpen Source jobsJavaScript jobsPhysics jobsRDBMS jobsMLOps jobsBanking jobsAirflow jobsKafka jobsNoSQL jobsData Warehousing jobsScala jobsR&D jobsGoogle Cloud jobsStreaming jobsKPIs jobsData warehouse jobsClassification jobsGitHub jobsOracle jobsCX jobs
SAS jobsPostgreSQL jobsScikit-learn jobsData Mining jobsScrum jobsPandas jobsDistributed Systems jobsTerraform jobsE-commerce jobsPySpark jobsLooker jobsBigQuery jobsRobotics jobsJira jobsIndustrial jobsJenkins jobsUnstructured data jobsRedshift jobsdbt jobsReact jobsMicroservices jobsData strategy jobsMySQL jobsNumPy jobsPharma jobs