Software Engineer, Data
San Francisco
Full Time Senior-level / Expert USD 180K - 280K
Harvey
Harvey builds custom LLMs for elite law firms to tackle the most complex legal challenges across every practice area, jurisdiction and legal system in the world. Harvey is a secure AI platform for professionals in law, tax, and finance that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customized and developed by our expert team of lawyers, engineers and research scientists. We’ve found product market fit and are scaling our team very quickly. Some reasons to join Harvey are:
Exceptional product market fit: We have partnered with the largest law firms and professional service providers in the world, including Paul Weiss, A&O Shearman, Ashurst, O'Melveny & Myers, PwC, KKR, and many others.
Strategic investors: Raised over $200 million from strategic investors including Sequoia, Google Ventures, Kleiner Perkins, and the OpenAI Startup Fund.
World-class team: Harvey is hiring the best talent from DeepMind, Google Brain, Stripe, FAIR, Tesla Autopilot, Glean, Superhuman, Figma, and more.
Partnerships: Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services.
Performance: Grew from $0 to $30M ARR in the last 18 months.
Compensation: Top of market cash and equity compensation.
As a Software Engineer, Data on the Engineering team at Harvey, you will own and lead engineering projects across our product lines. We are looking for individuals who have strong backend and infrastructure fundamentals and have experience building products where data is a core component.
This role is based in San Francisco, CA. We use an in-person work model and offer relocation assistance to new employees.
What You’ll Do
Develop distributed crawlers, data pipelines, and storage infrastructure to ingest data from numerous sources, including websites, APIs, law firm knowledge bases, and Harvey’s data partners. These must handle real-time updates while remaining performant and robust.
Work directly with domain experts and customers to understand how to structure complex, referential datasets in Legal, Tax, Finance, etc., then translate that into technical data systems.
Build and scale our Retrieval platform which provides knowledge and citations to our widely used RAG products.
Crawl, structure, and index an entire legal dataset in a particular country, then build an AI application for law firms to perform accurate, grounded research on it. See Harvey’s Research product offerings.
With the partnership of tax experts, ingest tax codes and regulations across 10+ international jurisdictions to create a product used by thousands of the world’s top domain specialists. See Harvey’s Tax model.
Scale data infrastructure to index tens of millions of complex documents, and enable search and retrieval in milliseconds. See Harvey’s massive Case law dataset used to train a custom model with OpenAI.
Explore and deploy cutting edge embedding search technologies. See Harvey’s projects with Lance DB and with Voyage AI.
3+ YoE (post-BS/MS) in an engineering role.
Experience with shipping and scaling an impactful product powered by data, e.g. data pipelines, databases, and backend platforms.
Experience with search infrastructure or vector databases is a plus.
Track record of shipping reliable products and a strong attention to detail.
Grit - experience working at early-stage startups is a plus.
Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.
Tags: APIs Data pipelines Engineering Finance Generative AI LLMs OpenAI Pipelines RAG Research
Perks/benefits: Equity / stock options Relocation support Startup environment