Senior Machine Learning Engineer
New York City
GreenLite
Founded in 2022, GreenLite is revolutionizing development in America by streamlining the collaboration between developers, builders, and local regulatory authorities. GreenLite’s software powers its Private Plan Review offering, serving many of the nation’s largest public retailers, developers, and production home builders. By leveraging GreenLite’s technology, its customers save months on each project, significantly accelerating their timelines and staying within budget.
GreenLite was founded by experts in technology, development, and the AEC (Architecture, Engineering, and Construction) industry, and is backed by leading venture capital firms. GreenLite is at the forefront of the privatization of construction permitting and plan review, reshaping a multi-hundred-billion-dollar industry.
GreenLite has raised nearly $40M from the country’s leading venture capital investors, including Craft Ventures, who led GreenLite’s $28.5M Series A. We’re well capitalized to achieve our mission of revolutionizing the plan review and construction permitting process across the country.
Why this role matters
We're on a mission to automate one of the most costly and expertise-dependent bottlenecks in the built environment — construction plan review. Today, plan review is slow, expensive, and highly manual, requiring licensed experts to navigate thousands of unique jurisdictional construction codes and complex architectural documents. We believe AI can help.
As a key hire in our AI engineering organization, you’ll operate with a founder mindset, help define our long‑term data strategy, and align multiple squads behind it. Our product roadmap includes computer‑vision and large‑language‑model (LLM) capabilities, but those future models will only be as good as the thought that goes into designing them. As an ML engineer, you will design, build, and own the way that happens and directly influence company OKRs.
What you’ll do
Work with a rich proprietary dataset encompassing:
Access to building codes across every jurisdiction
Expert-generated comments on building plans
In-house architects and code experts shaping the problem and validating results
Own, together with data engineering, the decisions on the tech stack we use for MLOps.
Alongside data engineering, optimize scalable data processing pipelines on our platform and maintain ML infrastructure.
Model Development: Design, develop, deploy and monitor innovative ML solutions that drive efficiency for our customers
End-to-end Workflow Orchestration: Design and maintain complex workflows that automate the path from feature store to inference.
Design and construct greenfield pipelines that automate pre-training and fine-tuning domain-specific LLM/CV models.
Define and shape RLHF and retrieval pipelines to inject code-compliance knowledge.
Evaluations and Experimentation: Experiment, evaluate and implement novel research ideas for proprietary LLMs/CVMs
You may be a fit if you...
Have a graduate degree in Computer Science, Data Science, or an ML-related field, or equivalent industry experience
Have 2+ years of experience shipping ML systems (not just research)
Have experience shipping AI models into production (bonus: experience with AI Agents / LLM orchestration and ML Ops)
Can efficiently translate open-ended problems into actionable solutions
Are familiar with implementing novel NLP / CVM research ideas and techniques. Prior publications in top conferences or journals, or other evidence of staying current (e.g. open source contributions, conference talks), are a plus.
Are excited to encode dense regulations and messy CAD and PDF data into structured, learnable signals.
Prior experience in a startup environment is a plus.
Are excited about building AI systems that go beyond benchmarks into the real-world messiness of imperfect data
Thrive in “debate, decide, deliver” cultures—turning ambiguous product goals into concrete, maintainable systems.
What success looks like...
90 days: LLM/CV model trained on a reproducible pipeline and a clear ML roadmap signed off by product & domain experts.
6 months: Alpha model running in shadow via a fully automated MLOps pipeline, showing >20% quality lift on real plans.
12 months: Model live in customer beta, cutting manual review time ≥25% with auto‑retraining and monitoring that scales to new hires.
Why Join Us?
Shape the data platform and labeling strategy that future hires will depend on.
Be an early technical leader on a high-impact AI product and trailblaze the first AI model for the building compliance industry.
Work directly with architects, engineers, and domain experts who shape the system
Contribute to making the built world more efficient, sustainable, and safe
Solve one of the most under-explored AI problems in the real world (with a huge market size)
Operating principles in action
Own It: You act like the CEO of our data layer—no broken pipelines on your watch.
Solve Real Problems: Every schema, pipeline, and quality check exists to deliver ML‑ready data that shortens training cycles and lifts model performance—turning raw construction files into fuel for CV & LLM models.
Full Send: You move with urgency, never at the expense of data integrity.
Be Real: Radical candor about trade‑offs and technical debt keeps the team honest.
If you’re ready to turn raw architectural drawings into the data engine that powers the first AI plan‑reviewer, we’d love to meet you.
Competitive salary
New hire stock equity packages
Annual bonuses based on performance and delivering results
Medical, dental, and vision insurance plans
401(k) savings plan
Employee wellness program
Home productivity stipend
Team building events
Unlimited PTO policy
GreenLite values people from all walks of life and professional backgrounds. We understand not everyone will meet all the above qualifications on day one. That's okay. If you’re passionate about the construction industry or solving the housing crisis in America, and want the opportunity to grow in your career, we encourage you to apply.
GreenLite is an equal employment opportunity employer, committed to an inclusive workplace where we do not discriminate on the basis of race, sex, gender, national origin, religion, sexual orientation, gender identity, marital or familial status, age, ancestry, disability, genetic information, or any other characteristic protected by applicable laws. We believe in diversity and encourage any qualified individual to apply.