Senior Data Scientist

Remote

Vendelux

AI-Powered Event Intelligence Platform with speaker, sponsor and attendee data covering over 200,000 trade shows and conferences

View all jobs at Vendelux

Apply now Apply later

Vendelux helps companies discover the best events. Event marketers are the unsung heroes of successful companies. From generating leads to building world-class brands, event marketers make magic happen throughout the year. Vendelux is here to help maximize the impact of all the events that a company sponsors and attends.

We are a Series A SaaS company and provide the system of record for event marketing. Our software platform provides proprietary insights that helps high-growth companies find the highest ROI events, conferences and trade shows to attend and sponsor. We have built an AI-powered platform that customers describe as an event marketer’s dream.

Vendelux was founded in 2021, and our recent $14 million Series A was led by FirstMark, whose portfolio includes companies like Shopify, Pinterest, Discord, Airbnb, Draft Kings, Carta and Justworks (amongst others). Our leadership team includes alumni from Shutterstock, Bain, CB Insights, Button, ZoomInfo and Compass.

As a Senior Data Scientist at Vendelux, you will build data solutions that help our customers identify which events to attend and who to connect with at those events to achieve maximum ROI. You will apply methods in statistics, machine learning, NLP, and LLMs to problems such as ranking events, predicting attendance, extracting content from unstructured text, and enhancing data quality to enable our customers to have complete confidence in our products. Along the way you will extract and share insights from your findings and build data pipelines to support end to end implementation of your models. We are looking for candidates who are comfortable owning the full lifecycle of model development, from research to implementation, while collaborating with data and engineering teams as needed.

Scope of Responsibilities

  • Collaborate with stakeholders and other engineers to define success criteria, frame machine learning problems, align model metrics with business goals, design minimum viable products, architect and implement model solutions in production.

  • Perform analysis and modeling with large data sets, including discovering data sources, accessing and cleaning data, and developing feature and prediction pipelines.

  • Apply data mining, NLP, machine learning and generative AI to real-world problems, including but not limited to: supervised/unsupervised learning, large language models, and causal inference.

  • Communicate insights and results to peers and leaders, promoting a culture of collaboration and learning across teams via mentoring, documentation, presentations, or other knowledge-sharing methods.

  • Collaborate with engineers in order to design scalable implementations of your models.

  • Proactively research and explore emerging technology and state-of-art methods, consider possible extensions and prototype new modeling ideas to solve customer problems.

  • Evangelize appropriate technology, data and engineering best practices.

  • Work with stakeholders including engineering, product and executives and assist them with data-related technical issues.

  • Identify bottlenecks and implement improvements to our processes and tools. We're early and the expectation of folks joining at this stage is that you'll play a huge part in setting and improving how we work. Our current stack is Python, Dagster, MySQL and Snowflake, but we’re early stage and open to change if it makes sense.

Qualifications

  • Minimum of 5 years of relevant data science experience with a BS or equivalent experience in an appropriate technology field (Computer Science, Statistics, Applied Math, Operations Research, etc.).

  • 3+ years of industry experience in building machine learning or genAI systems including model training, tuning, deploying, and monitoring. 

  • Experience in Cloud-based infrastructure; proficiency in SQL, PySpark, etc.

  • Experience in data pipelines and workflow management tools like Dagster or Airflow a plus. 

  • Experience with ML Ops processes and deploying/productizing ML models a plus.

  • Practical AI skills, including experience with prompt engineering, RAG, and shipping an LLM to production a plus..

  • Track record of shaping and shipping valuable features.

  • Judgment to take on technical debt and risk where appropriate.

  • Strong communication skills, especially written. 

  • Previous startup experience.

Benefits

  • Competitive base salary and bonus

  • Healthcare covering medical, dental and vision 

  • Work remotely or from our NYC HQ

  • Unlimited PTO plus two company-wide shutdowns during the July 4th week and the Christmas – New Years week 

Not all candidates will check all of the requirements listed above and that’s ok! We are open to great people from non-traditional backgrounds.

Vendelux is proud to be an equal opportunity workplace. We are committed to equal opportunity regardless of race, color, ancestry, religion, gender, gender identity, parental or pregnancy status, national origin, sexual orientation, age, citizenship, marital status, disability, or veteran status.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  2  1  0
Category: Data Science Jobs

Tags: Airflow Causal inference Computer Science Dagster Data Mining Data pipelines Data quality Engineering Generative AI LLMs Machine Learning Mathematics ML models Model training MySQL NLP Pipelines Prompt engineering PySpark Python RAG Research Snowflake SQL Statistics Unsupervised Learning

Perks/benefits: Career development Competitive pay Conferences Health care Salary bonus Startup environment Team events Unlimited paid time off

Region: Remote/Anywhere

More jobs like this