Founding Data Scientist (San Francisco)
San Francisco
Scrollmark
Join us in a high-impact role where you'll build the data foundation that powers next-generation social media insights at Scrollmark. At Scrollmark, we're building SocialGPT (https://www.gpt.social/), your copilot for social media management, empowering brands and creators to grow fanbases, nurture audiences, and drive revenue.
We've raised $8M in seed funding from top-tier investors including Mayfield, Jackson Square Ventures, Lemniscap, Shakti, and Aperiam. As an early-stage startup, we're looking for a data-driven partner to help define our analytics infrastructure and unlock deep insights that drive product and business strategy. You'll report directly to our CTO and collaborate across engineering, product, and design teams, but you'll own end-to-end projects, from raw data ingestion to actionable dashboards, with minimal oversight.
What You'll Do
Design & Build Robust Pipelines: Architect and maintain ETL pipelines that scrape, normalize, aggregate, and store text, image, video, and audio data from multiple social platforms at scale.
Multimodal Modeling: Collaborate with engineers to integrate multimodal models for sentiment extraction, object recognition, and audio-cue detection, then translate those signals into clear trends.
Clean & Enrich Data: Develop automated workflows and quality checks to detect and correct anomalies, handle missing data, and ensure dataset integrity.
Extract Insights & Tell the Story: Use statistics, visualization tools, and dashboards to surface key performance indicators, emerging trends, and recommendations via SocialGPT.
Own the Analytics Roadmap: Partner with stakeholders to prioritize data initiatives, champion best practices, and help shape Scrollmark's long-term vision for analytics and AI.
What We're Looking For
5+ Years of Data Expertise: Proven track record in data science or ML engineering, ideally with experience in social media or large unstructured datasets.
Strong Statistical Foundation: Comfortable with hypothesis testing, regression, time-series analysis, and designing rigorous experiments.
Hands-On Pipeline Skills: Deep experience with Python (Pandas, NumPy), SQL, and orchestration tools like Airflow or Prefect. You'll guide the roadmap for which tools we choose and how we use them.
Multimodal Analysis: Familiarity with applying or integrating vision, text, and audio models, whether via Hugging Face, OpenAI, or custom architectures.
API & Infrastructure Savvy: Experience working with social platform APIs (Facebook, Instagram, TikTok, Twitter, YouTube) and cloud data stores (GCS, BigQuery, Snowflake).
Hacker Mentality: You take ownership, thrive in ambiguity, and can shepherd complex data projects from concept to production in a fast-paced startup.
Communication & Storytelling: Ability to translate raw data into compelling narratives and recommendations for both technical and non-technical audiences.
Tags: Airflow APIs Architecture BigQuery Copilot Engineering ETL GPT Machine Learning NumPy OpenAI Pandas Pipelines Python Snowflake SQL Statistics Testing
Perks/benefits: Startup environment