Founding Data Scientist (San Francisco)

Join us in a high-impact role where you’ll build the data foundation that powers next‑generation social media insights. At Scrollmark, we’re building SocialGPT (https://www.gpt.social/), your copilot for social media management, which helps brands and creators grow fanbases, nurture audiences, and drive revenue.

We’ve raised $8M in seed funding from top‑tier investors including Mayfield, Jackson Square Ventures, Lemniscap, Shakti, and Aperiam. As an early‑stage startup, we’re looking for a data‑driven partner to help define our analytics infrastructure and unlock deep insights that drive product and business strategy. You’ll report directly to our CTO and collaborate across engineering, product, and design teams, but you’ll own projects end to end, from raw data ingestion to actionable dashboards, with minimal oversight.

What You’ll Do

  • Design & Build Robust Pipelines: Architect and maintain ETL pipelines that scrape, normalize, aggregate, and store text, image, video, and audio data from multiple social platforms at scale.

  • Multimodal Modeling: Collaborate with engineers to integrate multimodal models that extract sentiment, recognize objects, and detect audio cues, then translate those signals into clear trends.

  • Clean & Enrich Data: Develop automated workflows and quality checks to detect and correct anomalies, handle missing data, and ensure dataset integrity.

  • Extract Insights & Tell the Story: Use statistics, visualization tools, and dashboards to surface key performance indicators, emerging trends, and recommendations via SocialGPT.

  • Own the Analytics Roadmap: Partner with stakeholders to prioritize data initiatives, champion best practices, and help shape Scrollmark’s long‑term vision for analytics and AI.


What We’re Looking For

  • 5+ Years of Data Expertise: Proven track record in data science or ML engineering, ideally with experience in social media or large unstructured datasets.

  • Strong Statistical Foundation: Comfortable with hypothesis testing, regression, time‑series analysis, and designing rigorous experiments.

  • Hands‑On Pipeline Skills: Deep experience with Python (Pandas, NumPy), SQL, and orchestration tools like Airflow or Prefect. You’ll guide the roadmap for the tools we choose and how we use them.

  • Multimodal Analysis: Familiarity with applying or integrating vision, text, and audio models, whether via Hugging Face, OpenAI, or custom architectures.

  • API & Infrastructure Savvy: Experience working with social platform APIs (Facebook, Instagram, TikTok, Twitter, YouTube) and cloud data stores (GCS, BigQuery, Snowflake).

  • Hacker Mentality: You take ownership, thrive in ambiguity, and can shepherd complex data projects from concept to production in a fast‑paced startup.

  • Communication & Storytelling: Ability to translate raw data into compelling narratives and recommendations for both technical and non‑technical audiences.

Category: Data Science Jobs

Tags: Airflow APIs Architecture BigQuery Copilot Engineering ETL GPT Machine Learning NumPy OpenAI Pandas Pipelines Python Snowflake SQL Statistics Testing

Perks/benefits: Startup environment

Region: North America
Country: United States
