Senior ML Data Platform Engineer - INDIA (Remote)
Remote
Luxury Presence
Enhance your luxury real estate marketing with award-winning real estate website designs & digital marketing. Created for high-grossing agents, teams & brokers.About the RoleWe’re seeking a Senior ML-focused Data Platform Engineer to strengthen our MLS data platform team. You will build robust data pipelines and deliver advanced ML solutions—embeddings, fine-tuning, retrieval-augmented generation (RAG), and reinforcement learning from user feedback. Your work powers property discovery, personalized recommendations, conversational agents, and the evaluation infrastructure that keeps them improving.
Who is the Data Squad?We make sure clean, reliable MLS listing records and user click-stream data is always available to our products and customers. Our team—a mix of data engineers and software engineers—owns the entire listing pipeline: ingestion, transformation, and normalization across 400+ MLS feeds and other sources. We also extend the platform to capture user-activity data for new features and build AI agents that automate feed onboarding and listing-issue triage, reducing manual effort for internal teams and clients and shortening the path from data to business impact.
Strategic Projects (Year 1)
- Autonomous MLS AI agents: Launch AI agents that onboard new MLS feeds and triage / resolve listing issues using structured and unstructured data.
- Evaluation & monitoring: Expand A/B testing, offline metrics, and model-health dashboards to maximize relevance and engagement.
- Shared ML/data infra: Stand up embedding pipelines, vector databases, and personalization APIs for current and future AI features.
What You’ll Do
- Architect and operate scalable batch & streaming data systems (Spark/EMR, Kafka, SQL).
- Own data quality, performance, and reliability through automated testing and monitoring.
- Design transactional and analytical data models and feature stores.
- Fine-tune embeddings and ML models; deploy RAG‐ and RL-based ranking pipelines.
- Integrate and optimize LLMs for conversational agents.
- Evolve the evaluation stack (A/B, offline metrics, model monitoring) to track impact end-to-end.
- Collaborate with product, engineering, and business stakeholders; mentor peers; shape the long-term ML data-platform strategy.
Our Tech Stack
- Python, Spark Streaming, Kafka, Iceberg, Fast API & NodeJS microservices
- AWS, EMR, Kubernetes, Airflow
- Postgres, DynamoDB, Athena, ElasticSearch, LanceDB
Qualifications (Required)
- BS/MS in Computer Science or related field, or equivalent experience.
- 5+ years building large-scale data pipelines on AWS or GCP with Spark/EMR, Kafka, and SQL.
- Strong with a backend programming language like Python or Java; production experience with TensorFlow or PyTorch.
- Delivered ML-powered features in recommendations, search/ranking, or conversational AI.
- Hands-on with embeddings, RAG, and reinforcement learning from feedback.
- Familiar with vector databases, LLM deployment, Kubernetes/EKS, and modern CI/CD.
- Excellent communicator who drives results across product, engineering, and business teams.
Preferred
- Proven wins building large-scale ranking or personalization platforms.
- Experience with integrating Large Language Models in production systems.
- Experience leading projects and mentoring engineers.
- Proven success working in Agile environments.
The real estate industry is in the midst of a seismic shift, and the future belongs to those who break new ground. As one of the fastest-growing companies in the proptech and marketing sectors, Luxury Presence challenges the status quo of what technology can do for real estate agents, leaders, and brokerages.
We’re a team of agile and tenacious innovators working collaboratively to drive the industry forward. Together, we build game-changing products that empower modern real estate entrepreneurs to dominate their markets. From award-winning web design to agile SEO solutions to cutting-edge AI tools, we deliver tech that anticipates market shifts and keeps our clients ahead of their competition.
Founded in 2016 by Stanford Business School alum Malte Kramer, Luxury Presence has grown to a global team ranked on the Inc. 5000 fastest-growing companies list three years in a row. We’re backed by world-class investors, including Bessemer Venture Partners, Toba Capital, and Switch Ventures, and have raised $52.6 million to date.
More than 15,000 real estate businesses rely on our platform, including 31 of the RealTrends top 100 agents featured in The Wall Street Journal. Additionally, many of the industry’s most powerful brokerages — including Compass, Coldwell Banker, and Sotheby’s International Realty — rely on Luxury Presence as a trusted business partner.
Every year since 2020, Luxury Presence has ranked on BuiltIn’s Best Place to Work lists. HousingWire named our founder and CEO a 2024 Tech Trendsetter, we’ve received several Tech100 Awards, and our lead nurturing tool just scored an Inman Innovation Award for Best AI-Powered Platform.
Luxury Presence is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: A/B testing Agile Airflow APIs Athena AWS CI/CD Computer Science Conversational AI Data pipelines Data quality DynamoDB Elasticsearch Engineering GCP Java Kafka Kubernetes LLMs Machine Learning Microservices ML models Node.js Pipelines PostgreSQL Python PyTorch RAG Reinforcement Learning Spark SQL Streaming TensorFlow Testing Unstructured data
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.