Data Scientist
India - Remote
Halo believes in innovation by inclusion to solve digital problems. As an international agency of over 200 people specializing in interactive media strategy and development, we embrace equity and empowerment in a serious way. Our interdisciplinary teams of unique designers, developers and entrepreneurial minds with a variety of backgrounds, viewpoints, and skills connect to solve business challenges of every shape and size. We empathize to form deep, meaningful relationships with our clients so they can do the same with their audience. Working at Halo feels like belonging. Learn more about our philosophy, benefits, and team at https://halopowered.com/As an AI Architect, you will lead the design of scalable, secure, and modern technology solutions, leveraging artificial intelligence, cloud platforms, and microservices—while ensuring alignment with AI governance principles, agile delivery, and platform modernization strategies
As a Data Scientist, you’ll be part of a multidisciplinary team applying advanced analytics, machine learning, and generative AI to solve real-world problems across our consulting, health, wealth, and career businesses. You will collaborate closely with engineering, product, and business stakeholders to develop scalable models, design intelligent pipelines, and influence data-driven decision-making across the enterprise.
Requirements
- Design, develop, and deploy robust machine learning models and data pipelines that support AI-enabled applications.
- Apply exploratory data analysis (EDA) and feature engineering techniques to extract insights and improve model performance.
- Collaborate with cross-functional teams to translate business problems into analytical use cases.
- Contribute to the full machine learning lifecycle: from data preparation and model experimentation to deployment and monitoring.
- Work with structured and unstructured data, including text, to develop NLP and generative AI solutions.
- Define and enforce best practices in model validation, reproducibility, documentation, and versioning.
- Partner with engineering to integrate models into production systems using CI/CD pipelines and cloud-native services.
- Stay current with industry trends, emerging techniques (e.g., RAG, LLMs, embeddings), and relevant tools.
Required Skills & qualifications
- 3+ years of experience in Data Science, Machine Learning, or Applied AI roles.
- Proficiency in Python (preferred) and a strong grasp of pandas, NumPy, and scikit-learn.
- Skilled in data querying, manipulation, and pipeline development using SQL and modern ETL frameworks.
- Experience working with Databricks, including notebooks, MLflow, Delta Lake, and job orchestration
- Experience with Git-based workflows and Agile methodologies.
- Strong analytical thinking, problem-solving skills, and communication abilities.
- Exposure to Generative AI, LLMs, prompt engineering, or vector-based search.
- Hands-on experience with cloud platforms (AWS, Azure, or GCP) and deploying models in scalable environments.
- Knowledge of data versioning, model registry, and ML lifecycle tools (e.g., MLflow, DVC, SageMaker, DataBricks, or Vertex AI).
- Experience working with visualization tools like Tableau, Power BI, or Qlik.
- Degree in Computer Science, Data Science, Applied Mathematics, or a related field
Benefits
- 100% RemoteWork.
- Salary in USD.
- Get to work on challenging projects for the U.S
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile AI governance AWS Azure CI/CD Computer Science Consulting Data analysis Databricks Data pipelines EDA Engineering ETL Feature engineering GCP Generative AI Git LLMs Machine Learning Mathematics Microservices MLFlow ML models NLP NumPy Pandas Pipelines Power BI Prompt engineering Python Qlik RAG SageMaker Scikit-learn SQL Tableau Unstructured data Vertex AI
Perks/benefits: Career development Equity / stock options Health care
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.