Data Engineer - Bangalore, India - JPMC
India
Job Summary
- We are seeking a Data Engineer to help build and integrate a Generative AI-powered conversational assistant, into our website and mobile app. This role is crucial in handling data pipelines, model training, and infrastructure setup to deliver a seamless, privacy-compliant experience for users seeking personalized health insights. The Data Engineer will work closely with our AI and software development teams to design scalable data solutions within Google Cloud Platform (GCP) to support this next-generation AI service.
- Data Integration & Pipeline Development: Design and implement data pipelines to support training and finetuning of knowledge base and user data, ensuring data quality, scalability, and efficiency.
- Data Processing & Transformation: Develop data transformation processes to prepare data for Natural Language Processing (NLP) models, facilitating personalized and accurate health recommendations.
- Privacy & Security Compliance: Ensure all data handling practices comply with privacy and security standards, focusing on user data protection within AI model training and deployment.
- Infrastructure Setup & Management: Build and maintain foundational cloud infrastructure on GCP to host, deploy, and scale securely and efficiently across platforms.
- Collaboration with AI & DevOps Teams: Partner with AI/ML and DevOps teams to finetune, test, and optimize NLP models for production, focusing on deployment performance and user experience.
- Website & Mobile Integration Support: Work alongside frontend developers to ensure smooth data flow and integration between the backend, website and mobile app.
- Monitoring & Optimization: Implement monitoring, logging, and automated alerts to ensure data pipelines, model interactions, and infrastructure meet performance and reliability requirements.
- Education: Bachelor’s or Master’s in Computer Science, Data Engineering, or a related field.
- Experience:
- 3+ years in data engineering, preferably within Generative AI or NLP-focused projects.
- Hands-on experience with Google Cloud Platform (GCP), including BigQuery, Dataflow, and Cloud Storage.
- Proven ability in data pipeline design and data transformations for AI model training.
- Skills:
- Strong programming skills in Python and familiarity with SQL.
- Experience with DevOps tools (e.g., Kubernetes, Docker) and CI/CD pipelines in GCP.
- Proficient in data management practices, data privacy, and security protocols.
- Familiarity with AI/ML workflows, specifically NLP model training and finetuning.
- Nice to Have:
- Experience working with Contentful, or React Native integrations.
- Knowledge of MLOps practices to support continuous model training and deployment.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: BigQuery CI/CD Computer Science Dataflow Data management Data pipelines Data quality DevOps Docker Engineering GCP Generative AI Google Cloud Kubernetes Machine Learning MLOps Model training NLP Pipelines Privacy Python React Security SQL
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.