Data Scientist - Gen AI Engineer
Lisbon, PT
Capgemini
A global leader in consulting, technology services and digital transformation, we offer an array of integrated services combining technology with deep sector expertise.Data Scientist - Gen AI Engineer (Lisbon/Porto)
At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world’s most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and engineering services across all industries. Join us for a career full of opportunities. Where you can make a difference. Where no two days are the same.
YOUR ROLE
Develop, fine-tune, and deploy Generative AI models using AWS services like Bedrock, SageMaker, and Lambda.
Work with LLMs, embeddings, transformers, and diffusion models for applications in NLP, image generation, and AI automation.
Optimize prompt engineering, fine-tuning, and Reinforcement Learning from Human Feedback (RLHF) techniques.
Build scalable MLOps pipelines for training and deploying GenAI models using SageMaker, ECS, and Kubernetes.
Process and manage large-scale datasets for AI training using AWS Glue, Athena, and Redshift.
Implement vector databases (Pinecone, Weaviate, FAISS, Amazon OpenSearch) for efficient retrieval-augmented generation (RAG) applications.
Design and optimize ETL pipelines for AI/ML data workflows.
Collaborate with software engineers, DevOps, and product teams to integrate AI models into applications and APIs.
Ensure security, compliance, and data privacy in AI/ML workflows.
Monitor AI model performance and retraining needs using AWS CloudWatch, MLFlow, and other observability tools.
YOUR PROFILE
Strong background in Data Science, Machine Learning, and Generative AI.
Proficiency in Python, SQL, and ML frameworks (TensorFlow, PyTorch, Hugging Face Transformers).
Experience with AWS AI/ML services such as SageMaker, Bedrock, Lambda, and Comprehend.
Hands-on experience with LLMs, embeddings, transformers, and diffusion models.
Familiarity with Retrieval-Augmented Generation (RAG), vector databases, and knowledge graphs.
Experience in MLOps, containerization (Docker, Kubernetes, ECS), and CI/CD for ML pipelines.
Solid understanding of cloud optimization, distributed computing, and model scaling.
Strong data engineering skills for processing large datasets in AWS Glue, Athena, or Spark.
Knowledge of NLP, image generation models, or multimodal AI solutions.
Nice to have:
Experience with fine-tuning open-source models (LLaMA, Falcon, Mistral, Stable Diffusion).
AWS certifications such as AWS Certified Machine Learning – Specialty.
Experience with real-time AI applications, chatbot development, or autonomous agents.
Knowledge of ethical AI, bias mitigation, and AI safety best practices.
WHAT YOU'LL LOVE ABOUT WORKING HERE
- Join a multicultural and inclusive team environment.
- Enjoy a supportive atmosphere promoting work-life balance.
- Engage in exciting national and international projects.
- Hybrid work.
- Your career growth is central to our mission. Our array of career growth programs and diverse professionals are crafted to support you in exploring a world of opportunities.
- Training and certifications programs.
- Health and life insurance.
- Referral program with bonuses for talent recommendations.
- Great office locations.
ABOUT CAPGEMINI
Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be able to reimagine what’s possible. Join us and help the world’s leading organizations unlock the value of technology and build a more sustainable, more inclusive world.
Apply now!
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Athena AWS AWS Glue Chatbots CI/CD DevOps Diffusion models Docker ECS Engineering ETL FAISS Generative AI Kubernetes Lambda LLaMA LLMs Machine Learning MLFlow MLOps NLP OpenSearch Open Source Pinecone Pipelines Privacy Prompt engineering Python PyTorch R RAG R&D Redshift Reinforcement Learning RLHF SageMaker Security Spark SQL Stable Diffusion TensorFlow Transformers Weaviate
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.