Gen AI ML/LLM Ops Manager
Business Office - Athens, Greece
Mondelēz International
Mondelēz International, Inc. (NASDAQ: MDLZ) is one of the world’s largest snacks companies, empowering people to snack right in over 150 countries.Job Description
Are You Ready to Make It Happen at Mondelēz International?
Join our Mission to Lead the Future of Snacking. Make It Uniquely Yours.
Mondelēz Consumer Solutions Global MDS (IT) Team is looking for an energetic and experienced candidate with strong AI infrastructure skills to join us during an exciting time of transformation. The GenAI Ops Engineer will be responsible for the reliable and efficient operation of Generative AI pipelines that power our Marketing Creative Agentic platform.
This role involves managing the end-to-end lifecycle of GenAI models (text, image, and in the future video), ensuring scalability, performance, and cost-effectiveness. The engineer will also play a key role in shaping the future of agentic architectures, working with emerging frameworks such as Google’s Agent2Agent (with ADK) and LangGraph, as we evolve our platform toward more intelligent, modular, and autonomous capabilities.
The GenAI Ops Engineer will work across MDS (IT) internal and external teams, collaborate with technology partners, and engage with business stakeholders. A strong foundation in AI, ML, and Cloud technologies will be essential for guiding both the current platform operations and future innovations.
How you will contribute
Key Responsibilities:
- Model Deployment and Management: Deploy, monitor, and maintain generative models (including LLMs and image generators) in a production environment, ensuring high availability and optimal performance.
- Model Comparison & Selection: Evaluate and compare various GenAI models based on accuracy, latency, cost, and relevance to specific content generation tasks.
- Model Transformation & Optimization: Apply fine-tuning, quantization, and other optimization techniques to enhance model performance and resource efficiency.
- Designing, implementing, and optimizing generative AI models for various market applications, while evolving the platform with newer GenAI frameworks, agents, models.
- Infrastructure Management: Design and manage the infrastructure supporting GenAI workloads, including compute, networking, and storage, primarily on GCP.
- Performance Monitoring and Optimization: Implement robust monitoring and alerting systems to identify and address performance bottlenecks, latency issues, and other anomalies.
- Cost Management: Optimize resource utilization and implement cost-saving measures to minimize the operational expenses associated with LLM infrastructure.
- Agentic Framework Support: Experience of integration of agentic workflows using frameworks such as LangGraph and Google’s Agent2Agent (ADK), MCP, and support their deployment into production environments.
- Collaboration: Work closely with data scientists, ML engineers, and platform teams to ensure seamless end-to-end delivery of GenAI solutions.
- Security and Compliance: Implement security best practices to protect LLM infrastructure and data from unauthorized access and ensure compliance with relevant regulations.
- Troubleshooting and Support: Act as a primary responder for GenAI-related operational issues, providing timely troubleshooting and resolution
- Staying Current: Stay informed on advancements in GenAI and agentic systems, and proactively identify opportunities for innovation and process improvement.
- Direct interactions with Architecture, Application/Platform, and Technical Product teams to guide technology selections.
What you will bring
Required Skills:
- Proficient in Google Cloud Platform (GCP) services, particularly Vertex AI, BigQuery, GKE, Cloud Run, and Cloud Functions.
- Experience deploying and optimizing LLMs, image generation models, or other GenAI models in cloud environments..
- Experience with evolving technologies like MCP and A2A — and the ability to apply them in a business and marketing context
- Ability to design and implement scalable AI computing infrastructures and application stacks for efficient ML operations
- Familiarity with emerging agentic architectures, including LangGraph, Agent2Agent (ADK), or similar frameworks.
- Understanding of the technical aspects of RAG models, how to retrieve relevant information from knowledge bases and integrate it with LLM output, including their training process and how retrieved information is used to enhance generation
- Working knowledge of NLP, computer vision, and multimodal AI model operations..
Model Transformation Techniques: Experience with techniques like quantization, pruning, and knowledge distillation for optimizing LLMs. - Familiarity with ML frameworks such as Transformers, PyTorch, or TensorFlow..
- Containerization and Orchestration: Proficiency in containerization technologies (e.g., Docker, Kubernetes) and orchestration tools.
- Hands-on experience with monitoring and logging solutions, including open-source tools (e.g., Prometheus, Grafana, ELK stack) and cloud-native platforms such as Google Cloud Logging and Cloud Monitoring.
- Automation: Strong scripting and automation skills (e.g., Python, Bash).
- Knowledge of security best practices in cloud and AI infrastructure.
- Excellent communication skills, capable of articulating technical visions and strategies to a diverse audience, fostering cross-functional collaboration and consensus.
- Eager to embrace new technologies, methodologies, and industry developments, with a commitment to lifelong learning and professional growth.
- Evaluation of ML and LLM models, required
- (Nice to have) Advanced skills in data integration techniques, ETL processes, and familiarity with APIs and web services for connecting disparate data sources in a secure and efficient manner
Business Unit Summary
At Mondelēz International, our purpose is to empower people to snack right by offering the right snack, for the right moment, made the right way. That means delivering a broad range of delicious, high-quality snacks that nourish life's moments, made with sustainable ingredients and packaging that consumers can feel good about.
We have a rich portfolio of strong brands globally and locally including many household names such as Oreo, belVita and LU biscuits; Cadbury Dairy Milk, Milka and Toblerone chocolate; Sour Patch Kids candy and Trident gum. We are proud to hold the top position globally in biscuits, chocolate and candy and the second top position in gum.
Our 80,000 makers and bakers are located in more than 80 countries and we sell our products in over 150 countries around the world. Our people are energized for growth and critical to us living our purpose and values. We are a diverse community that can make things happen—and happen fast.
Mondelēz International is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation or preference, gender identity, national origin, disability status, protected veteran status, or any other characteristic protected by law.
Job Type
RegularSoftware & ApplicationsTechnology & Digital* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Architecture BigQuery Computer Vision Docker ELK ETL GCP Generative AI Generative modeling Google Cloud Grafana Kubernetes LLMOps LLMs Machine Learning ML infrastructure Model deployment NLP Open Source Pipelines Python PyTorch RAG Security TensorFlow Transformers Vertex AI
Perks/benefits: Career development Relocation support
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.