Software Engineer - Inference squad (AI Tribe)

Paris

Scaleway

Build, train, deploy and scale AI models and intelligent applications on a resilient and sustainable cloud ecosystem.

View all jobs at Scaleway

Apply now Apply later

OUR STORY:
🇪🇺 Join Scaleway and shape the sovereign cloud of tomorrow !Since 1999, we have been designing secure, sustainable infrastructures aimed at supporting the most ambitious companies.
Historically known for our dedicated servers (Dedibox), we made a strategic shift to cloud computing in 2015. Staying true to our principles of simplicity, flexibility, and technical excellence, we have become one of the leading players in Europe in the sector.
With the rise of artificial intelligence, we have strengthened our commitment, supported by the Iliad Group, which is investing €3 billion to develop a serious, sovereign AI alternative to American and Asian giants.
Every day, thanks to our rich catalog of products and services (bare metal, containerization, serverless, AI, etc.), Scaleway proudly serves 38,000 private and public sector clients, from Photoroom to Mistral AI, Golem AI, and ADEME.
📍 Our offices are located in Paris, Lille, Toulouse, Bordeaux, and Lyon.

🚀 WHY WE NEED YOU?Our growth is driving us to strengthen our AI Infrastructure team to support our next-generation inference platform. Your mission will be to build and operate production-grade DevOps infrastructure for AI workloads, in order to enable the deployment and scaling of LLMs and generative AI applications on a sovereign European cloud.
YOUR FUTURE TEAMWe work in a collaborative and international environment where the diversity of Scalers, combined with a spirit of sharing, helps bring new projects to life every day, advancing our ambitions together. You will be part of a team of 4 DevOps engineers, working closely on backend and infrastructure topics related to AI inference. The team is part of a broader AI tribe focused on two strategic products: - Managed Inference, a platform to deploy, scale, and monitor AI models in production - Generative APIs, a unified API layer to access cutting-edge generative models (LLMs, Diffusion, etc.)
Manager information: You will report to Grégoire de Turckheim, who has been with Scaleway for over 14 years. He brings a unique perspective to team leadership, having evolved through several roles in the company, including engineering, product management, logistics, procurement, and strategic partnerships. His cross-functional background helps foster a culture of ownership, agility, and long-term vision.
🗓️ YOUR DAILY ROUTINE INCLUDES: - Build and operate infrastructure for serving LLMs and other generative models at scale - Design, deploy, and maintain Kubernetes-based services optimized for AI inference - Develop in Golang to build robust and efficient backend services - Optimize serving stacks using tools like vLLM, Triton, and CUDA - Integrate open-source AI tooling such as Hugging Face, KServe, or custom components - Participate in architecture discussions and make high-impact technical decisions - Keep up with the fast-paced evolution of AI serving ecosystems - Troubleshoot and resolve complex infrastructure and deployment issues - Collaborate with internal teams to integrate new AI features - Contribute to code reviews and knowledge sharing within the team
🔍 ABOUT YOU:
HARDSKILLSStrong experience with Golang in production environmentsProficiency with Kubernetes and container orchestration toolsFamiliarity with AI inference stacks (e.g., vLLM, Triton, CUDA)Hands-on experience with deploying and scaling ML modelsSolid understanding of backend architectures and DevOps practicesSOFTSKILLSExtremely fast learner with high adaptability to new technologiesStrong curiosity and willingness to dive deep into emerging AI toolsProactive mindset and readiness to take initiativeTechnical autonomy and ownership of your scopeClear communicator, capable of explaining complex systems simply
 WHAT YOU WILL FIND AT SCALEWAY ?
Hybrid work: Up to 3 days remote per weekModern offices: Creative, ergonomic workspaces in prime locations with terraces and bike facilitiesHealthy dining: Chef-prepared meals at HQ; Swile card for regional sitesWell-being: Support for gym memberships, childcare, and caregiver servicesDiverse culture: English is as common as French; teams span many nationalitiesGrowth mindset: Strong internal mobility and opportunities across the Iliad Group
🚀 Why join the Scaleway adventure ?  
✔ A rich and diverse product offering: Scaleway offers over 100 public cloud products in IaaS, PaaS, AI.✔ A cutting-edge technical environment: Scaleway provides modern infrastructures, including high-performance bare metal servers, to tackle exciting technical challenges.✔ Commitment to responsible cloud: Scaleway is dedicated to a more responsible cloud, with data centers powered solely by renewable energy since 2017, minimizing our ecological footprint and holding top-level certification.
🔜 The next steps ?
Discovery call with a recruiter (30 min)Technical interview to validate your expertise (1h)Operational Problem-SolvingInterview with the Head of the Tribe to deepen your discussions and assess your fit with the team (45 min)HR interview to tour our offices and meet your future colleagues
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  2  1  0

Tags: APIs Architecture CUDA DevOps Engineering Generative AI Generative modeling Golang KServe Kubernetes LLMs Machine Learning ML infrastructure Open Source vLLM

Perks/benefits: Startup environment

Region: Europe
Country: France

More jobs like this