Senior LLM Engineer

Worldwide

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Apply now Apply later

Our Vision & Products

🚀 EverAI — Building the Future of AI Companionship

One of the Top 15 Largest & Fastest-Growing AI Companies in the World

30+ Million Users in under 2 years — Help Us Reach 100M first, 500M next

At EverAI, we’re shaping what it means to connect with AI. With 30+ million users and counting, we're not just building products — we're creating entirely new categories.

Our flagship product is the world's largest AI girlfriend/boyfriend platform, redefining relationships for millions. And we’re only just getting started.

Up next? We’re scaling our second product to revolutionize the creator economy. Think best-in-class AI content engines for video and image generation — designed to put world-class tools in every creator’s pocket.

All of this is governed by our proprietary moderation system, EverGuard — an internal AI designed to ensure everything we build is safe, ethical, and human-first.

Our Team

We are an enthusiastic, passionate and hardworking team of 55+ people. Our founding team has strong entrepreneurial experience building and scaling web products from 0 to IPO.

Alexis Soulopoulos [CEO]

• 10+ years in Tech Executive Leadership

• Co-Founder Mad Paws Holdings (from 0 to IPO)

• Forbes 30 under 30 + Deloitte TechFast50 ’22 & ‘23

Michael Monin [Co-founder & CTO]

• 10+ years as CTO / COO (web2/web3), 1+ year in AI/LLM

• Serial-entrepreneur: MTK Digital (exited / 0->$20m revenue) and Zipchat (AI Chatbot for E-commerce brands)

Thomas Lacroix [Co-founder & CMO]

• 8+ years in Customer Acquisition & E-commerce Growth

• Serial-entrepreneur: Curatible (sold to Blackstone) and MTK Digital (exited / 0->$20m revenue)

Maruša Fasano [CFO/Legal]

• 25+ years in Finance, Strategy, M&A

• Ex-CFO/M&A @Curatible (exited to Blackstone)

• Ex-President of the Board @SotremoSA (exited)

• Co-founder/CFO @SoftOne (exited)

Your Role

🚀 Architect the Future of AI Relationships

As our LLM Engineer, you'll fine-tune and optimize large language models that power conversations for over 30 million users, processing more than 5 million messages daily. You'll be at the forefront of developing AI companionship technology that scales globally while maintaining personalized and meaningful interactions.

Key Responsibilities

  • Interact with stakeholders (Co-founders, Web Engineers, DevOps Engineers) to bring your project to life.

  • Oversee the creation and optimization of algorithms for LLM behavior adjustments based on user interactions, focusing on fine-tuning and prompt engineering.

  • Develop features to improve the richness of the product (multi-character chats, gamification, etc)

  • In addition to chat, interacting with modalities managed by other team members (audio, image, video), and collaborating with them

  • Adaptation and fine-tuning of base models for multilingual support

  • Manage the creation and maintenance of diverse datasets critical for training and improving the performance of LLMs.

  • Assess and determine the best technological approaches, selecting between classifiers, fine-tuning, and other methods based on the specific project's needs.

Your Qualifications

Must-Haves

  • Python Mastery: 5+ years building production‑grade, modular, maintainable codebases

  • LLM Architecture Expertise: Deep understanding of transformers and their training dynamics (attention, positional encodings, samplers, tokenizers, post-training, reasoning LM)

  • Inference Optimization at Scale: Expert with vLLM / TensorRT‑LLM (or similar); proven record of reducing latency and memory via quantization and/or distillation

  • Distributed Training: Hands‑on multi‑GPU / multi‑node fine‑tuning using FSDP, DeepSpeed, or accelerate; comfortable with mixed‑precision, gradient checkpointing, and memory‑aware scheduling

  • Performance Profiling & Optimization: Skilled at identifying and resolving compute or memory bottlenecks across CPU/GPU pipelines with industry‑standard profiling workflows

Nice‑to‑Haves

  • Concurrency & Runtime Engineering: Strong with asyncio, multiprocessing, or equivalent backend/batch‑scheduling patterns

  • Low‑level Systems: Practical CUDA / Triton experience; able to write or debug custom kernels

  • Open‑Source Impact: Contributor to core LLM tooling (vLLM, HF Transformers, Triton, etc.)

  • Real‑time Deployments: Built or maintained latency‑critical, multi‑user LLM services (RAG, streaming, agents, chatbots)

  • Specialized Generation Use Cases: Exposure to erotic role playing, multi‑turn instruction tuning, or non‑English quality alignment

Soft Skills

🗣 Strong communication & collaborative skills (perfectly fluent in English)

🎯 Goal-oriented, ownership and commitment

⚡️ Doer mindset - we are moving fast and we need people who can find the right balance between executing, planning and strategy

🧢 Humble - willing to learn, open to feedback

🍭 #NSFW - you are comfortable building products that are based on uncensored models and content

Why EverAI?

📈 Exponential Growth: From 30M+ users in 18 months, to 100M next — and 500M beyond

🚀 Track Record of Category-Creating Innovation: We consistently launch world-first AI applications — setting the pace, not following it

🌍 Global Impact: Top-tier user growth, real-world adoption, and cultural relevance

🧠 Proven Leadership: A senior team that’s launched, scaled, and exited & IPO’d multiple scale ups — now fully focused on reshaping AI companionship

👥 Elite Remote Team: 100% remote and built to win — world-class talent from Tier 1 tech companies, with a culture of ownership, velocity, and radical creativity

🛡️ Ethical Core: Our AI ecosystem is governed by EverGuard, our proprietary AI moderation technology, ensuring responsible development at scale

What We Offer

✍️ We prefer a B2B contract but we can be flexible, as long as you’re in it for the long haul

📍 Full-remote (you work from the place that suits you best)

🏝️ 4 weeks PTO

👨‍👩‍👧‍👦 Annual gathering to get to know each other better

💆‍♀️ Wellbeing budget up to 200$

📚 Learning budget

💻 Company laptop

⚡️ GPT-4, Mistral and Hugging Face Pro plan

🎯 Top Tier Talent Is Our Multiplier

We’re a fully remote group of A-players from Tier 1 tech, led by an exec team who’ve launched, scaled, and exited multiple companies. We move fast, and care deeply about what we build — and who we build it with.

We’re looking for exceptional talent ready to ship & distribute world-first AI products at scale, fast, and co-create with us this category-defining business.

If that’s you — reach out and apply!

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  2  0  0
Category: Engineering Jobs

Tags: AI content Architecture Chatbots CUDA DevOps E-commerce Engineering Finance FSDP GPT GPT-4 GPU LLMs Pipelines Prompt engineering Python RAG Streaming TensorRT Transformers vLLM

Perks/benefits: Career development Flex vacation Gear

Region: Remote/Anywhere

More jobs like this