Machine Learning Engineer

Greece - Remote

Omilia

Find out why Omilia was named a Leader in the 2023 Gartner Magic Quadrant™ for Enterprise Conversational AI Platforms for the second time in a row!

View all jobs at Omilia

Apply now Apply later

The Machine Learning Engineer will focus on designing, developing, and optimizing advanced TTS, voice cloning models, and voice-enabled Large Language Models (Voice LLMs) that combine the naturalness of TTS with the intelligence of LLMs. By leveraging state-of-the-art technologies such as scalable zero-shot synthesis, this role is integral to creating systems that redefine how machines generate and mimic human speech. You will play a pivotal role in pioneering innovations in speech synthesis and shaping the future of human-AI interaction.

Research and Development:

  • Research and develop state-of-the-art Text-to-Speech (TTS) models and voice-enabled Large Language Models (LLMs), focusing on naturalness, scalability, and zero-shot capabilities.
  • Implement voice cloning technologies to replicate specific voice characteristics while ensuring scalability and efficiency.
  • Design, develop, and optimize algorithms for neural vocoders, speaker adaptation, and scalable zero-shot TTS synthesizers.
  • Extend existing TTS architectures and contribute to developing/training new models for multiple languages, ensuring high-quality synthesis and speaker consistency.
  • Build and maintain data pipelines to preprocess, augment, and manage audio datasets for model training.
  • Collaborate with cross-functional teams to integrate TTS models into Conversational AI platforms.
  • Conduct performance tuning to minimize latency and enhance the scalability of TTS systems in production environments.
  • Stay up-to-date with advancements in LLM-based TTS, generative models, and speech signal processing to drive innovation in solutions.

Ownership:

  • Take full ownership of tasks and projects, from conceptualization to deployment, highlighting testing to ensure accountability and high-quality results.
  • Integrate software components into a fully functional software system.

Agile Methodologies & Collaboration

  • Actively participate in Agile software development processes, including sprint planning, daily stand-ups, and retrospectives to ensure timely and high-quality deliverables.
  • Work closely with cross-functional teams, including product managers, designers, and other engineers, to gather requirements and ensure alignment on project goals.
  • Participate in project planning, including research and development.
  • Contribute to the backlog of tasks with improvements and suggestions.
  • Implement Proof of Concepts (PoC) to introduce new solutions and ideas to the team.
  • Effectively manage time and meet deadlines.

Contribute actively and effectively as an integrated team member

  • Meet regularly with the line manager to review progress
  • Manage issue resolution and critically escalate
  • Work effectively with other teams, units, and departments
  • Manage issues with clarity and ensure effective information flow and team working
  • Support organization's other priority activities, when necessary
  • Act as an Omilia ambassador

Requirements

  • MSc degree in Computer Science, Engineering, or a related subject.
  • 2+ years of experience in speech synthesis development roles.
  • Ph.D. in a relevant field is a plus but not required.
  • Proven experience in developing AI-driven applications, particularly in speech synthesis, voice cloning, or related fields.
  • Strong understanding of state-of-the-art voice LLM techniques
  • Proficiency in Python and deep learning frameworks like PyTorch or TensorFlow.
  • Hands-on experience with TTS frameworks (e.g., Tacotron, FastSpeech, FastPitch, VITS) and neural vocoders (e.g., HiFi-GAN, WaveGlow, Vocos)
  • Hands-on experience with LLMs, Generative Models, Transformers and diffusion models.
  • Familiarity with zero-shot synthesis approaches and multi-speaker TTS systems.
  • Self-motivated and driven to create extraordinary things.
  • Ability to work under pressure and on strict deadlines.
  • Continuous innovation mindset.
  • Excellent written and oral communication skills in English.
  • Effective time management skills and the ability to meet deadlines.

Nice to have 

  • Experience with AWS cloud platform for scalable model deployment and monitoring.
  • Experience with NVIDIA Triton Inference Server.
  • Experience with MLOps practices

Benefits

  • Fixed compensation;
  • Long-term employment with the working days vacation;
  • Development in professional growth (courses, training, etc);
  • Being part of successful cutting-edge technology products that are making a global impact in the service industry;
  • Proficient and fun-to-work-with colleagues;
  • Apple gear.

Omilia is proud to be an equal opportunity employer and is dedicated to fostering a diverse and inclusive workplace. We believe that embracing diversity in all its forms enriches our workplace and drives our collective success. We are committed to creating an environment where everyone feels welcomed, valued, and empowered to contribute their unique perspectives without regard to factors such as race, color, religion, gender, gender identity or expression, sexual orientation, national origin, heredity, disability, age, or veteran status, all eligible candidates will be given consideration for employment.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  23  4  0

Tags: Agile Architecture AWS Computer Science Conversational AI Data pipelines Deep Learning Diffusion models Engineering Generative modeling LLMs Machine Learning MLOps Model deployment Model training Pipelines Python PyTorch Research Speech synthesis TensorFlow Testing Transformers

Perks/benefits: Career development

Regions: Remote/Anywhere Europe
Country: Greece

More jobs like this