ML Production Engineer - Generative AI

We are looking for an ambitious ML Engineer to play a leading role in scaling & taking our ML models to production in both low-latency cloud & on-device settings. You’ll also collaborate with Vatsal (ex Amazon Alexa & Cambridge) to train, fine-tune & optimise the generative ML models at the core of our products.

You’ll be the first full-time ML hire at a well-funded startup and play a critical role in shaping the future of MetaVoice as a core member of the founding team. We’re particularly interested in candidates who have been founders themselves or built impressive side projects.

If you think you have what it takes to be a part of an ambitious & high-performing team, please share links to GitHub code you've written, contributions you've made & papers you’ve published. Check out our Notion careers page for more information & FAQs on MetaVoice.

KEY RESPONSIBILITIES

  • You’ll be working on latency-critical applications where ML modules run on the edge in real time.
  • You will create and train highly performant new models & training pipelines from scratch.
  • You will optimise trained internal and open-source models for device-specific architectures across Apple, NVIDIA, Intel, AMD, and other platforms.
  • You will be writing real-time audio pipelines in low-level code for both Windows and Mac.
  • Work with ML Audio or Digital Signal Processing techniques to analyse, clean, segment & filter speech data
  • Participate in research activities, including the application and evaluation of generative voice & speech-to-speech techniques
  • Research and implement novel ML and statistical approaches to add value to the business.

BASIC REQUIREMENTS

  • PhD in fields such as Deep Generative Models, STS, Deep Learning, TTS, ASR, NLU. Bachelor’s/Master’s degree considered with existing applied experience in industry
  • Deep knowledge in fields such as Voice Conversion, Deep Generative Models, Machine Learning, Deep Learning, TTS, ASR, NLU or Statistical Modelling
  • Hands-on experience with machine learning frameworks such as PyTorch, Keras, TensorFlow
  • Deep experience with techniques to speed up inference, reduce model memory consumption & reduce reliance on future context for streaming settings
  • Significant experience with Transformer architectures, diffusion models, and other generative models
  • Experience deploying and managing ML models in an on-device setting: MPS/ANE on Apple Silicon & NVIDIA GPUs
  • Significant experience with Python & Rust or C/C++
  • 4 years of applied research experience
  • Creative thinker & problem solver who can execute independently & quickly with a bias for action
  • An unrelenting desire to build world-class products which delight users
  • Outstanding written, spoken & interpersonal communication skills

PREFERRED REQUIREMENTS

  • Extensive experience of applied research, ideally developing voice conversion, speech synthesis and natural language processing models.
  • PhD with specialisation in voice conversion, text-to-speech, natural language processing, or machine learning.
  • Scientific thinking and the ability to invent, with a track record of thought leadership and contributions that have advanced the field.

MetaVoice, Engineering, Full-time, Hybrid, London Office