ML Production Engineer - Generative AI

We are looking for an ambitious ML Engineer to play a leading role in scaling & taking our ML models to production in both low-latency cloud & on-device settings. You’ll also collaborate with Vatsal (ex-Amazon Alexa & Cambridge) to train, fine-tune & optimise the generative ML models at the core of our products.

You’ll be the first full-time ML hire at a well-funded startup and play a critical role in shaping the future of MetaVoice as a core member of the founding team. We’re particularly interested in candidates who have been founders themselves or built impressive side projects.

If you think you have what it takes to be part of an ambitious & high-performing team, please share links to GitHub code you've written, contributions you've made & papers you’ve published. Check out our Notion careers page for more information & FAQs on MetaVoice.

KEY RESPONSIBILITIES

You’ll be working on latency-critical applications where ML modules run on the edge in real time.
You will create and train highly performant new models & training pipelines from scratch.
You will optimise trained internal and open-source models for device-specific architectures across Apple, NVIDIA, Intel, AMD, and other platforms.
You will write real-time audio pipelines in low-level code for both Windows and Mac.
Work with ML audio or digital signal processing techniques to analyse, clean, segment & filter speech data.
Participate in research activities, including the application and evaluation of generative voice & speech-to-speech techniques.
Research and implement novel ML and statistical approaches to add value to the business.

BASIC REQUIREMENTS

PhD in fields such as Deep Generative Models, STS, Deep Learning, TTS, ASR, NLU. Bachelor’s/Master’s degree considered with existing applied experience in industry
Deep knowledge in fields such as Voice Conversion, Deep Generative Models, Machine Learning, Deep Learning, TTS, ASR, NLU or statistical modelling
Hands-on experience with machine learning frameworks such as PyTorch, Keras, TensorFlow
Deep experience with techniques to speed up inference, reduce model memory consumption & reduce reliance on future context for streaming settings
Significant experience with Transformer architecture, diffusion models, and other Generative models
Experience deploying and managing ML models in an on-device setting: MPS/ANE on Apple Silicon & NVIDIA GPUs
Significant experience with Python & Rust or C/C++
4 years of applied research experience
Creative thinker & problem solver who can execute independently & quickly with a bias for action
An unrelenting desire to build world-class products which delight users
Outstanding written, spoken & interpersonal communication skills

PREFERRED REQUIREMENTS

Extensive applied research experience, ideally developing voice conversion, speech synthesis and natural language processing models.
PhD with specialisation in voice conversion, text-to-speech, natural language processing, or machine learning.
Scientific thinking and the ability to invent, with a track record of thought leadership and contributions that have advanced the field.
