Senior Machine Learning Engineer (Speech Synthesis)

What you will do

About Telnyx

Telnyx is an industry leader that's not just imagining the future of global connectivity—we're building it. From architecting and amplifying the reach of a private, global, multi-cloud IP network, to bringing hyperlocal edge technology right to your fingertips through intuitive APIs, we're shaping a new era of seamless interconnection between people, devices, and applications.
We're driven by a desire to transform and modernize what's antiquated, automate the manual, and solve real-world problems through innovative connectivity solutions. As a testament to our success, we're proud to stand as a financially stable and profitable company. Our robust profitability allows us not only to invest in pioneering technologies but also to foster an environment of continuous learning and growth for our team.
Our collective vision is a world where borderless connectivity fuels limitless innovation. By joining us, you can be part of laying the foundations for this interconnected future. We're currently seeking passionate individuals who are excited about the opportunity to contribute to an industry-shaping company while growing their own skills and careers.

The Impact You'll Drive

As a Senior ML Engineer (Speech Synthesis), you’ll be a founding member of the team building Telnyx’s next-generation speech synthesis systems. This is a greenfield opportunity — you’ll define the stack, architecture, and best practices for training and deploying state-of-the-art multilingual text-to-speech (TTS) models that power our voice AI agents.
You’ll build everything from distributed training pipelines to inference services that generate ultra-low-latency, lifelike voices across dozens of languages. Your work will bridge research and production — shaping how millions of people experience real-time conversational AI.

What You’ll Work On

Own the stack from day one: Design and implement the ML training and inference pipelines for multilingual speech synthesis.
Low-latency TTS: Engineer systems optimized for real-time, streaming speech generation with sub-100ms response targets.
Train cutting-edge models: Build and fine-tune multilingual TTS systems using modern architectures — including LLM-based, diffusion, and flow-matching approaches.
Massive-scale data processing: Develop pipelines for ingesting, aligning, and normalizing text, audio, and phonetic data across dozens of languages.
Experimentation at scale: Run distributed training across multi-node GPU clusters, tracking results and iterating quickly.
Cross-functional collaboration: Work with infrastructure and voice platform teams to deploy models that scale globally.
Research meets production: Evaluate emerging techniques (LLM-guided synthesis, zero/few-shot voice cloning, full-duplex modeling) and bring them to life in production-grade systems.

What You’ll Work With

Infrastructure: Docker, Kubernetes, Ray, Kubeflow, MLflow, Weights & Biases
Data Systems: Kafka, Redis, PostgreSQL, Parquet
You’ll define it: You’ll help select and implement the stack that supports distributed training, data processing, and inference for global deployment.

What we offer

Why Telnyx

You’ll be joining a company where voice, infrastructure, and AI converge. Telnyx is building the foundation for real-time, intelligent global communications — and your work on multilingual TTS will be at the core of that vision.

Requirements

What We’re Looking For

6+ years of experience in machine learning or speech systems engineering
Hands-on expertise with neural TTS, speech synthesis, or adjacent areas (ASR, voice cloning, multilingual modeling)
You’ve obsessed over one or two hard problems, whether it’s building multilingual TTS from noisy data, teaching LLMs to speak, designing self-supervised audio encoders, or making diffusion models run in real time.
Experience with LLM-based approaches to speech synthesis or prosody control
Strong proficiency in Python and PyTorch
Ability to deploy models efficiently (ONNX, TensorRT)
Experience leading small teams and defining technical direction or team executables
Production mindset: You build systems that run fast, stay stable, and are easy to maintain

Veröffentlicht	vor 8 Tagen
Läuft ab	in 22 Tagen

Senior Machine Learning Engineer (Speech Synthesis)

Status

Hexjobs Insights

Schlüsselwörter