Are you a seasoned Data Scientist with deep expertise in Text-to-Speech (TTS) and Speech-to-Text (STT) technologies? Ready to shape the future of voice AI and build systems that power global real-time communication? WorkForce International is proud to partner with one of the most exciting startups in the conversational AI space to find a passionate and talented Senior Data Scientist – Speech AI to join their growing AI core team.
Our client is a rising global startup transforming human-AI interaction. As the creators of Kalimera.ai — the world’s leading virtual voice assistant for call centers and real-time communication — they are redefining how businesses engage with customers. With offices in Cyprus, Greece, and India, they’re scaling rapidly and using cutting-edge innovations in TTS, STT, and large language models to lead the next wave of voice automation.
Key Responsibilities:
As a key member of the speech AI team, you’ll work on challenging, meaningful problems with real-world impact:
- Design, train, and optimize TTS models (e.g., Tacotron, FastSpeech, WaveNet) to deliver high-quality, multilingual voice synthesis.
- Enhance STT pipelines with a focus on real-time accuracy, speaker diarization, and robust transcription for diverse accents and environments.
- Collaborate closely with engineering and product teams to deploy models at scale in high-availability environments.
- Lead research and benchmarking of voice models for performance, naturalness, and use-case fit (call centers, voice assistants, etc.).
- Optionally integrate LLMs to boost STT performance with capabilities like sentiment analysis, summarization, and intent detection.
- Influence architecture decisions and mentor peers as the team expands.
Your Profile:
- 5+ years of hands-on experience in AI/ML with a strong focus on TTS and/or STT systems.
- Proficient in Python and deep learning libraries such as PyTorch, TensorFlow, torchaudio, librosa, etc.
- Practical experience with speech frameworks like ESPnet, NVIDIA NeMo, Coqui TTS, or similar.
- Solid knowledge of speech signal processing, neural vocoders, and real-time system optimization.
- Experience with deploying models in cloud-native or Kubernetes-based environments.
- Strong collaboration, communication, and documentation skills.
- Bonus: Experience working with LLMs or NLP tools in speech-related applications.
Why this role stands out:
- Shape the voice of tomorrow: Be part of a mission-driven team that’s already powering real-world enterprise deployments through Kalimera.ai.
- Innovate at the edge of AI: Work with a cutting-edge stack that combines speech and language models in real-time environments.
- Grow your impact: Take ownership in a high-visibility role, with a chance to lead and mentor as the AI team scales.
- Work globally, collaborate deeply: Join a distributed, multicultural team spanning Europe and Asia with ambitious plans for global expansion.
This role is exclusively managed by WorkForce International, a premier global talent partner. We specialize in connecting top-tier professionals with innovative tech companies shaping the future.
Ready to make your mark? Apply now and let’s talk about your next big move.