ClanX

Senior Data Scientist

Location

Remote restrictions apply

Salary Estimate

N/A

Seniority

Senior

Tech stacks

Research
AI
Python
+18

Permanent role
7 days ago

Requirements

• 3+ years of applied or academic experience in speech, multimodal, or LLM research

• Bachelor’s or Master’s in Computer Science, AI, or Electrical Engineering

• Strong in Python and scientific computing, including JupyterHub environments

• Deep understanding of LLMs, transformer architectures, and multimodal embeddings

• Experience in speech modeling pipelines: ASR, TTS, speech-to-speech, or audio-language models

• Knowledge of turn-taking systems, VAD, prosody modeling, and real-time voice synthesis

• Familiarity with self-supervised learning, contrastive learning, and Agentic Reinforcement Training (ART)

• Skilled in dataset curation, experimental design, and model evaluation

• Comfortable with tools like Agno, Pipecat, HuggingFace, and PyTorch

• Exposure to LangChain, vector databases, and memory systems for agentic research

• Strong written communication and clarity in presenting research insights

• High research curiosity, independent ownership, and mission-driven mindset

• Currently employed at a product-based organisation

Responsibilities

• Research and develop direct speech-to-speech modeling using LLMs and audio encoders/decoders

• Model and evaluate conversational turn-taking, latency, and VAD for real-time AI

• Explore Agentic Reinforcement Training (ART) and self-learning mechanisms

• Design memory-augmented multimodal architectures for context-aware interactions

• Create expressive speech generation systems with emotion conditioning and speaker preservation

• Contribute to SOTA research in multimodal learning, audio-language alignment, and agentic reasoning

• Define the long-term AI research roadmap with the Research Director

• Collaborate with MLEs on model training and evaluation, while leading dataset and experimentation design

Job Details

Location: Hybrid in Mumbai, Bengaluru, or Chennai, India (optional, in case you are looking for an onsite role)

Interview process

• Screening / HR round

• Technical round(s) — coding, system design, ML case studies

• ML / research deep dive

• Final / leadership round

