Role Overview
Join an innovative team building an ultra-low-latency, broadcast-to-transcription pipeline on Linux. We are looking for an expert ASR (Automatic Speech Recognition) Developer to own the critical layer where raw PCM audio signals are converted into real-time text streams.
This is not a standard AI implementation role. You will be working at the intersection of hardware and software, integrating clean audio output from an FPGA pipeline and optimizing the entire stack for deterministic, low-latency performance.
Core Responsibilities
- Pipeline Architecture: Build and optimize the end-to-end PCM-to-real-time-text streaming layer.
- Performance Tuning: Heavily optimize for latency and memory usage. You will be responsible for tuning buffering behaviors and model inference to ensure instantaneous results.
- GPU Optimization: Manage and optimize GPU acceleration to ensure high-speed processing without bottlenecks.
- Hardware Integration: Work with raw, uncompressed PCM data coming directly from a specialized FPGA hardware pipeline.
- Hardening: Test, debug, and harden the system to ensure it meets the rigorous demands of 24/7 real-time broadcast operations.
Nice-to-Haves (The "Standout" Candidate)
- Network Engineering: Background in high-performance networking or broadcast distribution.
- Hardware Expertise: Experience working with GPU acceleration (CUDA) and interfacing with FPGA/DSP hardware.
- Broadcast Knowledge: Understanding of contribution feeds and broadcast signal topology.