1. About Our Client:
The organization operates in the text-to-speech technology space with a mission to remove reading barriers to learning. Its products enable over 50 million users to convert written content—such as PDFs, books, documents, news articles, and websites—into audio, facilitating faster, more efficient reading and better retention. The company offers a suite of applications across iOS, Android, Mac, Chrome, and web platforms. Recognized by Google as Chrome Extension of the Year and by Apple for inclusivity in 2025, the organization operates with a fully distributed team of nearly 200 employees worldwide, including engineers, AI researchers, and professionals from leading tech firms and academic programs.
2. About the Opportunity:
The Software Engineer, Data Infrastructure & Acquisition is responsible for managing all aspects of data collection that support AI model training. This role focuses on developing and maintaining scalable, cost-effective data pipelines and infrastructure to enable the creation of high-quality datasets at petabyte scale. The position plays a key role in collaborating with data scientists and leadership to develop data strategies that power next-generation AI products, contributing directly to the organization’s ability to innovate and deliver impactful text-to-speech solutions.
3. Responsibilities:
• Identify and integrate new audio data sources into the ingestion pipeline
• Operate and enhance cloud infrastructure for data ingestion, primarily on GCP using Terraform
• Collaborate with scientists to improve data quality, scale, and cost efficiency
• Work with the AI team and leadership to design the dataset roadmap for future products
4. Requirements:
• BS, MS, or PhD in Computer Science or related field
• 5+ years of software development experience
• Proficiency in bash and Python scripting in Linux environments
• Experience with Docker, Infrastructure-as-Code, and at least one major cloud platform (preferably GCP)
• Familiarity with web crawlers and large-scale data processing workflows is a plus
• Ability to manage multiple priorities and adapt to change
• Strong written and verbal communication skills
5. Pay Range and Compensation Package:
• United States base salary range: $140,000 to $200,000 plus bonus and equity, depending on experience
Equal Opportunity Statement: Our client is an equal opportunity employer. They celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, or national origin.
Note:
RemoteHunter is not the Employer of Record (EOR) for this role. Our purpose in this opportunity is to connect exceptional candidates with leading employers. We help job seekers worldwide discover roles that match their goals and guide them to complete their full application directly through the hiring company’s career page or ATS.