Staff Software Engineer, Speculative Decoding

Location

Remote restrictions apply

See all remote locations

Salary Estimate

N/A

Seniority

Staff

Tech stacks

Software Development

Algorithm

+20

Visa

U.S. visa required

Permanent role

a month ago

Apply now

About Jobgether

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

One of our companies is currently looking for a Staff Software Engineer, Speculative Decoding in California.

We're looking for a seasoned engineer with deep experience in Generative AI inference and a strong command of speculative decoding techniques. In this role, you'll be responsible for developing high-performance, scalable algorithms that enhance speed and accuracy within production-level AI systems. Working in a multi-data center Kubernetes environment, you'll help design and integrate state-of-the-art decoding methods while driving performance improvements across the inference stack. If you're passionate about transforming leading-edge AI research into production-ready solutions and mentoring others while doing so, this is the ideal opportunity.

Accountabilities:

Design and implement speculative decoding algorithms to enhance Generative AI inference performance and efficiency
Optimize system architecture and software infrastructure for real-time, large-scale AI model deployment
Develop and maintain high-performance codebases in C++ and Rust for production-grade distributed systems
Work within a multi-process, Kubernetes-based environment utilizing technologies such as MPI
Partner with software, research, and operations teams to improve model evaluation, post-training processes, and system scalability
Translate recent advancements in AI and speculative decoding into practical, robust implementations
Provide technical leadership and contribute to a culture of innovation, mentorship, and continuous improvement

Requirements

Master's degree in Computer Science, Electrical Engineering, or equivalent practical experience
5+ years of hands-on experience in generative AI inference, particularly with speculative decoding
Expertise in C++ with a proven record of building high-performance, distributed systems
Familiarity with PyTorch and performance evaluation methodologies for generative models
Deep understanding of AI infrastructure challenges, model architecture, and scalable deployment
Proficiency with cloud-native tools, Kubernetes environments, and inter-process communication
Strong problem-solving abilities, creativity, and collaboration skills in a fast-paced setting

Benefits

Competitive base salary between $175,900 and $307,800 (based on experience)
Equity participation
Comprehensive health and wellness benefits
Flexible work environment with potential site-based requirements
Continuous learning and growth opportunities
Inclusive culture committed to diversity, equity, and belonging
Opportunity to work at the forefront of AI innovation

Jobgether hiring process disclaimer

This job is posted on behalf of one of our partner companies. If you choose to apply, your application will go through our AI-powered 3-step screening process, where we automatically select the 5 best candidates.

Our AI thoroughly analyzes every line of your CV and LinkedIn profile to assess your fit for the role, evaluating each experience in detail. When needed, our team may also conduct a manual review to ensure only the most relevant candidates are considered.

Our process is fair, unbiased, and based solely on qualifications and relevance to the job. Only the best-matching candidates will be selected for the next round.

If you are among the top 5 candidates, you will be notified within 7 days.

If you do not receive feedback after 7 days, it means you were not selected. However, if you wish, we may consider your profile for other similar opportunities that better match your experience.

Thank you for your interest!

About Jobgether

🔗Website

Visit company profile

Unlock all Arc benefits!

Browse remote jobs in one place
Land interviews more quickly
Get hands-on recruiter support

PRODUCTS

Arc

The remote career platform for talent

Codementor

Find a mentor to help you in real time

LINKS

About us Pricing Arc Careers - Hiring Now!Remote Junior Jobs Remote jobs Career Success Stories Talent Career Blog Arc Newsletter

JOBS BY EXPERTISE

Remote Front End Developer Jobs Remote Back End Developer Jobs Remote Full Stack Developer Jobs Remote Mobile Developer Jobs Remote Data Scientist Jobs Remote Game Developer Jobs Remote Data Engineer Jobs Remote Programming Jobs Remote Design Jobs Remote Marketing Jobs Remote Product Manager Jobs Remote Project Manager Jobs Remote Administrative Support Jobs

JOBS BY TECH STACKS

Remote AWS Developer Jobs Remote Java Developer Jobs Remote Javascript Developer Jobs Remote Python Developer Jobs Remote React Developer Jobs Remote Shopify Developer Jobs Remote SQL Developer Jobs Remote Unity Developer Jobs Remote Wordpress Developer Jobs Remote Web Development Jobs Remote Motion Graphic Jobs Remote SEO Jobs Remote AI Jobs

Cookie Policy Privacy Policy Terms of Service