Audio AI Engineer

Audio AI Engineer

Contact sales

We’d love to see how we can streamline your hiring together.

Request a demo
A black heart is floating in the air on a white background.
Contact sales

We’d love to see how we can streamline your hiring together.

Request a demo
A black heart is floating in the air on a white background.
Contact sales

We’d love to see how we can streamline your hiring together.

Request a demo
A black heart is floating in the air on a white background.

Job At-A-Glance

Blue and green globe icon showing Earth with the Americas visible

Silver analog clock icon showing time around 8:00

Blue and green globe icon showing Earth with the Americas visible

Anticipated Position Close Date: 23-Jun-2026

Large group of people in blue jackets gathered outdoors on a snowy rocky landscape

Excited to grow your career?


We value our talented employees, and whenever possible strive to help one of our associates grow professionally before recruiting new talent to our open positions. If you think the open position you see is right for you, we encourage you to apply!

Our people make all the difference in our success.

What you can expect

As an Audio AI Engineer, you will research and develop algorithms for accent conversion, voice conversion, speech synthesis, and speech recognition on low-latency streaming architectures. You’ll prototype and refine end-to-end audio models that enhance intelligibility and naturalness while maintaining speaker identity. Working closely with product and platform teams, you’ll help bring these models into real-time communication systems. You will also evaluate and optimize model performance across dimensions such as quality, latency, and scalability. Staying current with advances in speech processing, you’ll contribute to innovation through patents and internal knowledge sharing.

About the Team

Zoom's Audio team develops real-time audio features based on AI algorithms. Members of the team are spread worldwide, including the U.S., China and Singapore.

What we’re looking for

  • Hold a PhD or equivalent experience in a relevant field in Streaming, Accent Conversion, Voice Conversion, TTS, or ASR. More than 2 years of relevant industry experience considered a plus.

  • Show proficiency in deep learning frameworks like PyTorch or TensorFlow.

  • Demonstrate effective programming skills in Python, C/C++, or similar languages.

  • Have an understanding of sequence modeling architectures (Transformers, RNNs, diffusion models, or conformers).

  • Demonstrate experience developing and deploying low-latency, real-time speech or audio models with streaming architectures and optimized pipelines.

  • Show familiarity with model compression and acceleration techniques, including quantization, pruning, and distillation.

  • Exhibit experience working with real-time audio systems in networked communication environments.

  • Publish in top-tier conferences such as ICASSP, INTERSPEECH, NeurIPS, and ICLR.

Apply Now

Are You Ready?

Apply Now

We respect the privacy of candidates for employment. Before we get started, please review our Candidate Privacy Statement to understand ‘the personal data we process’, ‘how we use it’ and ‘duration of retention’. By submitting above, you acknowledge our Candidate Privacy Statement, and you provide consent for us to process your personal information for recruiting purposes.

Zoom Workplace promotional banner with blue logo on a soft pink-to-blue gradient background, featuring AI Companion icons.

Share this job

SCHEMA MARKUP ( This text will only show on the editor. )

Lorem Ipsum Et

Other Jobs You May Be Interested In

Job Alerts Lorem

Sign Up for

Instant Job Alerts

Find roles that are just the right fit for you, delivered straight to your inbox. The next opportunity you see could become your new career.

Create Job Alerts & Register
Alert Details
Read our Privacy Policy

Fraudulent Employment Offers

Zoom is aware of scams that involve fake Zoom job listings posted on third-party sites. Responding applicants are contacted primarily over email, InMail and/or chat applications by people impersonating Zoom employees. Eventually a fake offer letter is sent in exchange for personal identification information as part of a fake new-hire screening process.


Please be advised that these offers, communications and impersonations are illegitimate and fraudulent. All communication with Zoom employees come from an “@zoom.us” email address. Zoom job applicants complete an interview process including in-person (on Zoom) meetings and phone calls. Our process also requires you to create an account with our applicant tracking system, Workday. If you have already completed an application, you can access it here.


Zoom will never ask for your personally identifying information during the interview process or ask you to pay money or purchase equipment. If you have received a message from Zoom that appears suspicious, please contact careers@zoom.us.

Contact sales

We’d love to see how we can streamline your hiring together.

Request a demo
Black heart icon on a white background
Contact sales

We’d love to see how we can streamline your hiring together.

Request a demo
Black heart icon on a white background
Contact sales

We’d love to see how we can streamline your hiring together.

Request a demo
Black heart icon on a white background