Excited to grow your career?

We value our talented employees, and whenever possible strive to help one of our associates grow professionally before recruiting new talent to our open positions. If you think the open position you see is right for you, we encourage you to apply!

Our people make all the difference in our success.

The Team

You will join a dynamic AI Infrastructure team focused on enabling high-performance AI across Zoom’s products and services. The team builds the core systems that support model training, deployment, and inference at scale, driving innovation in areas such as real-time communication, computer vision, and natural language understanding.

What You Can Expect

You'll design, implement, and own the inference systems that serve Zoom's AI models at production scale, across real-time communication, vision, and language workloads. You'll be hands-on with kernel-level optimisation, inference framework internals, and production serving infrastructure, working closely with research and platform teams to push the boundary on latency, throughput, and cost.

Responsibilities

Design and build high-performance inference serving systems for large-scale transformer and multimodal models (including 100B+ and MoE architectures)
Implement and tune inference optimisations: speculative decoding, continuous batching, KV cache management, prefill/decode disaggregation, and quantisation (INT4/INT8/FP8)
Contribute to and customise inference frameworks (vLLM, TensorRT-LLM, SGLang, or equivalent) for Zoom's production requirements
Write and profile CUDA kernels and custom ops where framework-level optimisation is insufficient
Own end-to-end deployment: from model packaging and serving API design to latency SLO monitoring and incident response
Partner with research to translate model architecture changes into inference-efficient implementations
Drive technical design and set the bar for inference eng practices across the team

What We're Looking For

A Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related technical field
Advanced degrees (Master’s or PhD) are advantageous
5+ years of software engineering experience, with significant time spent on inference systems or ML infrastructure at production depth
Hands-on experience with at least one major inference framework: vLLM, TensorRT-LLM, SGLang, or ONNX Runtime (serving, not just export)
GPU programming experience: CUDA kernel development, memory optimisation, profiling with Nsight or equivalent
Production experience serving LLMs or large vision models, you've owned latency SLOs, debugged throughput regressions, and shipped optimisations that moved the needle
Depth in at least two of: speculative decoding, continuous batching, KV cache design, quantisation pipelines, prefill/decode disaggregation
Strong systems instincts in Python and C++; ability to read and modify framework internals

Preferred:

Experience with MoE models or 100B+ parameter deployments
Familiarity with disaggregated serving architectures or multi-node inference
Background in compiler-level optimisation (XLA, Triton, or similar)

Salary Range or On Target Earnings:

Minimum:

$151,800.00

Maximum:

$332,200.00

In addition to the base salary and/or OTE listed Zoom has a Total Direct Compensation philosophy that takes into consideration; base salary, bonus and equity value.

Information about Zoom’s benefits is on our careers page here .

Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.

We also have a location based compensation structure; there may be a different range for candidates in this and other locations.

Good news – this job posting is more like a marathon, not a sprint, so it could be available for a while! We're on the lookout for awesome folks to join Zoom in various similar roles. No need to rush, just hit us up whenever you're ready to apply. We're always keeping an eye out for amazing talent!

Our interviews are supported by BrightHire, a tool that helps us create a consistent and thoughtful interview experience and may include recordings. Please refer to our candidate privacy statement for more information of how we use your data.

Create Job Alerts & Register

First Name:

Last Name:

Email:

Password:

Password must contain the following: A capital (uppercase) letter A number Minimum 8 characters Password must contain special chars ex: !@#$%^&*?

----- or -----

Email:

Alert Details

Alert Name:

Alert Keyword:

Work Type:

Professions:

I agree to the terms of this site Read our Privacy Policy

Zoom is aware of scams that involve fake Zoom job listings posted on third-party sites. Responding applicants are contacted primarily over email, InMail and/or chat applications by people impersonating Zoom employees. Eventually a fake offer letter is sent in exchange for personal identification information as part of a fake new-hire screening process.

Please be advised that these offers, communications and impersonations are illegitimate and fraudulent. All communication with Zoom employees come from an “@zoom.us” email address. Zoom job applicants complete an interview process including in-person (on Zoom) meetings and phone calls. Our process also requires you to create an account with our applicant tracking system, Workday. If you have already completed an application, you can access it here.

Zoom will never ask for your personally identifying information during the interview process or ask you to pay money or purchase equipment. If you have received a message from Zoom that appears suspicious, please contact careers@zoom.us.

AI Software Engineer

Contact sales

Request a demo

Contact sales

Request a demo

Contact sales

Request a demo

Job At-A-Glance

Are You Ready?

Share this job

Other Jobs You May Be Interested In

Job Alerts Lorem

Sign Up for

Instant Job Alerts

Fraudulent Employment Offers

Contact sales

Request a demo

Contact sales

Request a demo

Contact sales

Request a demo

Imagine what you can build when you belong.

Imagine what you can achieve with Zoom

Imagine what you can build when you belong.

Imagine what you can beyond the meeting

Resources to help you imagine, prepare, and grow

AI Software Engineer

Contact sales

Request a demo

Contact sales

Request a demo

Contact sales

Request a demo

Job At-A-Glance

Are You Ready?

Share this job

Other Jobs You May Be Interested In

Job Alerts Lorem

Sign Up for

Instant Job Alerts

Fraudulent Employment Offers﻿

Contact sales

Request a demo

Contact sales

Request a demo

Contact sales

Request a demo

Imagine what you can build when you belong.

Imagine what you can achieve with Zoom

Imagine what you can build when you belong.

Imagine what you can beyond the meeting

Resources to help you imagine, prepare, and grow

Fraudulent Employment Offers

Imagine what you can beyond the meeting