-
AI Software Engineer
- Zoom (Seattle, WA)
-
AI Software Engineer
What you can expect
The AI Infra team at Zoom is dedicated to building a world-class inference infrastructure that powers all of Zoom’s AI services. Our mission is to deliver high efficiency, scalability, and cost optimization across a wide range of AI applications, including large language models (LLM), vision-language models (VLM), automatic speech recognition (ASR), and machine translation. We focus on creating a seamless collaboration between small and large models, ensuring cost-effective, privacy-preserving, and high-quality AI services for our customers.
About the Team
As an AI Software Engineer on Zoom’s AI Infra team, you will design, optimize, and scale the runtimes and services that power our AI models. Your work will directly improve efficiency, reduce latency, and lower costs across Zoom’s AI stack, ensuring reliable, high-performance AI experiences for millions of users.
Responsibilities
+ Develop and optimize AI runtimes for LLMs, ASR, and MT systems with a focus on performance and cost efficiency.
+ Apply GPU-level optimization techniques including CUDA, kernel fusion, and memory throughput improvements.
+ Implement inference optimizations such as TorchCompile, graph optimization, KV cache, and continuous batching.
+ Build scalable, highly available infrastructure services to support enterprise-grade AI workloads.
+ Optimize models for edge devices (laptops, PCs and mobile devices) as well as large-scale cloud deployments.
+ Continuously improve latency, throughput, and efficiency across serving pipelines.
+ Rapidly integrate and optimize new industry models to stay ahead in AI infrastructure.
What we’re looking for
+ Track record of building scalable, reliable AI infrastructure under real-world production constraints.
+ Strong expertise in GPU programming and optimization (CUDA, kernel-level development).
+ Deep experience with transformer-based models and inference frameworks (vLLM, TensorRT-LLM, SGLang, ONNX Runtime).
+ Proficiency in Python and C++ (Java is a plus).
+ Hands-on experience with PyTorch (TorchCompile, graph-level optimization) and/or TensorFlow.
+ Knowledge of low-level hardware concepts (GPU memory hierarchy, caching, vectorization).
+ Familiarity with cloud platforms (AWS, GCP, Azure) and AI deployment tools (Docker, Kubernetes, MLflow).
Salary Range or On Target Earnings:
Minimum:
$143 000,00
Maximum:
$312 800,00
In addition to the base salary and/or OTE listed Zoom has a Total Direct Compensation philosophy that takes into consideration; base salary, bonus and equity value.
Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.
We also have a location based compensation structure; there may be a different range for candidates in this and other locations
At Zoom, we offer a window of at least 5 days for you to apply because we believe in giving you every opportunity. Below is the potential closing date, just in case you want to mark it on your calendar. We look forward to receiving your application!
Anticipated Position Close Date:
12/31/25
Ways of WorkingOur structured hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote, or In-Person is indicated in the job description/posting.
BenefitsAs part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways. Click Learn (https://careers.zoom.us/benefits) for more information.
About UsZoomies help people stay connected so they can get more done together. We set out to build the best collaboration platform for the enterprise, and today help people communicate better with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars.We’re problem-solvers, working at a fast pace to design solutions with our customers and users in mind. Find room to grow with opportunities to stretch your skills and advance your career in a collaborative, growth-focused environment.
Our Commitment
At Zoom, we believe great work happens when people feel supported and empowered. We’re committed to fair hiring practices that ensure every candidate is evaluated based on skills, experience, and potential. If you require an accommodation during the hiring process, let us know—we’re here to support you at every step.
If you need assistance navigating the interview process due to a medical disability, please submit an Accommodations Request Form (https://form.asana.com/?k=OIuqpO5Tv9XQTWp1bNYd8w&d=1127274756253361) and someone from our team will reach out soon. This form is solely for applicants who require an accommodation due to a qualifying medical disability. Non-accommodation-related requests, such as application follow-ups or technical issues, will not be addressed.
-
Recent Jobs
-
AI Software Engineer
- Zoom (Seattle, WA)
-
Sr. Payroll Technical Analyst
- Sharp HealthCare (San Diego, CA)
-
Sr. Principal Systems Engineer
- Northrop Grumman (Aurora, CO)