Interested in this position?
Upload your resume and we'll match you with this and other relevant opportunities.
Upload Your ResumeAbout This Role
This role involves building a world-class inference infrastructure to power Zoom's AI services, focusing on high efficiency, scalability, and cost optimization across various AI applications. The AI Software Engineer will design, optimize, and scale runtimes and services for AI models to ensure reliable, high-performance AI experiences for millions of users.
Responsibilities
- Develop and optimize AI runtimes for LLMs, ASR, and MT systems with a focus on performance and cost efficiency
- Apply GPU-level optimization techniques including CUDA, kernel fusion, and memory throughput improvements
- Implement inference optimizations such as TorchCompile, graph optimization, KV cache, and continuous batching
- Build scalable, highly available infrastructure services to support enterprise-grade AI workloads
- Optimize models for edge devices (laptops, PCs and mobile devices) as well as large-scale cloud deployments
- Continuously improve latency, throughput, and efficiency across serving pipelines
- Rapidly integrate and optimize new industry models to stay ahead in AI infrastructure
Requirements
- Track record of building scalable, reliable AI infrastructure under real-world production constraints
- Strong expertise in GPU programming and optimization (CUDA, kernel-level development)
- Deep experience with transformer-based models and inference frameworks (vLLM, TensorRT-LLM, SGLang, ONNX Runtime)
- Proficiency in Python and C++
- Hands-on experience with PyTorch (TorchCompile, graph-level optimization) and/or TensorFlow
- Knowledge of low-level hardware concepts (GPU memory hierarchy, caching, vectorization)
- Familiarity with cloud platforms (AWS, GCP, Azure)
- Familiarity with AI deployment tools (Docker, Kubernetes, MLflow)
Qualifications
- Track record of building scalable, reliable AI infrastructure under real-world production constraints.
Nice to Have
- Java proficiency
Skills
Python
*
AWS
*
Azure
*
Java
*
Kubernetes
*
Docker
*
C++
*
TensorFlow
*
PyTorch
*
GCP
*
LLM
*
CUDA
*
vLLM
*
MLflow
*
ASR
*
MT systems
*
TensorRT-LLM
*
SGLang
*
ONNX Runtime
*
TorchCompile
*
* Required skills
Benefits
Equity
Bonus
About Zoom
Zoomies help people stay connected so they can get more done together, building the best collaboration platform for the enterprise with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars.
Technology
View all jobs at Zoom →
Related Searches
Similar Jobs
Solutions Engineer/Lab Manager
Active
Zoom
·
$131,200 - $262,400
LMS platforms
Zoom Phone
Zoom Workplace
Zoom Contact Center (ZCX)
+4 more
3 weeks ago
Partner Solutions Engineer
Active
Zoom
·
$146,000 - $319,400
AI
Microsoft
Cisco
Google
+11 more
3 weeks ago
Web Designer
Active
Zoom
·
$87,600 - $186,000
4 weeks ago
Mid Market Contact Center Solutions Engineer
Expired
Zoom
·
$111,800 - $223,500
Quality Management
CCaaS
Conversational AI
Workforce Management
1 month ago
Senior Commercial Counsel (Channel)
Expired
Zoom
·
San Jose, CA
·
$124,000 - $271,200
Communication Skills
Organizational skills
Interpersonal Skills
Negotiation
+2 more
1 month ago