Engineering Manager, ML Infrastructure
ServiceNow
Mountain View, CA
Full Time
Mid Level
5+ years
Posted 3 weeks ago
Interested in this position?
Upload your resume and we'll match you with this and other relevant opportunities.
Upload Your ResumeAbout This Role
This role is for an Engineering Manager to lead the Machine Learning Infrastructure team, focusing on developing and scaling the platform that powers Moveworks' conversational AI. The manager will guide the technical vision and scale end-to-end systems for the ML/LLM lifecycle.
Responsibilities
- Lead, mentor, and grow a world-class team of ML and Systems Engineers, fostering a culture of innovation, ownership, and operational excellence.
- Own the technical vision and roadmap for the end-to-end ML platform that powers the entire lifecycle for hundreds of production models.
- Drive the strategy for model performance and efficiency, making critical architectural decisions to optimize GPU infrastructure.
- Partner with leaders across agentic platform, search platform, product engineering, and core infrastructure teams to define and deliver foundational infrastructure.
- Champion a product mindset for your platform, building powerful abstractions and tools to accelerate machine learning engineers and researchers.
Requirements
- 5+ years industry experience leading/managing high-performing ML or infrastructure teams
- Deep technical expertise in designing, building, and scaling end-to-end machine learning systems in production environments
- Strong command of Python
- Experience with performant languages such as C++ or GoLang
- Extensive experience with deep learning frameworks like PyTorch or Hugging Face
- Hands-on experience with modern LLM infrastructure, including distributed training frameworks (e.g., Deepspeed) and inference/serving frameworks (e.g., vLLM, TensorRT-LLM, Kubernetes)
- Strategic mindset balancing robust infrastructure with forward-looking research and development
- Excellent communication and collaboration skills
Qualifications
- Master's or Ph.D. in Computer Science, Machine Learning, or a related field
- 5+ years of industry experience with a proven track record of leading or managing high-performing machine learning or infrastructure teams.
Skills
Python
*
Kubernetes
*
C++
*
PyTorch
*
Golang
*
DeepSpeed
*
vLLM
*
Hugging Face
*
TensorRT-LLM
*
* Required skills
About ServiceNow
Global market leader bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500, with an intelligent cloud-based platform connecting people, systems, and processes.
Technology
View all jobs at ServiceNow →
Related Searches
Similar Jobs
Sr Advisory Solution Consultant - Hyperscaler Accounts
Active
ServiceNow
·
Santa Clara, CA
·
$168,750 - $278,475
AI
ServiceNow platform
1 week ago
Manager, Software Engineering Management
Active
ServiceNow
·
New York, NY
·
$166,500 - $291,400
AI
Go
Distributed Systems
Rust
+1 more
2 weeks ago
Senior Software Engineer, Agentic AI Systems
Active
ServiceNow
·
Mountain View, CA
Python
Java
AI agents
Golang
+3 more
2 weeks ago
Solution Sales Executive - Moveworks
Active
ServiceNow
·
Washington, DC
AI
LLMs
3 weeks ago
Tech Lead, Agentic AI Platform
Active
ServiceNow
·
Mountain View, CA
Generative AI
LLMs
Optimization
API design
+4 more
3 weeks ago