Engineering Manager, ML Infrastructure

ServiceNow Mountain View, CA
Full Time Mid Level 5+ years

Posted 3 weeks ago

Interested in this position?

Upload your resume and we'll match you with this and other relevant opportunities.

Upload Your Resume

About This Role

This role is for an Engineering Manager to lead the Machine Learning Infrastructure team, focusing on developing and scaling the platform that powers Moveworks' conversational AI. The manager will guide the technical vision and scale end-to-end systems for the ML/LLM lifecycle.

Responsibilities

  • Lead, mentor, and grow a world-class team of ML and Systems Engineers, fostering a culture of innovation, ownership, and operational excellence.
  • Own the technical vision and roadmap for the end-to-end ML platform that powers the entire lifecycle for hundreds of production models.
  • Drive the strategy for model performance and efficiency, making critical architectural decisions to optimize GPU infrastructure.
  • Partner with leaders across agentic platform, search platform, product engineering, and core infrastructure teams to define and deliver foundational infrastructure.
  • Champion a product mindset for your platform, building powerful abstractions and tools to accelerate machine learning engineers and researchers.

Requirements

  • 5+ years industry experience leading/managing high-performing ML or infrastructure teams
  • Deep technical expertise in designing, building, and scaling end-to-end machine learning systems in production environments
  • Strong command of Python
  • Experience with performant languages such as C++ or GoLang
  • Extensive experience with deep learning frameworks like PyTorch or Hugging Face
  • Hands-on experience with modern LLM infrastructure, including distributed training frameworks (e.g., Deepspeed) and inference/serving frameworks (e.g., vLLM, TensorRT-LLM, Kubernetes)
  • Strategic mindset balancing robust infrastructure with forward-looking research and development
  • Excellent communication and collaboration skills

Qualifications

  • Master's or Ph.D. in Computer Science, Machine Learning, or a related field
  • 5+ years of industry experience with a proven track record of leading or managing high-performing machine learning or infrastructure teams.

Skills

Python * Kubernetes * C++ * PyTorch * Golang * DeepSpeed * vLLM * Hugging Face * TensorRT-LLM *

* Required skills

About ServiceNow

Global market leader bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500, with an intelligent cloud-based platform connecting people, systems, and processes.

Technology
View all jobs at ServiceNow →