Principal Machine Learning Researcher, On-Device Optimization

HP Palo Alto, CA $150,000 - $250,000
Full Time Lead Level 2+ years

Posted 1 month ago Expired

This job has expired

Looking for a job like Principal Machine Learning Researcher, On-Device Optimization in or near Palo Alto, CA? Upload your resume and we'll notify you when similar positions become available.

Upload Your Resume

About This Role

Advance on-device AI optimization techniques, bridging applied research and product development at HP IQ, an AI innovation lab. Develop and implement model compression techniques to deploy state-of-the-art transformer and vision models on HP laptops and edge devices.

Responsibilities

  • Research and implement model compression techniques including quantization, low-rank factorization, distillation, and pruning
  • Develop methods to deploy SOTA transformer and vision models on-device under hardware constraints
  • Lead investigations into hardware-aware training strategies to optimize latency, throughput, and memory usage
  • Collaborate with software engineers and system architects to integrate models into AI companion apps
  • Evaluate and benchmark different frameworks and quantization strategies (e.g., AWQ, GPTQ, SmoothQuant)

Requirements

  • PhD in Computer Science, Electrical Engineering, or related field with focus on efficient ML, systems ML, or compiler design for ML
  • 2+ years of industry or applied research experience
  • Strong background in model optimization for edge computing or mobile/embedded deployment
  • Familiarity with PyTorch, ONNX, TensorRT, OpenVINO, QNN, or Llama.cpp
  • Understanding of tradeoffs in asymmetric/symmetric quantization, calibration methods, and inference tuning

Qualifications

  • PhD in Computer Science, Electrical Engineering, or related field with focus on efficient ML, systems ML, or compiler design for ML
  • 2+ years of industry or applied research experience

Nice to Have

  • Experience publishing at top ML/Systems conferences (e.g., NeurIPS, ICML, MLSys)
  • Familiarity with embedded ML for consumer devices
  • GPU and system-level profiling tools (e.g., CUDA, nvprof, perf)
  • Contributions to open-source ML optimization frameworks

Skills

PyTorch * perf * CUDA * ONNX * TensorRT * OpenVINO * QNN * Llama.cpp * nvprof *

* Required skills

Benefits

Health Insurance
Life Insurance
11 paid holidays
Additional flexible paid vacation and sick leave
Dental Insurance
Employee Assistance Program
4-12 weeks fully paid parental leave based on tenure
Vision Insurance
Flexible Spending Account
Long term/short term disability insurance

About HP

HP IQ is HP's new AI innovation lab. Combining startup agility with HP's global scale, building intelligent technologies that redefine how the world works, creates, and collaborates.

Technology
View all jobs at HP →