RL Environments Specialist

Remote
xAI Palo Alto, CA $100 - $100
Full Time Entry Level

Posted 3 months ago Expired

This job has expired

Looking for a job like RL Environments Specialist in or near Palo Alto, CA? Upload your resume and we'll notify you when similar positions become available.

Upload Your Resume

About This Role

Create full Reinforcement Learning (RL) environments, including UI and backend, to programmatically generate and validate tasks for training computer-use agents, taking ownership of the entire task creation process for specific environments.

Responsibilities

  • Build sandbox UIs that agents and RL actors will interact with
  • Create tasks for built environments
  • Programmatically validate task completion
  • Take ownership of the entire task creation process for a given environment

Requirements

  • Strong professional experience with React.js (hooks, modern state management, TypeScript preferred)
  • Strong professional experience building backend services in Python (FastAPI, Flask, or Django)
  • Hands-on experience with containerization (Docker required)
  • Strong front-end design skills and exceptionally high taste in UI/UX, polish, and visual detail
  • Proven ability to design a relational database schema in Python and populate it with large-scale, realistic mock data
  • Experience creating and exposing clean, well-documented API endpoints (REST or GraphQL)
  • Exceedingly high standards for code quality, readability, testing, and front-end craftsmanship
  • Extensive day-to-day experience using coding agents / AI assistants as a power user (Cursor, Claude, Copilot, Grok, Aider, etc.)
  • Good understanding of the Reinforcement Learning paradigm (RLHF, PPO, DPO, reward modeling, etc.)

Nice to Have

  • Experience with Docker Compose/Kubernetes
  • Posses strong logical reasoning skills, is detail-oriented, and thrives in a fast-paced work environment
  • Eager to teach to and learn from teammates
  • Enthusiasm to collaboratively build the best truth-seeking AI

Skills

Python * Copilot * Kubernetes * Docker * REST * TypeScript * Django * FastAPI * Flask * ReactJS * Claude * Reinforcement learning * RLHF * DPO * PPO * Cursor * GraphQL * Docker Compose * Grok * Aider *

* Required skills

About xAI

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

Technology
View all jobs at xAI →