RL Environments Specialist

Remote

xAI Palo Alto, CA $100 - $100

Full Time Entry Level

Posted 3 months ago Expired

This job has expired

Looking for a job like RL Environments Specialist in or near Palo Alto, CA? Upload your resume and we'll notify you when similar positions become available.

Upload Your Resume

About This Role

Create full Reinforcement Learning (RL) environments, including UI and backend, to programmatically generate and validate tasks for training computer-use agents, taking ownership of the entire task creation process for specific environments.

Responsibilities

Build sandbox UIs that agents and RL actors will interact with
Create tasks for built environments
Programmatically validate task completion
Take ownership of the entire task creation process for a given environment

Requirements

Strong professional experience with React.js (hooks, modern state management, TypeScript preferred)
Strong professional experience building backend services in Python (FastAPI, Flask, or Django)
Hands-on experience with containerization (Docker required)
Strong front-end design skills and exceptionally high taste in UI/UX, polish, and visual detail
Proven ability to design a relational database schema in Python and populate it with large-scale, realistic mock data
Experience creating and exposing clean, well-documented API endpoints (REST or GraphQL)
Exceedingly high standards for code quality, readability, testing, and front-end craftsmanship
Extensive day-to-day experience using coding agents / AI assistants as a power user (Cursor, Claude, Copilot, Grok, Aider, etc.)
Good understanding of the Reinforcement Learning paradigm (RLHF, PPO, DPO, reward modeling, etc.)

Nice to Have

Experience with Docker Compose/Kubernetes
Posses strong logical reasoning skills, is detail-oriented, and thrives in a fast-paced work environment
Eager to teach to and learn from teammates
Enthusiasm to collaboratively build the best truth-seeking AI

Skills

Python * Copilot * Kubernetes * Docker * REST * TypeScript * Django * FastAPI * Flask * ReactJS * Claude * Reinforcement learning * RLHF * DPO * PPO * Cursor * GraphQL * Docker Compose * Grok * Aider *

* Required skills

About xAI

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

Technology

View all jobs at xAI →

Similar Jobs

RL Environments Specialist

This job has expired

About This Role

Responsibilities

Requirements

Nice to Have

Skills

About xAI

Related Searches

Similar Jobs

AI Tutor - Image Specialist

AI Finance Tutor - Sell-Side

AI Tutor - Earth Science Specialist

IT Services Technician

IT Services Technician