RL Environments Specialist
Remote
Full Time
Entry Level
Posted 3 weeks ago
Interested in this position?
Upload your resume and we'll match you with this and other relevant opportunities.
Upload Your ResumeAbout This Role
Create full Reinforcement Learning (RL) environments, including UI and backend, to programmatically generate and validate tasks for training computer-use agents, taking ownership of the entire task creation process for specific environments.
Responsibilities
- Build sandbox UIs that agents and RL actors will interact with
- Create tasks for built environments
- Programmatically validate task completion
- Take ownership of the entire task creation process for a given environment
Requirements
- Strong professional experience with React.js (hooks, modern state management, TypeScript preferred)
- Strong professional experience building backend services in Python (FastAPI, Flask, or Django)
- Hands-on experience with containerization (Docker required)
- Strong front-end design skills and exceptionally high taste in UI/UX, polish, and visual detail
- Proven ability to design a relational database schema in Python and populate it with large-scale, realistic mock data
- Experience creating and exposing clean, well-documented API endpoints (REST or GraphQL)
- Exceedingly high standards for code quality, readability, testing, and front-end craftsmanship
- Extensive day-to-day experience using coding agents / AI assistants as a power user (Cursor, Claude, Copilot, Grok, Aider, etc.)
- Good understanding of the Reinforcement Learning paradigm (RLHF, PPO, DPO, reward modeling, etc.)
Nice to Have
- Experience with Docker Compose/Kubernetes
- Posses strong logical reasoning skills, is detail-oriented, and thrives in a fast-paced work environment
- Eager to teach to and learn from teammates
- Enthusiasm to collaboratively build the best truth-seeking AI
Skills
Python
*
Copilot
*
Kubernetes
*
Docker
*
REST
*
TypeScript
*
Django
*
FastAPI
*
Flask
*
ReactJS
*
Claude
*
Reinforcement learning
*
RLHF
*
DPO
*
PPO
*
Cursor
*
GraphQL
*
Docker Compose
*
Grok
*
Aider
*
* Required skills
About xAI
xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.
Technology
View all jobs at xAI →
Related Searches
Similar Jobs
AI Tutor - Image Specialist
Active Remote
xAI
·
$45 - $75
Prompt Engineering
Midjourney
Flux
Grok Imagine
+3 more
2 weeks ago
AI Finance Tutor - Sell-Side
Active
xAI
·
Palo Alto, CA
·
$45 - $100
AI
Data Annotation
Chromebook
macOS 11.0 or later
+1 more
3 weeks ago
AI Tutor - Earth Science Specialist
Active
xAI
·
Palo Alto, CA
·
$45 - $75
Databases
Proprietary software applications
3 weeks ago
IT Services Technician
Expired
xAI
·
New York, NY
·
$75,000 - $95,000
Microsoft Office Suite
ServiceNow
Jira
TCP/IP
+6 more
1 month ago
IT Services Technician
Expired
xAI
·
Bastrop, TX
·
$75,000 - $95,000
Microsoft Office Suite
ServiceNow
Jira
TCP/IP
+6 more
1 month ago