Interested in this position?
Upload your resume and we'll match you with this and other relevant opportunities.
Upload Your ResumeAbout This Role
Evaluate and refine AI language models' behavior in engineering scenarios by writing and refining prompts, assessing technical accuracy, and ensuring alignment with conversational guidelines.
Responsibilities
- Write and refine prompts to guide model behavior in engineering scenarios
- Evaluate LLM-generated responses for technical accuracy and applied reasoning
- Conduct fact-checking and verify technical claims using authoritative sources
- Annotate model responses, identifying strengths and areas for improvement
- Assess clarity and appropriateness of explanations for different audiences
- Ensure model responses align with expected conversational behavior and guidelines
Requirements
- PhD in Engineering or a closely related field
- Expertise in Mechanical & Physical Systems Engineering, Electrical, Electronic & Computer Engineering, Chemical, Materials & Process Engineering, or Civil, Environmental & Infrastructure Engineering
- Significant experience using large language models (LLMs)
- Excellent writing skills
- Strong attention to detail
- Experience reviewing or editing technical or academic writing
Qualifications
- PhD in Engineering or a closely related field
Nice to Have
- Experience with applied research, industry engineering workflows, or systems design
- Prior experience with RLHF, model evaluation, or data annotation work
- Experience teaching or explaining engineering concepts to non-expert audiences
- Familiarity with evaluation rubrics or structured review frameworks
Skills
LLM
*
RLHF
*
* Required skills
About Mercor
Mercor connects elite creative and technical talent with leading AI research labs.
Technology
View all jobs at Mercor →
Related Searches
Similar Jobs
Data Science Scientist
Active Remote
Mercor
·
$56 - $77
Python
SQL
Snowflake
BigQuery
+6 more
1 week ago
Engineering Program Coordinator
Active Remote
Mercor
·
$65 - $80
1 week ago
Frontend Engineer
Active Remote
Mercor
·
$70 - $80
JavaScript
REACT
TypeScript
Vanilla JS
1 week ago
Software Engineer – Codepath Expert
Active Remote
Mercor
·
$90 - $105
Python
C++
MySQL
1 week ago
Content Analyst
Active Remote
Mercor
·
$30 - $35
1 week ago