Software Engineer Evaluator (US)
RemotePosted 2 weeks ago
Interested in this position?
Upload your resume and we'll match you with this and other relevant opportunities.
Upload Your ResumeAbout This Role
As a Software Engineering Evaluator, you will contribute to cutting-edge datasets for training, benchmarking, and advancing large language models by curating and correcting code examples in various programming languages. Your role involves evaluating AI-generated code for efficiency and reliability, and collaborating with cross-functional teams to enhance AI-driven coding solutions.
Responsibilities
- Create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers.
- Curate code examples, provide precise solutions, and make corrections in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go.
- Evaluate and refine AI-generated code for efficiency, scalability, and reliability.
- Work with cross-functional teams to enhance enterprise-level AI-driven coding solutions against industry performance benchmarks.
- Build agents that can verify the quality of the code and identify error patterns.
- Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them.
- Design verification mechanisms that can automatically verify a solution to a software engineering task.
Requirements
- 5+ years of software engineering experience
- 2+ years of continuous full-time experience at a top-tier product company
- Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools
- Deep understanding of software architecture, design, development, debugging, and code quality/review assessment
- Excellent oral and written communication skills
Qualifications
- Several years of software engineering experience (+5 years), including 2+ years of continuous full-time experience at a top-tier product company (e.g., Google, Stripe, Amazon, Apple, Meta, Netflix, Microsoft, Datadog, Dropbox, Shopify, PayPal, IBM Research).
Skills
* Required skills
About Turing
Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems, based in San Francisco, California.