Evaluate AI-generated search responses for factual accuracy and quality. Assess model responses and provide concise rationales. Apply project guidelines and identify recurring failure modes.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Nice to Have
Job Description
About The Job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: Search Generalist Expert
Type: Contract
Compensation: $10–$30/hour
Location: Remote
Role Responsibilities
- Evaluate AI-generated search responses for factual accuracy, helpfulness, clarity, completeness, and overall quality.
- Assess whether models use search appropriately and whether search queries are well-formed and effective.
- Compare model responses side by side and provide concise, defensible rationales.
- Write and refine prompts, golden answers, rubric criteria, and edge cases for search-related evaluations.
- Apply project guidelines consistently across ambiguous, multi-step, and real-world search tasks.
- Identify recurring failure modes and escalate unclear cases or rubric gaps to project leads.
Interested in remote work opportunities in Human Resource? Discover Human Resource Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.
Must-Have
- Excellent written English and strong online research skills.
- Strong judgment when synthesizing information from multiple sources.
- Ability to distinguish factual accuracy from fluency, confidence, or style.
- High attention to detail and comfort following structured guidelines.
- Reliable, self-directed, and responsive in an asynchronous remote environment.
- Experience in search quality, fact-checking, content evaluation, trust and safety, annotation, QA, or prompt/rubric writing.
- Familiarity with search evaluation concepts such as factuality, helpfulness, severity, side-by-side comparisons, or tool-use assessment.
- Experience working with LLM evaluation workflows or human data projects.
- Multilingual skills are a plus.
- Bachelor’s degree preferred; advanced degree or strong professional background is a plus.
Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.
- Upload resume
- AI interview based on your resume
- Submit form
- For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome
- For any help or support, reach out to: support@mercor.com
,
Similar Jobs
Explore other opportunities that match your interests
Global Payroll Manager
Haystack
blusource recruitment