Bilingual (English & Bengali) AI Model Evaluator - Remote Contract Role

Mercor • United State
Remote
Apply
AI Summary

Evaluate AI model responses, conduct fact-checking, and improve AI performance. Native Bengali speaker with strong English writing skills required. Experience with large language models and structured analytical thinking preferred.

Key Highlights
Fact-check responses using public sources and tools
Assess response quality, clarity, and tone
Ensure model responses align with guidelines
Work independently and asynchronously to meet deadlines
Key Responsibilities
Conduct fact-checking using trusted public sources and external tools
Generate high-quality human evaluation data by identifying response strengths, areas for improvement, and factual inaccuracies
Assess reasoning quality, clarity, tone, and completeness of responses
Ensure model responses align with expected conversational behavior and system guidelines
Work independently and asynchronously to meet deadlines while improving AI model performance
Technical Skills Required
Large Language Models (LLMs) Fact-checking Data Analysis
Benefits & Perks
Competitive hourly compensation ($15-$20/hour)
Remote work
Nice to Have
Prior experience with RLHF, model evaluation, or data annotation work
Experience writing or editing high-quality written content
Experience comparing multiple outputs and making fine-grained qualitative judgments

Job Description


About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Generalist - English & Bengali

Type: Contract

Compensation: $15–$20/hour

Location: Remote

Role Responsibilities

  • Conduct fact-checking using trusted public sources and external tools.
  • Generate high-quality human evaluation data by identifying response strengths, areas for improvement, and factual inaccuracies.
  • Assess reasoning quality, clarity, tone, and completeness of responses.
  • Ensure model responses align with expected conversational behavior and system guidelines.
  • Work independently and asynchronously to meet deadlines while improving AI model performance.


Qualifications

Must-Have

  • Bachelor's degree
  • Native speaker in Bengali
  • Significant experience using large language models (LLMs)
  • Excellent writing skills in English
  • Strong attention to detail
  • Background or experience in domains requiring structured analytical thinking (e.g., research, policy, analytics, linguistics, engineering)


Preferred

  • Prior experience with RLHF, model evaluation, or data annotation work
  • Experience writing or editing high-quality written content
  • Experience comparing multiple outputs and making fine-grained qualitative judgments


Application Process (Takes 20–30 mins to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form


Resources & Support

  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome
  • For any help or support, reach out to: support@mercor.com


PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Similar Jobs

Explore other opportunities that match your interests

HR Services Specialist

Hr
•
52m ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

arizona state university

United State

Payroll Manager

Hr
•
57m ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

AssetWatch®

United State

Payroll Specialist

Hr
•
8h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

GE Aerospace

United State

Subscribe our newsletter

New Things Will Always Update Regularly