Bilingual (English & Bengali) AI Model Evaluator - Remote Contract Role

Mercor • United State

Remote

Apply

AI Summary

Evaluate AI model responses, conduct fact-checking, and improve AI performance. Native Bengali speaker with strong English writing skills required. Experience with large language models and structured analytical thinking preferred.

Key Highlights

Fact-check responses using public sources and tools

Assess response quality, clarity, and tone

Ensure model responses align with guidelines

Work independently and asynchronously to meet deadlines

Key Responsibilities

Conduct fact-checking using trusted public sources and external tools

Generate high-quality human evaluation data by identifying response strengths, areas for improvement, and factual inaccuracies

Assess reasoning quality, clarity, tone, and completeness of responses

Ensure model responses align with expected conversational behavior and system guidelines

Work independently and asynchronously to meet deadlines while improving AI model performance

Technical Skills Required

Large Language Models (LLMs) Fact-checking Data Analysis

Benefits & Perks

Competitive hourly compensation ($15-$20/hour)

Remote work

Nice to Have

Prior experience with RLHF, model evaluation, or data annotation work

Experience writing or editing high-quality written content

Experience comparing multiple outputs and making fine-grained qualitative judgments

Job Description

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Generalist - English & Bengali

Type: Contract

Compensation: $15–$20/hour

Location: Remote

Role Responsibilities

Conduct fact-checking using trusted public sources and external tools.
Generate high-quality human evaluation data by identifying response strengths, areas for improvement, and factual inaccuracies.
Assess reasoning quality, clarity, tone, and completeness of responses.
Ensure model responses align with expected conversational behavior and system guidelines.
Work independently and asynchronously to meet deadlines while improving AI model performance.

Interested in remote work opportunities in Human Resource? Discover Human Resource Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.

Qualifications

Must-Have

Bachelor's degree
Native speaker in Bengali
Significant experience using large language models (LLMs)
Excellent writing skills in English
Strong attention to detail
Background or experience in domains requiring structured analytical thinking (e.g., research, policy, analytics, linguistics, engineering)

Preferred

Prior experience with RLHF, model evaluation, or data annotation work
Experience writing or editing high-quality written content
Experience comparing multiple outputs and making fine-grained qualitative judgments

Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.

Application Process (Takes 20–30 mins to complete)

Upload resume
AI interview based on your resume
Submit form

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome
For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Job Overview

Posted Date Jun 04, 2026

Employment Type Part-time

Experience Level Not Applicable

Location United State

Category Hr

Company Mercor

Mentioned Skills

Similar Jobs

Explore other opportunities that match your interests

HR Services Specialist

•

52m ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Not Applicable

arizona state university

United State

Payroll Manager

•

57m ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Not Applicable

AssetWatch®

United State

Payroll Specialist

•

8h ago

Premium Job

•••••• •••••• ••••••

Job Type ••••••

Experience Level ••••••

GE Aerospace

United State

Bilingual (English & Bengali) AI Model Evaluator - Remote Contract Role

Key Highlights

Key Responsibilities

Technical Skills Required

Benefits & Perks

Nice to Have

Job Description

Job Overview

Mentioned Skills

Industries

Similar Jobs

HR Services Specialist

arizona state university

Payroll Manager

AssetWatch®

Payroll Specialist

Premium Job

GE Aerospace

Subscribe our newsletter