Seemless Integration, Exceptional Results

Chain‑of‑Thought Data Curator

micro1

About the job

Job Title: Chain‑of‑Thought Data Curator

Job Type: Full-time or Part-time contract

Location: Remote

Job Summary:

Join our customer’s team as a Chain‑of‑Thought Data Curator and play a pivotal role in advancing large-language-model reasoning. You’ll be responsible for crafting and evaluating gold-standard datasets that push the limits of multi-step reasoning in AI. Leverage your STEM-oriented and generalist mindset to create benchmarks that set the industry standard.

Key Responsibilities:

  • Develop and curate gold-standard Chain-of-Thought (CoT) datasets across diverse reasoning-heavy tasks.
  • Design clear, scalable rubrics and instructions to evaluate and annotate multi-step reasoning processes.
  • Write precise, well-structured CoT responses that demonstrate high-level generalist reasoning, with a preference for STEM contexts.
  • Critically assess logical flow, correctness, and justification within reasoning chains, ensuring rigor and fidelity.
  • Identify and document common model failure types, such as hallucination, shortcut reasoning, and unsupported leaps.
  • Collaborate with AI trainers, model evaluators, and RLHF annotators to refine CoT benchmarks and annotation protocols.
  • Stress-test the depth and reliability of LLM reasoning across varied benchmarks.

Required Skills and Qualifications:

  • Extensive experience in creating or curating CoT or instruction tuning datasets for AI/LLMs.
  • Proven ability to design and implement binary or graded rubrics for evaluating multi-step reasoning outputs.
  • Robust generalist analytical skills, ideally with a STEM or competitive exam background.
  • Exceptional written and verbal communication abilities, with attention to clarity and structure.
  • A deep understanding of LLM failure modes and reasoning pitfalls in model outputs.
  • Experience balancing fine-grained evaluation criteria with scalable instructions for diverse teams.
  • Background in RLHF annotation, AI model evaluation, or prompt engineering highly valued.

Preferred Qualifications:

  • Experience with instruction tuning, model evaluation, or advanced prompt engineering projects.
  • Exposure to cross-disciplinary reasoning tasks and datasets.
  • Strong track record of collaborating with AI research or data curation teams.

Share this job

Categories

Recruiter Features

Related Jobs

Old Mutual

Data Analyst

Responsible for ensuring the completeness, accuracy & integrity of client data

International Water Management Institute (IWMI)

Consultant – Hydrological Data Scientist

Review and compile relevant hydro-climatological datasets

Proxify

Senior ML Engineer (Python)

We are looking for Machine Learning Engineers with strong Python experience 

Fido

Data Analyst

5+ years of experience as a Data Analyst

Renew Capital

Sr. Financial Analyst

The sr. financial analyst will work closely with investment managers

M-KOPA

Senior Data Scientist – Credit Modeling

You’ll be joining a team that’s rapidly expanding our credit capabilities

AmaliTech

Data Engineer

AmaliTech is looking for an experienced Data Engineer to join our data team

Aya Data

AI SOLUTIONS – PRODUCT ASSOCIATE

Manage project timelines and client communications

Raya

Senior Data Engineer, Data Products

You will be a founding member of the data engineering team

Turing

Remote Business Analyst

You will be working on projects to help fine-tune large language models