BACK_TO_FEEDAICRIER_2
Code Review Dataset drops 355k review rows
OPEN_SOURCE ↗
REDDIT · REDDIT// 33d agoOPENSOURCE RELEASE

Code Review Dataset drops 355k review rows

Ronan Takizawa released an open Hugging Face dataset of 355,807 code review examples built from 725 permissively licensed GitHub repos across 37 languages, pairing human review comments with before/after code changes plus negative examples where no review comment was needed. The same release also claims a Qwen2.5-Coder-32B fine-tune on the dataset delivered roughly 4x better BLEU-4, ROUGE-L, and SBERT scores than the base model for review-style tasks.

// ANALYSIS

This is the kind of dataset AI coding research has been missing: real reviewer feedback tied to actual code edits instead of synthetic instruction fluff.

  • The strongest signal here is the triplet structure: diff context, reviewer comment, and resulting code change all in one row
  • Negative examples matter almost as much as positive ones because they teach models when clean code should pass without noisy comments
  • Coverage across 725 repos and 37 languages makes it more useful for generalist coding models than single-language benchmark sets
  • The permissive-license filtering lowers legal friction for teams experimenting with fine-tuning on real OSS review data
  • The reported model gains are promising, but they are still self-reported metrics rather than an independently validated benchmark
// TAGS
code-review-datasetcode-reviewai-codingfine-tuningopen-sourceresearch

DISCOVERED

33d ago

2026-03-09

PUBLISHED

33d ago

2026-03-09

RELEVANCE

8/ 10

AUTHOR

Ok_Employee_6418