Dataset features

rftd contains human-applied semantic tags representing hundreds of thousands of legal and factual issues across more than 20,000 judgments. More detailed statistics are provided with the sample dataset.
Semantic tags are applied following a strict QA process with auditing and redundancy.
rftd includes human-readable descriptions of the semantic tags, tests for annotation decisions, and sample source text for each annotation.
Each case record contains judgment metadata (date, judge(s), online sources, etc.) as well as machine-readable case outcomes.
rftd is provided in JSON format for easy ingestion by machine learning models.

Please contact us for further information.