AI RESEARCH

OARelatedWork: A Large-Scale Dataset of Related Work Sections with Full-texts from Open Access Sources

arXiv CS.CL

This paper introduces OARelatedWork: a dataset for related work generation from open-access sources. It is the first large-scale multi-document summarization dataset for related work generation, containing whole related work sections and full texts of cited papers. Its validation and test splits are constructed so that every cited paper is available in full text, enabling controlled evaluation of full-text related work generation.