AI RESEARCH

BullingerDB: A Dataset for Handwritten Text Recognition and Writer Retrieval

arXiv CS.CV

ArXi:2605.30235v1 Announce Type: new We present BullingerDB, a large-scale benchmark dataset for historical document analysis based on the correspondence of Heinrich Bullinger (1504-1575). The corpus comprises 20,898 pages and 499,222 text lines