Historical text normalization datasets
Parallel corpora of historical spellings and their normalizations. They can be used to train a normalizer.
textplus-bbaw/dtak-transnormer-basic-v1 • Updated Apr 1 • 5.65M • 88
textplus-bbaw/dtak-transnormer-full-v1 • Updated Feb 26 • 70
textplus-bbaw/dta-reviEvalCorpus-v1 • Updated Nov 27, 2024 • 245k • 17
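
As a rough illustration of what a parallel corpus of spellings and normalizations looks like in practice, the sketch below turns a few records into (source, target) training pairs. The records are invented for the example, and the field names `orig` and `norm` are assumptions, not confirmed column names of the datasets listed above; `to_pairs` is a hypothetical helper.

```python
# Invented example records; the keys "orig" and "norm" are assumed field names.
records = [
    {"orig": "Die Thür war offen.", "norm": "Die Tür war offen."},
    {"orig": "Er that es nicht.", "norm": "Er tat es nicht."},
    {"orig": "Das Haus ist alt.", "norm": "Das Haus ist alt."},
]


def to_pairs(rows, src_key="orig", tgt_key="norm"):
    """Build (historical, normalized) pairs, skipping rows with a missing side."""
    return [
        (row[src_key].strip(), row[tgt_key].strip())
        for row in rows
        if row.get(src_key) and row.get(tgt_key)
    ]


pairs = to_pairs(records)

# Pairs where the normalization actually changes the spelling.
changed = [(src, tgt) for src, tgt in pairs if src != tgt]

for src, tgt in changed:
    print(f"{src!r} -> {tgt!r}")
```

Unchanged pairs are kept in `pairs` on purpose: a normalizer also has to learn when to leave a spelling alone.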
Transnormer models
Byte-level seq2seq models that can normalize historical German spellings.
textplus-bbaw/transnormer-18-19c-beta-v01 • Text Generation • Updated Mar 13 • 11
textplus-bbaw/transnormer-19c-beta-v02 • Text Generation • 0.3B • Updated Mar 13 • 59 • 1
ybracke/transnormer-19c-beta-v01 • 0.3B • Updated Feb 3 • 4 • 1
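
To make "byte-level" concrete, here is a minimal sketch of how a historical spelling looks when it is represented as UTF-8 bytes instead of subword tokens. It illustrates the general idea only; the tokenizers of the models listed above may add offsets or special tokens, which are not shown here. `to_byte_ids` and `from_byte_ids` are hypothetical helper names.

```python
def to_byte_ids(text: str) -> list[int]:
    """Represent a string as its sequence of UTF-8 byte values."""
    return list(text.encode("utf-8"))


def from_byte_ids(ids: list[int]) -> str:
    """Invert to_byte_ids: rebuild the string from its byte values."""
    return bytes(ids).decode("utf-8")


word = "Waſſer"  # historical spelling with long s (U+017F)
ids = to_byte_ids(word)

print(len(word), "characters,", len(ids), "bytes")
print(ids)
print(from_byte_ids(ids) == word)
```

Because every character maps to one or more bytes, unusual historical glyphs such as the long s never fall outside the vocabulary; the trade-off is that input sequences get longer than the character count.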