This collection contains currated text similarity datasets that are available in huggingface dataset
-
jakartaresearch/id-paraphrase-detection
Viewer • Updated • 5.8k • 95 • 3 -
andreaschandra/quora-question-pairs-id
Viewer • Updated • 1k • 199 • 1 -
sentence-transformers/parallel-sentences-global-voices
Viewer • Updated • 2.2M • 711 -
sentence-transformers/parallel-sentences-opensubtitles
Viewer • Updated • 274M • 2.31k • 3