This collection contains curated text similarity datasets that are available on Hugging Face Datasets.
- jakartaresearch/id-paraphrase-detection
- andreaschandra/quora-question-pairs-id
- sentence-transformers/parallel-sentences-global-voices
- sentence-transformers/parallel-sentences-opensubtitles