cointegrated's picture
Update README.md
710e46e
---
language:
- ru
tags:
- fluency
---
This is a [ruRoberta-large](https://huggingface.co/sberbank-ai/ruRoberta-large) model trained on the [RuCoLa](https://rucola-benchmark.com/) dataset. It can be used to classify Russian sentences into fluent or non-fluent ones, where fluency is understood as linguistic acceptability.
Training notebook: `task_oriented_TST/fluency/rucola_classifier_v1.ipynb` (in a private repo).
Training parameters:
* optimizer: Adam
* `lr=2e-6`
* `batch_size=32`
* `epochs=10`
* `clip_grad_norm=1.0`
Test accuracy (on the [leaderboard](https://rucola-benchmark.com/leaderboard) this model is submitted as `ruroberta-base-cased-rucola-v1`): 0.81.