File size: 676 Bytes
d2bc0bc
 
 
 
 
 
 
710e46e
d2bc0bc
71b1155
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
---
language:
- ru
tags:
- fluency
---

This is a [ruRoberta-large](https://huggingface.co/sberbank-ai/ruRoberta-large) model trained on the [RuCoLa](https://rucola-benchmark.com/) dataset. It can be used to classify Russian sentences into fluent or non-fluent ones, where fluency is understood as linguistic acceptability. 

Training notebook: `task_oriented_TST/fluency/rucola_classifier_v1.ipynb` (in a private repo). 

Training parameters: 
* optimizer: Adam
* `lr=2e-6`
* `batch_size=32`
* `epochs=10`
* `clip_grad_norm=1.0`

Test accuracy (on the [leaderboard](https://rucola-benchmark.com/leaderboard) this model is submitted as `ruroberta-base-cased-rucola-v1`): 0.81.