Kirill Gelvan committed
Commit 505f6f7
Parent(s): a0f2756
add some descriptions
README.md
CHANGED
---

### Description

DialoGPT trained on the Russian language and fine-tuned on my Telegram chat.

This model was created by [sberbank-ai](https://hf.co/sberbank-ai) and trained on Russian forums (see [Grossmend's model](https://hf.co/Grossmend/rudialogpt3_medium_based_on_gpt2)). You can find information about how it was trained on [habr](https://habr.com/ru/company/icl_services/blog/548244/) (in Russian). I created a **simple pipeline** and **fine-tuned** that model on my own **exported Telegram chat** (~30 MB of JSON). It is in fact very easy to export your data from Telegram and fine-tune a model on it, so I made a **colab tutorial** for it: link
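The training data comes from Telegram's built-in chat export (Settings → Export chat history → JSON), which produces a `result.json` file. As a rough sketch of the preprocessing step — the field names below match Telegram's export format, but the extraction logic is my illustration, not the author's exact pipeline:

```python
import json

def load_telegram_messages(export: dict) -> list[str]:
    """Extract plain-text messages from a Telegram chat export dict."""
    texts = []
    for msg in export.get("messages", []):
        # Telegram stores "text" as a string, or as a list of strings
        # and entity dicts when the message contains formatting/links.
        text = msg.get("text", "")
        if isinstance(text, list):
            text = "".join(
                part if isinstance(part, str) else part.get("text", "")
                for part in text
            )
        if text:
            texts.append(text)
    return texts

# Tiny stand-in for an exported chat (a real export is one large JSON file)
export = {
    "messages": [
        {"text": "привет!"},
        {"text": ["check ", {"type": "link", "text": "this"}]},
        {"text": ""},
    ]
}
print(load_telegram_messages(export))  # → ['привет!', 'check this']
```

The flattened strings can then be paired into context/reply examples for fine-tuning.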
### How to use

```python
def get_length_param(text: str, tokenizer) -> str:
    tokens_count = len(tokenizer.encode(text))
    if tokens_count <= 15: