datasets: | |
- wikimedia/wikipedia | |
- nthngdy/oscar-small | |
language: | |
- pl | |
base_model: | |
- distilbert/distilgpt2 | |
license: apache-2.0 | |
distilgpt2 with new tokenizer, trained from scratch with polish datasets. | |
Needs more training, however it's able to generate correct polish sentences. |