mlenjoyneer
/

rut5_large_sum_gazeta

Text2Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

rut5_large_sum_gazeta / README.md

mlenjoyneer's picture

Upload 6 files

e107cff over 1 year ago

|

history blame contribute delete

1.67 kB

	---
	language: ru
	license: apache-2.0
	datasets:
	- IlyaGusev/gazeta
	---

	# RuT5LargeSumGazeta

	## Model description

	This is the model for abstractive summarization for Russian based on ai-forever/ruT5-large.

	## Intended uses & limitations

	### How to use

	Here is how to use this model in PyTorch:

	```python
	from transformers import AutoTokenizer, T5ForConditionalGeneration

	model_name = "mlenjoyneer/rut5_large_sum_gazeta"
	tokenizer = AutoTokenizer.from_pretrained(model_name)
	model = T5ForConditionalGeneration.from_pretrained(model_name)

	article_text = "..."

	input_ids = tokenizer(
	[article_text],
	max_length=600,
	add_special_tokens=True,
	padding="max_length",
	truncation=True,
	return_tensors="pt"
	)["input_ids"]

	output_ids = model.generate(
	input_ids=input_ids,
	no_repeat_ngram_size=4
	)[0]

	summary = tokenizer.decode(output_ids, skip_special_tokens=True)
	print(summary)
	```

	## Training data

	- Dataset: [Gazeta](https://huggingface.co/datasets/IlyaGusev/gazeta)

	## Evaluation results

	\| Model \| R-1-f \| R-2-f \| R-L-f \| chrF \| BLEU \| Avg char length \|
	\| -------------------------------------------- \| ----- \| ----- \| ----- \| ---- \| ---- \| --------------- \|
	\| IlyaGusev/mbart_ru_sum_gazeta \| 28.7 \| 11.1 \| 24.4 \| 37.3 \| 9.4 \| 373 \|
	\| IlyaGusev/rut5_base_sum_gazeta \| 28.6 \| 11.1 \| 24.5 \| 37.2 \| 9.4 \| 331 \|
	\| IlyaGusev/rugpt3medium_sum_gazeta \| 24.1 \| 6.5 \| 19.8 \| 32.1 \| 3.6 \| 242 \|
	\| rut5-large_sum_gazeta \| 29.6 \| 11.7 \| 25.2 \| 37.3 \| 9.4 \| 304 \|