bart-large-cnn-finetuned-paper

This model is a fine-tuned version of facebook/bart-large-cnn on an unspecified dataset (the dataset name is not recorded in this card). It achieves the following results on the evaluation set (a brief usage sketch follows the metrics list):

  • Loss: 6.7509
  • ROUGE-1: 27.2291
  • ROUGE-2: 4.8436
  • ROUGE-L: 19.2576
  • ROUGE-Lsum: 27.0591
  • Gen Len: 513.0
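
A minimal inference sketch, assuming the checkpoint is available on the Hub under the repo id rosadecsai/bart-large-cnn-finetuned-paper shown on this page; the generation settings below are illustrative assumptions, not values taken from the card:

```python
# Hedged example: load the fine-tuned checkpoint and summarize a document.
# num_beams and max_new_tokens are assumptions, not settings reported in this card.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "rosadecsai/bart-large-cnn-finetuned-paper"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

text = "Paste the paper section or document to summarize here."
inputs = tokenizer(text, truncation=True, max_length=1024, return_tensors="pt")
summary_ids = model.generate(**inputs, num_beams=4, max_new_tokens=256)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```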

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a hedged code sketch of these settings follows the list):

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 5
  • mixed_precision_training: Native AMP
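
A minimal sketch of how these values map onto the 🤗 Trainer API; the output directory, evaluation strategy, and predict_with_generate flag are assumptions added for completeness, everything else mirrors the list above:

```python
# Sketch only: reproduces the listed hyperparameters with Seq2SeqTrainingArguments.
# Adam betas/epsilon match the Trainer defaults, so they are not set explicitly.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="bart-large-cnn-finetuned-paper",  # assumed
    learning_rate=5e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    lr_scheduler_type="linear",
    num_train_epochs=5,
    fp16=True,                    # "Native AMP" mixed precision
    predict_with_generate=True,   # assumed; needed for ROUGE / Gen Len metrics
    eval_strategy="epoch",        # assumed from the per-epoch results table
)
```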

Training results

| Training Loss | Epoch | Step | Validation Loss | ROUGE-1 | ROUGE-2 | ROUGE-L | ROUGE-Lsum | Gen Len |
|---------------|-------|------|-----------------|---------|---------|---------|------------|---------|
| 9.1371        | 1.0   | 125  | 6.9805          | 32.8773 | 5.5962  | 18.6551 | 32.6594    | 513.0   |
| 6.9049        | 2.0   | 250  | 6.8309          | 33.3706 | 5.7215  | 18.9109 | 33.0595    | 513.0   |
| 6.7776        | 3.0   | 375  | 6.7827          | 34.4481 | 5.8386  | 18.8797 | 34.0574    | 513.0   |
| 6.7029        | 4.0   | 500  | 6.7592          | 26.9866 | 4.8121  | 19.2095 | 26.8459    | 513.0   |
| 6.6832        | 5.0   | 625  | 6.7509          | 27.2291 | 4.8436  | 19.2576 | 27.0591    | 513.0   |
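
Scores of this kind can be computed with the 🤗 evaluate library's rouge metric; a minimal sketch, where the prediction and reference strings are placeholders rather than data from this training run:

```python
# Hedged sketch of how ROUGE scores like those in the table are obtained.
import evaluate

rouge = evaluate.load("rouge")
predictions = ["the generated summary goes here"]
references = ["the reference summary goes here"]
scores = rouge.compute(predictions=predictions, references=references, use_stemmer=True)
# evaluate returns fractions in [0, 1]; the table reports them scaled by 100.
print({k: round(v * 100, 4) for k, v in scores.items()})
```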

Framework versions

  • Transformers 4.45.2
  • Pytorch 2.5.1+cu121
  • Datasets 3.2.0
  • Tokenizers 0.20.3
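
To check that a local environment matches the versions listed above, a small script along these lines can be used (illustrative only):

```python
# Prints installed versions to compare against the framework versions listed above.
import datasets
import tokenizers
import torch
import transformers

for name, module in [("Transformers", transformers), ("PyTorch", torch),
                     ("Datasets", datasets), ("Tokenizers", tokenizers)]:
    print(f"{name}: {module.__version__}")
```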