nickprock's picture
Update README.md
a0c8c08
|
raw
history blame
6.32 kB
metadata
license: mit
tags:
  - generated_from_trainer
datasets:
  - banking77
metrics:
  - accuracy
model-index:
  - name: xlm-roberta-base-banking77-classification
    results:
      - task:
          name: Text Classification
          type: text-classification
        dataset:
          name: banking77
          type: banking77
          config: default
          split: train
          args: default
        metrics:
          - name: Accuracy
            type: accuracy
            value: 0.9321428571428572
widget:
  - text: 'Can I track the card you sent to me? '
    example_title: Card Arrival Example - English
  - text: 'Posso tracciare la carta che mi avete spedito? '
    example_title: Card Arrival Example - Italian
  - text: Can you explain your exchange rate policy to me?
    example_title: Exchange Rate Example - English
  - text: Potete spiegarmi la vostra politica dei tassi di cambio?
    example_title: Exchange Rate Example - Italian
  - text: I can't pay by my credit card
    example_title: Card Not Working Example - English
  - text: Non riesco a pagare con la mia carta di credito
    example_title: Card Not Working Example - Italian

xlm-roberta-base-banking77-classification

This model is a fine-tuned version of xlm-roberta-base on the banking77 dataset. It achieves the following results on the evaluation set:

  • Loss: 0.3034
  • Accuracy: 0.9321
  • F1 Score: 0.9321

Model description

Experiment on a cross-language model to assess how accurate the classification is by using for fine tuning an English dataset but later querying the model in Italian.

Intended uses & limitations

The model can be used on text classification. In particular is fine tuned on banking domain for multilingual task.

Training and evaluation data

The dataset used is banking77

The 77 labels are:

label intent
0 activate_my_card
1 age_limit
2 apple_pay_or_google_pay
3 atm_support
4 automatic_top_up
5 balance_not_updated_after_bank_transfer
6 balance_not_updated_after_cheque_or_cash_deposit
7 beneficiary_not_allowed
8 cancel_transfer
9 card_about_to_expire
10 card_acceptance
11 card_arrival
12 card_delivery_estimate
13 card_linking
14 card_not_working
15 card_payment_fee_charged
16 card_payment_not_recognised
17 card_payment_wrong_exchange_rate
18 card_swallowed
19 cash_withdrawal_charge
20 cash_withdrawal_not_recognised
21 change_pin
22 compromised_card
23 contactless_not_working
24 country_support
25 declined_card_payment
26 declined_cash_withdrawal
27 declined_transfer
28 direct_debit_payment_not_recognised
29 disposable_card_limits
30 edit_personal_details
31 exchange_charge
32 exchange_rate
33 exchange_via_app
34 extra_charge_on_statement
35 failed_transfer
36 fiat_currency_support
37 get_disposable_virtual_card
38 get_physical_card
39 getting_spare_card
40 getting_virtual_card
41 lost_or_stolen_card
42 lost_or_stolen_phone
43 order_physical_card
44 passcode_forgotten
45 pending_card_payment
46 pending_cash_withdrawal
47 pending_top_up
48 pending_transfer
49 pin_blocked
50 receiving_money
51 Refund_not_showing_up
52 request_refund
53 reverted_card_payment?
54 supported_cards_and_currencies
55 terminate_account
56 top_up_by_bank_transfer_charge
57 top_up_by_card_charge
58 top_up_by_cash_or_cheque
59 top_up_failed
60 top_up_limits
61 top_up_reverted
62 topping_up_by_card
63 transaction_charged_twice
64 transfer_fee_charged
65 transfer_into_account
66 transfer_not_received_by_recipient
67 transfer_timing
68 unable_to_verify_identity
69 verify_my_identity
70 verify_source_of_funds
71 verify_top_up
72 virtual_card_not_working
73 visa_or_mastercard
74 why_verify_identity
75 wrong_amount_of_cash_received
76 wrong_exchange_rate_for_cash_withdrawal

Training procedure

from transformers import pipeline
pipe = pipeline("text-classification", model="nickprock/xlm-roberta-base-banking77-classification")
pipe("Non riesco a pagare con la carta di credito")

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 64
  • eval_batch_size: 64
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 20

Training results

Training Loss Epoch Step Validation Loss Accuracy F1 Score
3.8002 1.0 157 2.7771 0.5159 0.4483
2.4006 2.0 314 1.6937 0.7140 0.6720
1.4633 3.0 471 1.0385 0.8308 0.8153
0.9234 4.0 628 0.7008 0.8789 0.8761
0.6163 5.0 785 0.5029 0.9068 0.9063
0.4282 6.0 942 0.4084 0.9123 0.9125
0.3203 7.0 1099 0.3515 0.9253 0.9253
0.245 8.0 1256 0.3295 0.9227 0.9225
0.1863 9.0 1413 0.3092 0.9269 0.9269
0.1518 10.0 1570 0.2901 0.9338 0.9338
0.1179 11.0 1727 0.2938 0.9318 0.9319
0.0969 12.0 1884 0.2906 0.9328 0.9328
0.0805 13.0 2041 0.2963 0.9295 0.9295
0.063 14.0 2198 0.2998 0.9289 0.9288
0.0554 15.0 2355 0.2933 0.9351 0.9349
0.046 16.0 2512 0.2960 0.9328 0.9326
0.04 17.0 2669 0.3032 0.9318 0.9318
0.035 18.0 2826 0.3061 0.9312 0.9312
0.0317 19.0 2983 0.3030 0.9331 0.9330
0.0315 20.0 3140 0.3034 0.9321 0.9321

Framework versions

  • Transformers 4.21.1
  • Pytorch 1.12.1+cu113
  • Datasets 2.4.0
  • Tokenizers 0.12.1