BramVanroy/xlm-roberta-base-hebban-reviews

Task: Text Classification (sentiment-analysis)

Dataset

dataset_name: BramVanroy/hebban-reviews
dataset_config: filtered_sentiment
dataset_revision: 2.0.0
labelcolumn: review_sentiment
textcolumn: review_text_without_quotes

Training

optim: adamw_hf
learning_rate: 5e-05
per_device_train_batch_size: 64
per_device_eval_batch_size: 64
gradient_accumulation_steps: 1
max_steps: 5001
save_steps: 500
metric_for_best_model: qwk
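As a rough reading of these settings, the number of examples consumed per optimizer step is the per-device batch size times the number of devices times the gradient accumulation steps. The sketch below assumes that both GPUs listed under Environment took part in data-parallel training, which the card does not state, and that checkpoints fall on every multiple of save_steps. The helper names are hypothetical.

```python
# Values copied from the Training section of this card.
PER_DEVICE_TRAIN_BATCH_SIZE = 64
GRADIENT_ACCUMULATION_STEPS = 1
MAX_STEPS = 5001
SAVE_STEPS = 500

# Assumption: both GPUs from the Environment section were used for
# data-parallel training. The card does not say so explicitly.
ASSUMED_NUM_DEVICES = 2


def effective_batch_size(per_device, num_devices, grad_accum):
    """Examples consumed per optimizer step."""
    return per_device * num_devices * grad_accum


def examples_seen(steps, batch_size):
    """Examples consumed after `steps` optimizer steps (repeats across epochs included)."""
    return steps * batch_size


batch = effective_batch_size(
    PER_DEVICE_TRAIN_BATCH_SIZE, ASSUMED_NUM_DEVICES, GRADIENT_ACCUMULATION_STEPS
)

# Assumption: a checkpoint is written at every multiple of SAVE_STEPS.
checkpoint_steps = list(range(SAVE_STEPS, MAX_STEPS + 1, SAVE_STEPS))

print("effective batch size:", batch)
print("examples seen at max_steps:", examples_seen(MAX_STEPS, batch))
print("checkpoint steps:", checkpoint_steps)
```

Under the single-GPU alternative, set ASSUMED_NUM_DEVICES to 1 and the example counts halve.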

Best checkpoint based on validation

best_metric: 0.741533273748008
best_model_checkpoint: trained/hebban-reviews/xlm-roberta-base/checkpoint-2000
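The metric name qwk most likely stands for quadratic weighted kappa, that is, Cohen's kappa with quadratic disagreement weights, which suits ordered sentiment labels. The card does not spell this out, so treat that reading as an assumption. Below is a minimal pure-Python sketch of the metric; the function name `quadratic_weighted_kappa` and the labels are made up for illustration and are not taken from the training code.

```python
def quadratic_weighted_kappa(y_true, y_pred, num_classes):
    """Cohen's kappa with quadratic weights for integer labels 0..num_classes-1."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equally long")
    if num_classes < 2:
        raise ValueError("need at least two classes")

    n = len(y_true)
    k = num_classes

    # observed[i][j] counts items with gold label i and predicted label j.
    observed = [[0] * k for _ in range(k)]
    for t, p in zip(y_true, y_pred):
        observed[t][p] += 1

    true_hist = [sum(row) for row in observed]
    pred_hist = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            # Quadratic weight: 0 on the diagonal, 1 for the two extreme classes.
            weight = (i - j) ** 2 / (k - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count under independence of gold and predicted labels.
            weighted_expected += weight * true_hist[i] * pred_hist[j] / n

    if weighted_expected == 0:
        raise ValueError("kappa is undefined when all labels and predictions share one class")
    return 1.0 - weighted_observed / weighted_expected


# Made-up labels: 0, 1, 2 stand for three ordered sentiment classes.
gold = [0, 0, 1, 1, 2, 2]
near_miss = [0, 1, 1, 1, 2, 2]  # one error, off by one class
far_miss = [0, 2, 1, 1, 2, 2]   # one error, off by two classes

print(quadratic_weighted_kappa(gold, gold, 3))
print(quadratic_weighted_kappa(gold, near_miss, 3))
print(quadratic_weighted_kappa(gold, far_miss, 3))
```

Both erroneous predictions have the same accuracy, but the off-by-two error lowers the score more, which is the usual reason to select checkpoints on a weighted kappa rather than on accuracy when labels are ordered.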

Test results of best checkpoint

accuracy: 0.8094674556213017
f1: 0.812677483587223
precision: 0.8173602585519025
qwk: 0.7369243423166991
recall: 0.8094674556213017
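Accuracy and recall are identical in these results. That is what one would expect if precision, recall and f1 were averaged over classes with support weights, because support-weighted recall reduces algebraically to accuracy. The card does not state the averaging mode, so this is an inference rather than a documented fact. A small sketch with hypothetical helper names and made-up labels:

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of items whose prediction equals the gold label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_recall(y_true, y_pred):
    """Map each gold class to (recall, support)."""
    support = Counter(y_true)
    true_pos = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {c: (true_pos[c] / support[c], support[c]) for c in support}


def weighted_recall(y_true, y_pred):
    """Per-class recall averaged with weights proportional to class support."""
    n = len(y_true)
    return sum(r * s / n for r, s in per_class_recall(y_true, y_pred).values())


def macro_recall(y_true, y_pred):
    """Unweighted mean of per-class recall."""
    recalls = [r for r, _ in per_class_recall(y_true, y_pred).values()]
    return sum(recalls) / len(recalls)


# Made-up, imbalanced labels for three classes.
y_true = [0, 0, 0, 0, 1, 1, 2, 2, 2, 2]
y_pred = [0, 0, 0, 1, 1, 2, 2, 2, 2, 2]

print("accuracy:", accuracy(y_true, y_pred))
print("weighted recall:", weighted_recall(y_true, y_pred))
print("macro recall:", macro_recall(y_true, y_pred))
```

Weighted recall matches accuracy up to floating-point rounding, while macro recall differs as soon as the classes are imbalanced and recall varies by class.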

[Figure: confusion matrix]

[Figure: normalized confusion matrix]

Environment

cuda_capabilities: 8.0; 8.0
cuda_device_count: 2
cuda_devices: NVIDIA A100-SXM4-80GB; NVIDIA A100-SXM4-80GB
finetuner_commit: 66294c815326c93682003119534cb72009f558c2
platform: Linux-4.18.0-305.49.1.el8_4.x86_64-x86_64-with-glibc2.28
python_version: 3.9.5
torch_version: 1.10.0
transformers_version: 4.21.0

Safetensors

Model size: 278M params
Tensor type: I64, F32

Evaluation results

Self-reported results on the test set of BramVanroy/hebban-reviews (filtered_sentiment, 2.0.0):

accuracy: 0.809
f1: 0.813
precision: 0.817
qwk: 0.737
recall: 0.809
