Fine-tuned Flair Model on CO-Fun NER Dataset

This Flair model was fine-tuned on the CO-Fun NER Dataset using GBERT Base as backbone LM.

Dataset

The Company Outsourcing in Fund Prospectuses (CO-Fun) dataset consists of 948 sentences with 5,969 named entity annotations, including 2,340 Outsourced Services, 2,024 Companies, 1,594 Locations and 11 Software annotations.

Overall, the following named entities are annotated:

  • Auslagerung (engl. outsourcing)
  • Unternehmen (engl. company)
  • Ort (engl. location)
  • Software

Fine-Tuning

The latest Flair version is used for fine-tuning.

A hyper-parameter search over the following parameters with 5 different seeds per configuration is performed:

  • Batch Sizes: [8, 16]
  • Learning Rates: [3e-05, 5e-05]

More details can be found in this repository. All models are fine-tuned on a Hetzner GEX44 with an NVIDIA RTX 4000.

Results

A hyper-parameter search with 5 different seeds per configuration is performed and micro F1-score on development set is reported:

Configuration Seed 1 Seed 2 Seed 3 Seed 4 Seed 5 Average
bs8-e10-lr5e-05 0.9477 0.935 0.9517 0.9443 0.9342 0.9426 ± 0.0077
bs16-e10-lr5e-05 0.9214 0.9364 0.9334 0.9489 0.9257 0.9332 ± 0.0106
bs8-e10-lr3e-05 0.928 0.9248 0.9421 0.9295 0.9263 0.9301 ± 0.0069
bs16-e10-lr3e-05 0.918 0.9256 0.9331 0.9273 0.9196 0.9247 ± 0.0061

The result in bold shows the performance of the current viewed model.

Additionally, the Flair training log and TensorBoard logs are also uploaded to the model hub.

Downloads last month
16
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for stefan-it/flair-co-funer-gbert_base-bs8-e10-lr5e-05-1

Base model

deepset/gbert-base
Finetuned
(47)
this model

Dataset used to train stefan-it/flair-co-funer-gbert_base-bs8-e10-lr5e-05-1