Thomas Lemberger
commited on
Commit
•
fb6b7e9
1
Parent(s):
10b1efd
card updated
Browse files
README.md
CHANGED
@@ -51,9 +51,9 @@ The training was run on a NVIDIA DGX Station with 4XTesla V100 GPUs.
|
|
51 |
|
52 |
Training code is available at https://github.com/source-data/soda-roberta
|
53 |
|
54 |
-
- Command: `python -m tokcl.train
|
55 |
- Tokenizer vocab size: 50265
|
56 |
-
- Training data: EMBO/
|
57 |
- Training with 31410 examples.
|
58 |
- Evaluating on 8861 examples.
|
59 |
- Training on 15 features: O, I-SMALL_MOLECULE, B-SMALL_MOLECULE, I-GENEPROD, B-GENEPROD, I-SUBCELLULAR, B-SUBCELLULAR, I-CELL, B-CELL, I-TISSUE, B-TISSUE, I-ORGANISM, B-ORGANISM, I-EXP_ASSAY, B-EXP_ASSAY
|
|
|
51 |
|
52 |
Training code is available at https://github.com/source-data/soda-roberta
|
53 |
|
54 |
+
- Command: `python -m tokcl.train NER --num_train_epochs=3.5`
|
55 |
- Tokenizer vocab size: 50265
|
56 |
+
- Training data: EMBO/sd-nlp NER
|
57 |
- Training with 31410 examples.
|
58 |
- Evaluating on 8861 examples.
|
59 |
- Training on 15 features: O, I-SMALL_MOLECULE, B-SMALL_MOLECULE, I-GENEPROD, B-GENEPROD, I-SUBCELLULAR, B-SUBCELLULAR, I-CELL, B-CELL, I-TISSUE, B-TISSUE, I-ORGANISM, B-ORGANISM, I-EXP_ASSAY, B-EXP_ASSAY
|