Update README.md
Browse files
README.md
CHANGED
@@ -36,11 +36,6 @@ tokenizer = AutoTokenizer.from_pretrained("new5558/HoogBERTa")
|
|
36 |
model = AutoModel.from_pretrained("new5558/HoogBERTa")
|
37 |
```
|
38 |
|
39 |
-
To annotate POS, NE, and clause boundary, use the following commands
|
40 |
-
```
|
41 |
-
|
42 |
-
```
|
43 |
-
|
44 |
To extract token features, based on the RoBERTa architecture, use the following commands
|
45 |
|
46 |
```python
|
@@ -87,6 +82,14 @@ with torch.no_grad():
|
|
87 |
features = model(token_ids, output_hidden_states = True).hidden_states[-1] # where token_ids is a tensor with type "long".
|
88 |
```
|
89 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
90 |
|
91 |
## Conversion Code
|
92 |
If you are interested in how to convert Fairseq and subword-nmt Roberta into Huggingface hub here is my code used to do the conversion and test for parity match:
|
|
|
36 |
model = AutoModel.from_pretrained("new5558/HoogBERTa")
|
37 |
```
|
38 |
|
|
|
|
|
|
|
|
|
|
|
39 |
To extract token features, based on the RoBERTa architecture, use the following commands
|
40 |
|
41 |
```python
|
|
|
82 |
features = model(token_ids, output_hidden_states = True).hidden_states[-1] # where token_ids is a tensor with type "long".
|
83 |
```
|
84 |
|
85 |
+
# Huggingface Models
|
86 |
+
1. `HoogBERTaEncoder`
|
87 |
+
- [HoogBERTa](https://huggingface.co/new5558/HoogBERTa): `Feature Extraction` and `Mask Language Modeling`
|
88 |
+
2. `HoogBERTaMuliTaskTagger`:
|
89 |
+
- [HoogBERTa-NER-lst20](https://huggingface.co/new5558/HoogBERTa-NER-lst20): `Named-entity recognition (NER)` based on LST20
|
90 |
+
- [HoogBERTa-POS-lst20](https://huggingface.co/new5558/HoogBERTa-POS-lst20): `Part-of-speech tagging (POS)` based on LST20
|
91 |
+
- [HoogBERTa-SENTENCE-lst20](https://huggingface.co/new5558/HoogBERTa-SENTENCE-lst20): `Clause Boundary Classification` based on LST20
|
92 |
+
|
93 |
|
94 |
## Conversion Code
|
95 |
If you are interested in how to convert Fairseq and subword-nmt Roberta into Huggingface hub here is my code used to do the conversion and test for parity match:
|