Update README.md
README.md
@@ -19,9 +19,10 @@ In total, our dataset contains around 5.4 million Indian legal documents (all in

The raw text corpus size is around 27 GB.

### Training Objective
This model is initialized with the [LEGAL-BERT-SC model](https://huggingface.co/nlpaueb/legal-bert-base-uncased) from the paper [LEGAL-BERT: The Muppets straight out of Law School](https://aclanthology.org/2020.findings-emnlp.261/).
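As a rough, hypothetical sketch of what that initialization looks like in 🤗 Transformers (the use of `BertForPreTraining`, i.e. the MLM + NSP heads, is an assumption suggested by the import in the Usage snippet below, not something stated in this excerpt):

```python
from transformers import BertForPreTraining

# Assumed starting point: the public LEGAL-BERT-SC weights.
# BertForPreTraining attaches both the masked-LM and next-sentence-prediction heads,
# so further pre-training would continue from these weights.
model = BertForPreTraining.from_pretrained("nlpaueb/legal-bert-base-uncased")
```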
### Usage
Using the tokenizer (same as LegalBERT):

```python
from transformers import AutoTokenizer, AutoModel, BertForPreTraining
tokenizer = AutoTokenizer.from_pretrained("nlpaueb/legal-bert-base-uncased")
```
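A hedged continuation of the snippet above, showing how the tokenizer's output can be fed to a model. The example sentence is illustrative, and the base LEGAL-BERT-SC checkpoint is used only because this excerpt does not show this model's own Hub id; substitute that id to load this model instead:

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("nlpaueb/legal-bert-base-uncased")
# Placeholder checkpoint: swap in this model's own repository id on the Hub.
model = AutoModel.from_pretrained("nlpaueb/legal-bert-base-uncased")

# Illustrative input sentence; encode it and run a forward pass.
text = "The appellant filed an appeal before the Supreme Court of India."
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # torch.Size([1, sequence_length, 768])
```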