MassMin committed
Commit 4a5c6ae · verified · 1 Parent(s): 2eb8adf

Update README.md

Files changed (1)
  1. README.md +31 -39
README.md CHANGED
@@ -117,45 +117,37 @@ The model's performance is evaluated using the F1 score for NER. The predictions
 
 [More Information Needed]
 
-## Evaluation
-'''python
-import torch
-
-from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline
-
-import pandas as pd
-
-
-model_checkpoint = "MassMin/xlm-roberta-base-finetuned-panx-de"  # Replace with your Hugging Face model name
-
-device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
-
-
-tokenizer = AutoTokenizer.from_pretrained(model_checkpoint)
-
-model = AutoModelForTokenClassification.from_pretrained(model_checkpoint).to(device)
-
-
-ner_pipeline = pipeline("ner", model=model, tokenizer=tokenizer, framework="pt", device=0 if torch.cuda.is_available() else -1)
-
-
-def tag_text_with_pipeline(text, ner_pipeline):
-
-    # Use the NER pipeline to get predictions
-    results = ner_pipeline(text)
-
-    # Convert results to a DataFrame for easy viewing
-    df = pd.DataFrame(results)
-    df = df[['word', 'entity', 'score']]
-    df.columns = ['Tokens', 'Tags', 'Score']  # Rename columns for clarity
-    return df
-
-
-text = "Jeff Dean works at Google in California."
-
-result = tag_text_with_pipeline(text, ner_pipeline)
-
-print(result)
+
+## Evaluation
+
+```python
+import torch
+from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline
+import pandas as pd
+
+model_checkpoint = "MassMin/xlm-roberta-base-finetuned-panx-de"  # Replace with your Hugging Face model name
+
+device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+
+tokenizer = AutoTokenizer.from_pretrained(model_checkpoint)
+model = AutoModelForTokenClassification.from_pretrained(model_checkpoint).to(device)
+
+ner_pipeline = pipeline("ner", model=model, tokenizer=tokenizer, framework="pt", device=0 if torch.cuda.is_available() else -1)
+
+def tag_text_with_pipeline(text, ner_pipeline):
+    # Use the NER pipeline to get predictions
+    results = ner_pipeline(text)
+
+    # Convert results to a DataFrame for easy viewing
+    df = pd.DataFrame(results)
+    df = df[['word', 'entity', 'score']]
+    df.columns = ['Tokens', 'Tags', 'Score']  # Rename columns for clarity
+    return df
+
+text = "Jeff Dean works at Google in California."
+result = tag_text_with_pipeline(text, ner_pipeline)
+print(result)
+
 
 
 <!-- This section describes the evaluation protocols and provides the results. -->
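
The README snippet added in this commit returns one prediction per subword token. As a usage note that is not part of the commit, the sketch below shows the same `transformers` token-classification pipeline with `aggregation_strategy="simple"`, which merges subword pieces into whole entity spans; it assumes the same checkpoint name used in the README.

```python
# Minimal usage sketch (not part of this commit), assuming the
# "MassMin/xlm-roberta-base-finetuned-panx-de" checkpoint from the README.
import torch
from transformers import pipeline

# aggregation_strategy="simple" merges subword pieces into whole entities,
# so the output has one dict per entity span instead of per token.
ner = pipeline(
    "ner",
    model="MassMin/xlm-roberta-base-finetuned-panx-de",
    aggregation_strategy="simple",
    device=0 if torch.cuda.is_available() else -1,
)

for entity in ner("Jeff Dean works at Google in California."):
    # Each dict carries 'entity_group', 'word', 'score', 'start', 'end'.
    print(entity["entity_group"], entity["word"], f"{entity['score']:.3f}")
```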