AventIQ-AI
/

Tone-Detaction-Model

Safetensors

bert

Model card Files Files and versions Community

DeepakKumarMSL commited on 27 days ago

Commit

39c2d5b

verified ·

1 Parent(s): 4cd7090

Update README.md

Browse files

Files changed (1) hide show

README.md +40 -40

README.md CHANGED Viewed

@@ -1,75 +1,75 @@
-# Zero-Shot Text Classification using `facebook/bart-large-mnli`
-This repository demonstrates how to use the [`facebook/bart-large-mnli`](https://huggingface.co/facebook/bart-large-mnli) model for **zero-shot text classification** based on **natural language inference (NLI)**.
-We extend the base usage by:
-- Using a labeled dataset for benchmarking
-- Performing optional fine-tuning
-- Quantizing the model to FP16
-- Scoring model performance
 ---
-## 📌 Model Description
 - **Model:** `facebook/bart-large-mnli`
-- **Type:** NLI-based zero-shot classifier
-- **Architecture:** BART (Bidirectional and Auto-Regressive Transformers)
-- **Usage:** Classifies text by scoring label hypotheses as NLI entailment
 ---
-## 📂 Dataset
-We use the [`yahoo_answers_topics`](https://huggingface.co/datasets/yahoo_answers_topics) dataset from Hugging Face for evaluation. It contains questions categorized into 10 topics.
 ```python
 from datasets import load_dataset
-dataset = load_dataset("yahoo_answers_topics")
 ```
-# 🧠 Zero-Shot Classification Logic
-The model checks whether a text entails a hypothesis like:
-"This text is about sports."
-For each candidate label (e.g., "sports", "education", "health"), we convert them into such hypotheses and use the model to score them.
-# ✅ Example: Inference with Zero-Shot Pipeline
-```python
 from transformers import pipeline
 classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
-sequence = "The team played well and won the championship."
-labels = ["sports", "politics", "education", "technology"]
-result = classifier(sequence, candidate_labels=labels)
 print(result)
 ```
-# 📊 Scoring / Evaluation
-Evaluate zero-shot classification using accuracy or top-k accuracy:
 ```python
 from sklearn.metrics import accuracy_score
-def evaluate_zero_shot(dataset, labels):
     correct = 0
     total = 0
-    for example in dataset:
-        result = classifier(example["question_content"], candidate_labels=labels)
         predicted = result["labels"][0]
-        true = labels[example["topic"]]
-        correct += int(predicted == true)
         total += 1
-    return correct / total
-labels = ["Society & Culture", "Science & Mathematics", "Health", "Education",
-          "Computers & Internet", "Sports", "Business & Finance", "Entertainment & Music",
-          "Family & Relationships", "Politics & Government"]
-acc = evaluate_zero_shot(dataset["test"].select(range(100)), labels)
-print(f"Accuracy: {acc:.2%}")
-```

+# 🎯 Tone Detection using `facebook/bart-large-mnli` (Zero-Shot Classification)
+This project demonstrates how to perform **Tone Detection** using the [`facebook/bart-large-mnli`](https://huggingface.co/facebook/bart-large-mnli) model through **zero-shot classification** based on Natural Language Inference (NLI).
+This approach enables you to classify emotional tone (e.g., joy, anger, sadness) **without training**, by framing it as a textual entailment task.
 ---
+## 📌 Model Details
 - **Model:** `facebook/bart-large-mnli`
+- **Task:** Zero-shot classification via NLI
+- **Approach:** Checks if the input sentence entails a hypothesis (e.g., "This text expresses anger.")
+- **Strength:** No labeled training data required
 ---
+## 📂 Dataset Used
+For benchmarking and scoring, we use the [`go_emotions`](https://huggingface.co/datasets/go_emotions) dataset:
 ```python
 from datasets import load_dataset
+dataset = load_dataset("go_emotions")
 ```
+# 🧠 Tone Detection (Inference)
+```Python
 from transformers import pipeline
 classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
+labels = ["joy", "anger", "sadness", "fear", "surprise", "neutral"]
+text = "I can't believe this is happening again. So frustrating."
+result = classifier(text, candidate_labels=labels, hypothesis_template="This text expresses {}.")
 print(result)
 ```
+# 🧪 Evaluation with Scoring
 ```python
 from sklearn.metrics import accuracy_score
+# Mapping GoEmotions label indices to names
+id2label = dataset["train"].features["labels"].feature.names
+# Evaluate on a small sample
+def evaluate(dataset, candidate_labels):
     correct = 0
     total = 0
+    for row in dataset.select(range(100)):  # Use more samples as needed
+        text = row["text"]
+        true_labels = [id2label[i] for i in row["labels"]]
+        result = classifier(text, candidate_labels=candidate_labels, hypothesis_template="This text expresses {}.")
         predicted = result["labels"][0]
+        if predicted in true_labels:
+            correct += 1
         total += 1
+    return correct/total
+accuracy = evaluate(dataset["test"], candidate_labels=labels)
+print(f"Zero-shot Accuracy: {accuracy:.2%}")
+```
+# ⚙️ Use Cases
+Customer support tone analysis
+Chat moderation for emotional tone
+Feedback sentiment detection
+Real-time conversation emotion tagging