Update README.md
Browse files
README.md
CHANGED
@@ -10,4 +10,71 @@ tags:
|
|
10 |
- code
|
11 |
library_name: transformers
|
12 |
pipeline_tag: text-classification
|
13 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
10 |
- code
|
11 |
library_name: transformers
|
12 |
pipeline_tag: text-classification
|
13 |
+
metrics:
|
14 |
+
- accuracy
|
15 |
+
---
|
16 |
+
datasets:
|
17 |
+
- stanfordnlp/imdb
|
18 |
+
language:
|
19 |
+
- en
|
20 |
+
base_model:
|
21 |
+
- distilbert/distilbert-base-uncased
|
22 |
+
tags:
|
23 |
+
- torch
|
24 |
+
- code
|
25 |
+
library_name: transformers
|
26 |
+
pipeline_tag: text-classification
|
27 |
+
---
|
28 |
+
# Model Card for DistilBERT Fine-Tuned on IMDB Sentiment Analysis
|
29 |
+
|
30 |
+
This model is a fine-tuned version of `DistilBERT` for sentiment analysis on the IMDB movie reviews dataset. It classifies movie reviews into two categories: positive and negative sentiments.
|
31 |
+
|
32 |
+
## Model Details
|
33 |
+
|
34 |
+
### Model Description
|
35 |
+
|
36 |
+
This model has been fine-tuned on the IMDB dataset, which contains movie reviews labeled with sentiments: `positive` or `negative`. The model is based on the `DistilBERT` architecture, which is a lighter, more efficient variant of BERT, offering faster inference without significantly sacrificing accuracy.
|
37 |
+
|
38 |
+
- **Developed by:** [Leonuraht/Scilineo]
|
39 |
+
- **Shared by [optional]:** [Leonuraht/Scilineo]
|
40 |
+
- **Model type:** Transformer-based model for text classification (sentiment analysis)
|
41 |
+
- **Language(s) (NLP):** English
|
42 |
+
- **Finetuned from model [optional]:** distilbert-base-uncased
|
43 |
+
|
44 |
+
|
45 |
+
## Uses
|
46 |
+
|
47 |
+
### Direct Use
|
48 |
+
|
49 |
+
This model is directly usable for sentiment analysis tasks. It predicts the sentiment of text by classifying it as either "positive" or "negative".
|
50 |
+
|
51 |
+
### Downstream Use [optional]
|
52 |
+
|
53 |
+
This model can be further fine-tuned for other text classification tasks or integrated into larger applications where sentiment analysis is required.
|
54 |
+
|
55 |
+
### Out-of-Scope Use
|
56 |
+
|
57 |
+
This model is not intended for multilingual sentiment analysis or for handling text outside of movie reviews. It may not perform well on domains with vastly different vocabularies or sentiment expression styles.
|
58 |
+
|
59 |
+
## Bias, Risks, and Limitations
|
60 |
+
|
61 |
+
The model has been trained on the IMDB movie reviews dataset, and as such, it may exhibit biases inherent in the data (e.g., biases in sentiment based on genre, culture, or language). It is important to be mindful of these limitations when using the model in real-world applications.
|
62 |
+
|
63 |
+
### Recommendations
|
64 |
+
|
65 |
+
Users should be aware of the model's biases and limitations. It is recommended to further fine-tune the model with a diverse dataset if it is to be used in domains beyond movie reviews.
|
66 |
+
|
67 |
+
## How to Get Started with the Model
|
68 |
+
|
69 |
+
To use the model for sentiment analysis, you can load it via the Hugging Face `transformers` library. Here's an example:
|
70 |
+
|
71 |
+
```python
|
72 |
+
from transformers import pipeline
|
73 |
+
|
74 |
+
# Load the fine-tuned model from Hugging Face
|
75 |
+
model = "Leonuraht/IMDBert"
|
76 |
+
classifier = pipeline("sentiment-analysis", model=model)
|
77 |
+
|
78 |
+
# Test the model with a sample text
|
79 |
+
result = classifier("This movie was amazing!")
|
80 |
+
print(result) # Outputs: [{'label': 'POSITIVE' }]
|