Spaces:
Running
Running
Naive Bayes Text Classification with Pre-trained Model | |
This project demonstrates how to use a pre-trained Naive Bayes model and vectorizer for text classification. It includes data preprocessing, text vectorization, and evaluation of the model's accuracy on a given dataset. | |
Prerequisites | |
Make sure you have the following installed: | |
Python 3.7 or later | |
Required Python libraries: | |
pandas | |
nltk | |
scikit-learn | |
joblib | |
To install the necessary libraries, run: pip install pandas scikit-learn nltk joblib | |
The input data should be a CSV file (data.csv) located in the ./data directory. The file must include the following columns: | |
title: The text data to classify. | |
news: The target label, where fox will be encoded as 1 and all other values as 0. | |
Place your dataset in a CSV file named data.csv under the ./data directory. | |
Ensure it has the required columns (title and news). | |
open the jupyternotebook and run the Prediction section in beginning, the model will predict and compare the result with true answer, and accuracy score is printed. |