baseline_model / README.me
chh042's picture
Upload 6 files
aef755d verified
Naive Bayes Text Classification with Pre-trained Model
This project demonstrates how to use a pre-trained Naive Bayes model and vectorizer for text classification. It includes data preprocessing, text vectorization, and evaluation of the model's accuracy on a given dataset.
Prerequisites
Make sure you have the following installed:
Python 3.7 or later
Required Python libraries:
pandas
nltk
scikit-learn
joblib
To install the necessary libraries, run: pip install pandas scikit-learn nltk joblib
The input data should be a CSV file (data.csv) located in the ./data directory. The file must include the following columns:
title: The text data to classify.
news: The target label, where fox will be encoded as 1 and all other values as 0.
Place your dataset in a CSV file named data.csv under the ./data directory.
Ensure it has the required columns (title and news).
open the jupyternotebook and run the Prediction section in beginning, the model will predict and compare the result with true answer, and accuracy score is printed.