CIS5190abcd
/

svm

yitingliii commited on Dec 13, 2024

Commit

7044552

verified ·

1 Parent(s): b98218f

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -20,14 +20,7 @@ import pandas as pd
 from sklearn.svm import SVC
 ```
-2. File Description
-- config.json: Configuration file for model and dataset parameters.
-- ml.py: Python script containing the machine learning pipeline.
-- model.pkl: Trained SVM model saved as a pickle file.
-- tfidf.pkl: TF-IDF vectorizer saved as a pickle file.
-- README.md: Documentation for the repository.
-3. Data Cleaning
 <br> The clean() function performs data preprocessing to prepare the input data for training. This includes:
 - Removing HTML tags using BeautifulSoup.
 - Removing non-alphanumeric characters and extra spaces.
@@ -37,10 +30,10 @@ from sklearn.svm import SVC
 ```python
-from clean_data import clean
 # Load your data
-df = pd.read_csv('your_dataset.csv')
 # Clean the data
 cleaned_df = clean(df)

 from sklearn.svm import SVC
 ```
+2. Data Cleaning
 <br> The clean() function performs data preprocessing to prepare the input data for training. This includes:
 - Removing HTML tags using BeautifulSoup.
 - Removing non-alphanumeric characters and extra spaces.
 ```python
+from data_cleaning import clean
 # Load your data
+df = pd.read_csv('test_data_random_subset.csv')
 # Clean the data
 cleaned_df = clean(df)