Spaces:

nabeel857
/

SMS_Spam_Classifier

Sleeping

App Files Files Community

nabeel857 commited on Dec 14, 2024

Commit

d9cedb1

verified ·

1 Parent(s): 898b709

Upload 5 files

Browse files

This app helps you easily classify SMS messages as spam or not, using machine learning. Whether it's promotional offers or suspicious links, it determines if your message is safe or unwanted.
➤Accurate Spam Detection: Classifies messages into "spam" or "not spam" (ham) with high accuracy.
➤Simple and User-Friendly: Just enter the message, hit "Predict," and get results instantly.
➤Fast Processing: Quickly analyzes and classifies any SMS or message in real-time.
➤Machine Learning Powered: Uses a trained model to ensure reliable predictions based on patterns in text.
➤Easy to Use: No technical knowledge required; the app does all the work.
➤Safe and Secure: Helps you avoid phishing or scam messages by detecting spam automatically.

Files changed (5) hide show

app.py +70 -0
kingthings_exeter.ttf +0 -0
model.pkl +3 -0
requirements.txt +5 -0
vectorizer.pkl +3 -0

app.py ADDED Viewed

	@@ -0,0 +1,70 @@

+import streamlit as st
+import pickle
+import string
+import nltk
+from nltk.corpus import stopwords
+from nltk.stem.porter import PorterStemmer
+nltk.download('stopwords')
+nltk.download('punkt')
+# Initialize the PorterStemmer
+ps = PorterStemmer()
+# Load models and resources
+def load_resources():
+    tfidf = pickle.load(open('vectorizer.pkl', 'rb'))
+    model = pickle.load(open('model.pkl', 'rb'))
+    return tfidf, model
+# Text preprocessing function
+def transform_text(text):
+    text = text.lower()
+    tokens = nltk.word_tokenize(text)
+    # Remove non-alphanumeric tokens and stopwords, and apply stemming
+    filtered_tokens = [ps.stem(word) for word in tokens if word.isalnum() and word not in stopwords.words('english')]
+    return " ".join(filtered_tokens)
+# Predict whether a message is spam or not
+def predict_spam(input_text, tfidf, model):
+    transformed_text = transform_text(input_text)
+    vector_input = tfidf.transform([transformed_text])
+    result = model.predict(vector_input)[0]
+    return result
+# Display result in Streamlit
+def display_prediction(result):
+    if result == "spam":
+        st.success("This is spam 🚫")
+    elif result == "ham":
+        st.success("This is not spam 👍")
+# Main Streamlit app function
+def main():
+    # Load resources
+    tfidf, model = load_resources()
+    # Set the app title
+    st.title("Email/SMS Spam Classifier")
+    # Input text area for user message
+    input_sms = st.text_area("Enter your message here:")
+    # Placeholder for prediction result
+    prediction_placeholder = st.empty()
+    # Predict button
+    if st.button('Predict'):
+        if input_sms.strip() == "":
+            prediction_placeholder.markdown(
+                "<h3 style='color: #f24b4b; font-size: 1.75rem;'>Please enter a message first ⚠️</h3>",
+                unsafe_allow_html=True)
+        else:
+            result = predict_spam(input_sms, tfidf, model)
+            with prediction_placeholder:
+                display_prediction(result)
+# Run the app
+if __name__ == "__main__":
+    main()

kingthings_exeter.ttf ADDED Viewed

Binary file (10.5 kB). View file

model.pkl ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f90b4ceea8d5f6396e8450dda1a8466064bec9f09e058d049e812d87ab8771ec
+size 96617

requirements.txt ADDED Viewed

	@@ -0,0 +1,5 @@

+pandas==2.2.2
+numpy==1.26.4
+streamlit==1.37.1
+scikit-learn==1.5.1
+nltk==3.8.1

vectorizer.pkl ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:093c46197d8e1cb29f987096a82f913d7459feeb0e62df74cec08c7d4d6e7883
+size 94142