Spaces: DexterSptizu/sentence-transformer-visualization

DexterSptizu committed on Nov 4, 2024

Commit 1ffc0e8 (verified)

1 Parent(s): 3fe05b6

Create app.py

Files changed (1)

app.py +111 -0

app.py ADDED

	@@ -0,0 +1,111 @@

+import streamlit as st
+import numpy as np
+from sentence_transformers import SentenceTransformer, util
+# Initialize sentence transformer model
+@st.cache_resource
+def load_model():
+    return SentenceTransformer('all-MiniLM-L6-v2')
+model = load_model()
+def get_embedding_and_similarity(text1, text2):
+    # Get embeddings
+    embedding1 = model.encode(text1, convert_to_tensor=True)
+    embedding2 = model.encode(text2, convert_to_tensor=True)
+    # Calculate cosine similarity
+    similarity = util.cos_sim(embedding1, embedding2).item()
+    return similarity
+st.title("🤗 Interactive Sentence Embeddings Explorer")
+st.markdown("""
+This demo helps you understand how sentence transformers work by comparing text similarities.
+Try different sentences to see how the model captures semantic meaning!
+""")
+# Main comparison section
+st.header("Compare Two Texts")
+col1, col2 = st.columns(2)
+with col1:
+    st.markdown("**First Text**")
+    text1 = st.text_area("Enter first text", height=100,
+                         value="I love programming in Python")
+with col2:
+    st.markdown("**Second Text**")
+    text2 = st.text_area("Enter second text", height=100,
+                         value="Python is my favorite programming language")
+if st.button("Calculate Similarity"):
+    similarity = get_embedding_and_similarity(text1, text2)
+    st.markdown("### Similarity Score")
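+    # Cosine similarity can be negative; st.progress only accepts floats in [0.0, 1.0]
+    st.progress(min(max(similarity, 0.0), 1.0))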
+    st.write(f"Cosine Similarity: {similarity:.4f}")
+    if similarity > 0.8:
+        st.success("These texts are very similar!")
+    elif similarity > 0.5:
+        st.info("These texts are somewhat similar")
+    else:
+        st.warning("These texts are quite different")
+# Interactive examples section
+st.header("Try These Examples")
+st.markdown("Choose an example below to see how the model scores each sentence pair")
+examples = {
+    "Similar Meaning, Different Words": {
+        "text1": "The cat is sleeping on the couch",
+        "text2": "A feline is resting on the sofa"
+    },
+    "Similar Words, Different Meaning": {
+        "text1": "The bank is by the river",
+        "text2": "I need to go to the bank for money"
+    },
+    "Technical Similarity": {
+        "text1": "Python is a programming language",
+        "text2": "Java is used for coding software"
+    },
+    "Opposite Meanings": {
+        "text1": "The stock market is going up",
+        "text2": "The stock market is going down"
+    }
+}
+selected_example = st.selectbox("Choose an example", list(examples.keys()))
+if st.button("Try this example"):
+    example = examples[selected_example]
+    similarity = get_embedding_and_similarity(example["text1"], example["text2"])
+    st.markdown("### Example Texts")
+    st.write("Text 1:", example["text1"])
+    st.write("Text 2:", example["text2"])
+    st.markdown("### Similarity Score")
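+    # Cosine similarity can be negative; st.progress only accepts floats in [0.0, 1.0]
+    st.progress(min(max(similarity, 0.0), 1.0))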
+    st.write(f"Cosine Similarity: {similarity:.4f}")
+# Educational section
+st.header("📚 How It Works")
+st.markdown("""
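+1. **Text to Embeddings**: The model converts each text into a high-dimensional vector (embedding)
+2. **Similarity Calculation**: The cosine similarity between the two embeddings is computed, giving a score between -1 and 1
+3. **Score Interpretation** (rules of thumb used by this demo, not fixed properties of the model):
+   - 1.0: the embeddings point in the same direction (for example, identical texts)
+   - Above 0.8: very similar
+   - Above 0.5: somewhat similar
+   - 0.5 or below: quite different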
+""")
+# Advanced settings
+with st.expander("🔧 Advanced Settings"):
+    st.markdown("""
+    **Current Model**: all-MiniLM-L6-v2
+    - Embedding Size: 384 dimensions
+    - Optimized for semantic similarity tasks
+    - Fast and efficient for real-time applications
+    """)
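
Note on the scoring logic: the sketch below is not part of the commit. It restates, in plain Python with no model download, what the app's "How It Works" section describes: a cosine score between two vectors, the thresholds from the `if`/`elif`/`else` chain, and a clamp into the range that `st.progress` accepts. The helper names `cosine_similarity`, `to_progress`, and `describe` are hypothetical, and the 3-dimensional vectors are toy stand-ins for the 384-dimensional embeddings the model would produce.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def to_progress(score):
    """Clamp a cosine score from [-1, 1] into [0, 1] for a progress bar."""
    return min(max(score, 0.0), 1.0)


def describe(score):
    """Apply the same thresholds as the app's if/elif/else chain."""
    if score > 0.8:
        return "very similar"
    if score > 0.5:
        return "somewhat similar"
    return "quite different"


# Toy vectors (assumed for illustration, not real model output).
same_direction = cosine_similarity([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
orthogonal = cosine_similarity([1.0, 0.0, 0.0], [0.0, 1.0, 0.0])
opposite = cosine_similarity([1.0, 0.0, 0.0], [-1.0, 0.0, 0.0])

for name, score in [("same direction", same_direction),
                    ("orthogonal", orthogonal),
                    ("opposite", opposite)]:
    print(f"{name}: score={score:.4f}, "
          f"progress={to_progress(score):.2f}, label={describe(score)}")
```

The clamp matters because a negative score, or one that floating-point rounding pushes slightly above 1.0, falls outside the range a progress bar expects. The raw score is still the value to print for the reader.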