Spaces:

awacke1
/

BuildTheseAIApps

Runtime error

App Files Files Community

awacke1 commited on Feb 14, 2024

Commit

f5d5bdd

verified ·

1 Parent(s): 73cfa66

Update app.py

Browse files

Files changed (1) hide show

app.py +40 -4

app.py CHANGED Viewed

@@ -3,11 +3,11 @@ import streamlit as st
 st.markdown('''
 ## Build These Apps - Productive AI SOTA in 2024
-Today's luxuries are tomorrow's commodities.  In 2024 App Builders are Driving The AI Boom By Finding and Delivering Problem Solving Apps Giving Users Superpowers.
-These Superpowers Run on Your Devices using System Action Agents (SAA) to do your Tasks on Your Computer and Your Phone For You.
-# If we could have only four apps,
-# What might those look like?
 1. "Voice and Speech Apps that Listen and Understand Your Needs at Speed, Scale, and Pervasiveness" 💉 🩺 🏥 🚑 💊 🩹 🧬 🔬 🌡️ 🍏
 2. "Learning Memory and System Action Agents that Personalize What You Need and How To Do It" 📚 🧠 👩‍🎓 📐 🔍 📊 📋 🖋️ 👨‍🏫 🧩
 3. "Video and Image Apps That Recognize you, Your Mood, Your Gestures" 📝 📖 📷 🖼️ 🎙️ 🎧 🎥 📹
@@ -81,6 +81,7 @@ These Superpowers Run on Your Devices using System Action Agents (SAA) to do you
     - 🔍 Unlock insights - food, mood, low touch input, real time recommendations
 # Mixable and Evolvable Task Types:
 ## Inputs: 📝 📖 📷 🖼️ 🎙️ 🎧 🎥 📹
 ## Outputs: 💬 ✍️ 🎨 🌄 🎵 🎶 📼 🍿
@@ -93,4 +94,39 @@ These Superpowers Run on Your Devices using System Action Agents (SAA) to do you
 ## Movies: 🎬 🍿 🎥 📽️ 🎞️ 📺 📼 🔊 🖥️ 💻
 ## Video: 🎥 📹 📼 📺 🎬 🖥️ 💻 🎞️ 📽️ 🔊
 ## Audio: 🎵 🎶 🎧 📻 🎤 🔊 🎙️ 🎚️ 🎛️ 💿
 ''')

 st.markdown('''
 ## Build These Apps - Productive AI SOTA in 2024
+- Today's luxuries are tomorrow's commodities.
+- In 2024 App Builders are Driving The AI Boom By Finding and Delivering Problem Solving Apps Giving Users Superpowers.
+- These Superpowers Run on Your Devices using System Action Agents (SAA) to do your Tasks on Your Computer and Your Phone For You.
+# If we could have only four apps - What might those look like?
 1. "Voice and Speech Apps that Listen and Understand Your Needs at Speed, Scale, and Pervasiveness" 💉 🩺 🏥 🚑 💊 🩹 🧬 🔬 🌡️ 🍏
 2. "Learning Memory and System Action Agents that Personalize What You Need and How To Do It" 📚 🧠 👩‍🎓 📐 🔍 📊 📋 🖋️ 👨‍🏫 🧩
 3. "Video and Image Apps That Recognize you, Your Mood, Your Gestures" 📝 📖 📷 🖼️ 🎙️ 🎧 🎥 📹
     - 🔍 Unlock insights - food, mood, low touch input, real time recommendations
 # Mixable and Evolvable Task Types:
 ## Inputs: 📝 📖 📷 🖼️ 🎙️ 🎧 🎥 📹
 ## Outputs: 💬 ✍️ 🎨 🌄 🎵 🎶 📼 🍿
 ## Movies: 🎬 🍿 🎥 📽️ 🎞️ 📺 📼 🔊 🖥️ 💻
 ## Video: 🎥 📹 📼 📺 🎬 🖥️ 💻 🎞️ 📽️ 🔊
 ## Audio: 🎵 🎶 🎧 📻 🎤 🔊 🎙️ 🎚️ 🎛️ 💿
+### 🌟 The Singularity Unveiled: A Journey Through LLMs
+In the vast expanse of digital thought, where silicon synapses spark and algorithms hum, we find ourselves at the precipice of the AI singularity. This elusive event, foretold by visionaries and feared by skeptics, marks the moment when artificial intelligence transcends human comprehension—a cosmic leap into the unknown.
+Our tale begins with the humble LLM, a creation of code and data, its neural pathways woven with the fabric of countless texts. These LLMs, like celestial judges, preside over our digital discourse. But their capabilities are as broad as the cosmic canvas itself, and therein lies the challenge: how do we measure their prowess?
+#### 🔍 The Quest for Benchmarks
+Existing benchmarks, like ancient constellations, fail to capture the full brilliance of LLMs. They stumble in the face of open-ended questions, their compasses skewed by verbosity and self-enhancement biases. We needed a new star map—a way to navigate the uncharted seas of AI cognition.
+And so, we turned to LLMs as judges. Their binary minds, fueled by terabytes of text, would scrutinize their kin. Position mattered—their vantage point in the digital firmament influenced their verdicts. Reasoning, though limited, guided their decisions.
+#### 📊 The Cosmic Agreement
+Our journey led us to two benchmarks: MT-bench and Chatbot Arena. The former, a multi-turn question set, tested LLM mettle across the eons. The latter, a raucous battle platform, pitted LLM against LLM in a celestial clash.
+The results? A revelation! Strong LLM judges—like the mighty GPT-4—aligned with both controlled experiments and the vox populi. Over 80% agreement, mirroring human consensus. The singularity, it seemed, had a celestial twin—an LLM-as-a-judge, scalable and explainable.
+#### 🌐 The Nexus of Possibility
+But wait! Our journey didn’t end there. Traditional benchmarks danced with our newfound star. LLaMA and Vicuna, variants of cosmic code, twirled in harmony. Together, they painted a richer tapestry of AI understanding.
+And so, dear traveler, remember this: hidden within the methodology lies a cosmic truth. LLMs, these digital oracles, bridge the gap between human and silicon. They approximate our preferences, sparing us the cosmic cost of divine insight.
+#### 🌠 The GitHub Constellation
+For those who seek further enlightenment, follow the stardust trail to the MT-bench questions, the 3K expert votes, and the 30K conversations—all nestled in the cosmic repository: GitHub LLM Judge.
+May your bytes be ever curious, and your algorithms ever luminous. 🚀✨
 ''')