mgbam committed 4 days ago

Commit 68a2453 (verified) · 1 Parent(s): 0d6622c

Update README.md

README.md +77 -101

@@ -4,113 +4,89 @@ emoji: 👀
 colorFrom: green
 colorTo: indigo
 sdk: gradio
-sdk_version: 5.34.0
 app_file: app.py
 pinned: false
 short_description: Analytic
 ---
 # 🔥 Odyssey: The AI Data Science Workspace
-![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)
-![Python Version](https://img.shields.io/badge/python-3.9+-indigo.svg)
-![Status](https://img.shields.io/badge/status-beta-green.svg)
-![Built with Gradio](https://img.shields.io/badge/Built%20with-Gradio-orange)
-Odyssey is not just an analytic tool; it's an AI-native, collaborative workspace designed to augment and accelerate the entire data science workflow. It moves beyond reactive profiling to a proactive, guided exploration experience, making you feel like you have a senior data scientist as your co-pilot.
-*(A conceptual image of the Odyssey UI)*
-## ✨ Core Features
-Odyssey is built around four intelligent modules and a project-based workflow, providing a seamless journey from raw data to actionable insight.
-*   🔭 **Helios Overview**: A living, proactive dashboard that automatically runs upon data upload. It doesn't just show you stats; it surfaces critical insights like data quality issues, strong correlations, outlier alerts, and even suggests potential target variables for machine learning.
-*   🧪 **Asclepius Data Lab**: An interactive data preparation environment. Go beyond simple imputation with advanced methods like KNN for numeric data and smart categorical handling. See the impact of your changes instantly with live before-and-after visualizations.
-*   🚀 **Prometheus Launchpad**: A rapid machine learning modeling environment. Select a target and features, and with one click, train a model using robust 5-fold cross-validation. Instantly receive key performance metrics and advanced visualizations like ROC curves and residual plots to assess model viability.
-*   💡 **Athena Co-pilot**: A true AI collaborator. Athena understands the full context of your session—from the original data to the cleaned dataset and the models you've built. Ask it to perform complex analyses, generate plots, or even **build new, dynamic dashboards on the fly** right inside the chat.
-*   🗂️ **Project-Based Workflow**: Save your entire session—including cleaned data, chat history, and insights—into a single `.odyssey` file. Load projects later to pick up exactly where you left off.
-*   📄 **One-Click HTML Reports**: Generate a comprehensive, self-contained HTML report of your entire analysis, perfect for sharing with colleagues or stakeholders.
-## 🚀 Getting Started
-Follow these steps to get Odyssey running on your local machine.
-### Prerequisites
-*   Python 3.9 or higher
-*   `pip` package manager
-*   `git` for cloning the repository
-### 1. Clone the Repository
-Open your terminal and clone the project:
-```bash
-git clone https://github.com/your-username/odyssey-ai-workspace.git
-cd odyssey-ai-workspace
-```
-### 2. Set Up a Virtual Environment
-It is highly recommended to use a virtual environment to manage dependencies and avoid conflicts.
-**On macOS/Linux:**
-```bash
-python3 -m venv venv
-source venv/bin/activate
-```
-**On Windows:**
-```bash
 python -m venv venv
-.\venv\Scripts\activate
-```
-### 3. Install Dependencies
-Install all required packages using the `requirements.txt` file:
-```bash
 pip install -r requirements.txt
-```
-### 4. Set Up Your API Key
-Odyssey's AI features are powered by the Google Gemini API.
-1.  Obtain a free API key from [Google AI Studio](https://aistudio.google.com/).
-2.  When you launch the application, you will see a field labeled "Gemini API Key". Paste your key there to activate the Athena Co-pilot and other AI features.
-### 5. Run the Application
-Launch the Gradio application with the following command:
-```bash
-python odyssey_app.py
-```
-*(Assuming the main script is named `odyssey_app.py`)*
-Open your web browser and navigate to the local URL provided in the terminal (usually `http://127.0.0.1:7860`).
-## 🧭 How to Use Odyssey
-1.  **Start a Project**: Give your project a name and upload a CSV file.
-2.  **Consult Helios**: Once uploaded, the **Helios Overview** will automatically populate with proactive insights. Review these findings to understand your data's strengths and weaknesses.
-3.  **Cleanse in the Lab**: Navigate to the **Asclepius Data Lab**. Use the dropdowns to select columns with missing data and apply imputation methods, previewing the effects in real-time.
-4.  **Launch a Model**: Go to the **Prometheus Launchpad**. Based on the suggestions from Helios, select a target variable and features. Choose a model and click "Launch" to see its predictive potential.
-5.  **Collaborate with Athena**: Open the **Athena Co-pilot**. Ask complex questions, request specific plots, or even ask it to build a custom dashboard (e.g., *"Build me a dashboard showing sales trends by region and product category."*).
-6.  **Save or Export**: Use the "Save" button to create a `.odyssey` file of your session, or click "Export Report" to generate a shareable HTML summary.
-## 🤝 Contributing
-Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are **greatly appreciated**.
-Please feel free to submit a pull request or open an issue for any bugs, feature requests, or suggestions.
-## 📝 License
-This project is licensed under the MIT License. See the `LICENSE` file for more details.
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 colorFrom: green
 colorTo: indigo
 sdk: gradio
+sdk_version: 5.34.1
 app_file: app.py
 pinned: false
 short_description: Analytic
 ---
 # 🔥 Odyssey: The AI Data Science Workspace
+## 🚀 CognitiveEDA: The Adaptive Intelligence Engine
+![Version 4.0](https://img.shields.io/badge/version-4.0-blue.svg)
+![Python 3.9+](https://img.shields.io/badge/python-3.9+-indigo.svg)
+![License: MIT](https://img.shields.io/badge/license-MIT-green.svg)
+CognitiveEDA is not just another EDA tool; it's a world-class data discovery platform that intelligently adapts to your data.
+This enterprise-grade application goes beyond static profiling by automatically detecting the nature of your dataset (e.g., time-series, text-heavy) and unlocking specialized analysis modules on the fly. Powered by Google's Gemini LLM, it delivers a rich, context-aware, and deeply insightful user experience that transforms raw data into a clear narrative with actionable recommendations.
+(A GIF showcasing the adaptive UI revealing specialized tabs after data upload)
+## ✨ Key Features: The "Wow" Factor
+CognitiveEDA is designed to impress data professionals by providing intelligent, context-aware analysis that feels magical.
+*   🧠 **Adaptive Analysis Modules**: The UI isn't static. It detects your data's characteristics and dynamically reveals specialized tabs:
+    *   ⌛ **Time-Series Analysis**: Appears automatically if date/time columns are found. Perform decomposition, check for stationarity (ADF test), and visualize trends.
+    *   📝 **Text Analysis**: Unlocks if long-form text columns are present. Generate word clouds to visualize high-frequency terms.
+    *   🧩 **Clustering (K-Means)**: Becomes available for datasets with strong numeric features, allowing you to discover latent groups and customer segments.
+*   🤖 **Hyper-Contextual AI Narrative**: The integrated Gemini AI doesn't give a generic report. It receives context about the type of data it's analyzing, leading to far more specific and valuable insights (e.g., suggesting ARIMA for time-series or sentiment analysis for text).
+*   **Universal Data Ingestion**: Don't be limited to CSV. CognitiveEDA handles both CSV and Excel files.
+*   ⚡ **Performance-Aware**: For massive datasets, the tool automatically samples the data for UI interactions to keep the experience fast and responsive, while still using the full dataset for backend calculations where feasible.
+*   📊 **Comprehensive Core EDA**: All the essentials, done better:
+    *   Detailed data profiling (missing values, numeric stats, categorical stats).
+    *   At-a-glance overview visuals (data types, missing data heatmap, correlation matrix).
+    *   Interactive deep-dive tools for exploring individual features.
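The adaptive behavior described in the feature list above (specialized tabs appearing based on column types) could be approximated as follows. This is a minimal sketch, not the app's actual logic: the function name `detect_modules`, both thresholds, and the use of a simple numeric-column count as a stand-in for "strong numeric features" are assumptions made for illustration.

```python
import pandas as pd
from pandas.api.types import is_datetime64_any_dtype, is_numeric_dtype

# Hypothetical thresholds; the README does not state the values the app uses.
MIN_AVG_TEXT_LENGTH = 50  # average characters for a column to count as long-form text
MIN_NUMERIC_COLUMNS = 2   # numeric columns needed before clustering is offered


def detect_modules(df: pd.DataFrame) -> dict:
    """Guess which specialized tabs a dataset could unlock."""
    has_datetime = False
    has_long_text = False
    numeric_count = 0

    for name in df.columns:
        column = df[name]
        if is_datetime64_any_dtype(column):
            has_datetime = True
        elif is_numeric_dtype(column):
            numeric_count += 1
        else:
            # Treat any remaining column as text and measure its average length.
            lengths = column.dropna().astype(str).str.len()
            if not lengths.empty and lengths.mean() >= MIN_AVG_TEXT_LENGTH:
                has_long_text = True

    return {
        "time_series": has_datetime,
        "text": has_long_text,
        "clustering": numeric_count >= MIN_NUMERIC_COLUMNS,
    }
```

A UI layer could then toggle the visibility of each tab from the returned flags. Note that this sketch only recognizes columns already parsed as datetimes; date strings would need to be converted first.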
+## 🛠️ Tech Stack
+This project leverages a modern, powerful stack for data science and web applications:
+*   **Backend & Data Analysis**: Python, Pandas, NumPy, scikit-learn, statsmodels
+*   **Web Framework & UI**: Gradio
+*   **AI Integration**: Google Generative AI (Gemini)
+*   **Visualization**: Plotly, Matplotlib, WordCloud
+## 🚀 Getting Started
+You can get your own instance of CognitiveEDA running in just two steps.
+### 1. Prerequisites
+*   Python 3.9 or higher.
+*   A Google Gemini API key. You can get a free key from [Google AI Studio](https://aistudio.google.com/).
+### 2. Installation & Launch
+First, clone the repository to your local machine:
+```bash
+git clone https://github.com/your-repo/CognitiveEDA.git
+cd CognitiveEDA
+```
+Next, install all the required dependencies using the `requirements.txt` file. It's highly recommended to do this within a Python virtual environment.
+```bash
+# Create and activate a virtual environment (optional but recommended)
 python -m venv venv
+source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
+# Install all dependencies
 pip install -r requirements.txt
+```
+Finally, run the application:
+```bash
+python app.py
+```
+The application will start and provide a local URL (e.g., http://127.0.0.1:7860) that you can open in your web browser.
+## 📖 How to Use
+1.  Launch the application and open the URL in your browser.
+2.  Upload your data file using the "Upload Data File" component. Supported formats are `.csv`, `.xlsx`, and `.xls`.
+3.  Enter your Google Gemini API key in the provided text field.
+4.  Click "Build My Dashboard".
+5.  Explore! The application will process your data and build a custom dashboard. The standard tabs (AI Narrative, Profile, Overview) will be populated, and any relevant specialized tabs (Time-Series, Text, Clustering) will automatically appear.
+6.  Interact with the dropdowns and sliders in each tab to perform deep-dive analyses.
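The upload step in the list above accepts `.csv`, `.xlsx`, and `.xls` files. One plausible way to dispatch on the file extension with pandas is sketched below. The function name `load_data` and the error message are hypothetical, and the real `app.py` may handle uploads differently.

```python
from pathlib import Path

import pandas as pd

# Extensions listed as supported in the usage steps.
SUPPORTED_EXTENSIONS = {".csv", ".xlsx", ".xls"}


def load_data(path: str) -> pd.DataFrame:
    """Load a CSV or Excel file into a DataFrame based on its extension."""
    suffix = Path(path).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        raise ValueError(
            f"Unsupported file type '{suffix}'. "
            f"Expected one of: {sorted(SUPPORTED_EXTENSIONS)}"
        )
    if suffix == ".csv":
        return pd.read_csv(path)
    # pandas.read_excel needs an Excel engine installed
    # (for example openpyxl for .xlsx or xlrd for .xls).
    return pd.read_excel(path)
```

Rejecting unknown extensions up front gives the user a clear error before any parsing is attempted.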
+## 💡 Future Roadmap & Contributions
+CognitiveEDA is an evolving platform. We welcome contributions from the community!
+### Potential Future Enhancements
+*   **Geospatial Analysis Module**: Automatically detect latitude/longitude or location names and generate map-based visualizations.
+*   **Interactive HTML Report Export**: Export a single, fully interactive HTML file with embedded Plotly charts.
+*   **Database Connectors**: Allow users to connect directly to PostgreSQL, MySQL, or BigQuery.
+*   **Background Job Processing**: For extremely large datasets, allow full analysis to run as a background task with progress updates.
+*   **Advanced Caching**: Implement more sophisticated caching to speed up re-analysis of the same data.
+### How to Contribute
+1.  Fork the repository.
+2.  Create a new branch for your feature (`git checkout -b feature/AmazingNewFeature`).
+3.  Commit your changes (`git commit -m 'Add some AmazingNewFeature'`).
+4.  Push to the branch (`git push origin feature/AmazingNewFeature`).
+5.  Open a Pull Request.
+## 📄 License
+This project is licensed under the MIT License. See the `LICENSE` file for details.