PhoenixUI / README.md
mgbam's picture
Update README.md
e0c35f9 verified
|
raw
history blame
5.7 kB
metadata
title: EDAONSTERIOD
emoji: πŸ‘€
colorFrom: green
colorTo: indigo
sdk: gradio
sdk_version: 5.34.1
app_file: app.py
pinned: false
short_description: Analytic

πŸ”₯ Odyssey: The AI Data Science Workspace

πŸš€ CognitiveEDA: The Adaptive Intelligence Engine alt text

alt text

alt text CognitiveEDA is not just another EDA tool; it's a world-class data discovery platform that intelligently adapts to your data. This enterprise-grade application goes beyond static profiling by automatically detecting the nature of your dataset (e.g., time-series, text-heavy) and unlocking specialized analysis modules on the fly. Powered by Google's Gemini LLM, it delivers a rich, context-aware, and deeply insightful user experience that transforms raw data into a clear narrative with actionable recommendations. (A GIF showcasing the adaptive UI revealing specialized tabs after data upload)

✨ Key Features: The "Wow" Factor

CognitiveEDA is designed to impress data professionals by providing intelligent, context-aware analysis that feels magical.

🧠 Adaptive Analysis Modules: The UI isn't static. It intelligently detects your data's characteristics and dynamically reveals specialized tabs:

βŒ› Time-Series Analysis: Automatically appears if date/time columns are found. Perform decomposition, check for stationarity (ADF Test), and visualize trends.

πŸ“ Text Analysis: Unlocks if long-form text columns are present. Instantly generate word clouds to visualize high-frequency terms.

🧩 Clustering (K-Means): Becomes available for datasets with strong numeric features, allowing you to discover latent groups and customer segments.

πŸ€– Hyper-Contextual AI Narrative: The integrated Gemini AI doesn't give a generic report. It receives context about the type of data it's analyzing, leading to far more specific and valuable insights (e.g., suggesting ARIMA for time-series or sentiment analysis for text). ** Universal Data Ingestion:** Don't be limited to CSV. CognitiveEDA handles CSV and Excel files seamlessly.

⚑ Performance-Aware: For massive datasets, the tool automatically samples the data for UI interactions to ensure a fast, responsive experience, while still using the full dataset for backend calculations where feasible. πŸ“Š Comprehensive Core EDA: All the essentials, done better:

Detailed Data Profiling (Missing values, numeric stats, categorical stats). At-a-glance overview visuals (Data types, missing data heatmap, correlation matrix). Interactive deep-dive tools for exploring individual features.

πŸ› οΈ Tech Stack This project leverages a modern, powerful stack for data science and web applications: Backend & Data Analysis: Python, Pandas, NumPy, scikit-learn, statsmodels Web Framework & UI: Gradio AI Integration: Google Generative AI (Gemini) Visualization: Plotly, Matplotlib, WordCloud

πŸš€ Getting Started You can get your own instance of CognitiveEDA running in just two steps.

  1. Prerequisites Python 3.9 or higher. A Google Gemini API Key. You can get a free key from Google AI Studio.
  2. Installation & Launch First, clone the repository to your local machine: Generated bash git clone https://github.com/your-repo/CognitiveEDA.git

cd CognitiveEDA Use code with caution. Bash Next, install all the required dependencies using the requirements.txt file. It's highly recommended to do this within a Python virtual environment. Generated bash

Create and activate a virtual environment (optional but recommended)

python -m venv venv source venv/bin/activate # On Windows, use venv\Scripts\activate

Install all dependencies

pip install -r requirements.txt Use code with caution. Bash Finally, run the application: Generated bash python app.py Use code with caution. Bash The application will start and provide a local URL (e.g., http://127.0.0.1:7860) that you can open in your web browser.

πŸ“– How to Use Launch the application and open the URL in your browser. Upload your data file using the "Upload Data File" component. Supported formats are .csv, .xlsx, and .xls. Enter your Google Gemini API Key in the provided text field. Click "Build My Dashboard".

Explore! The application will process your data and build a custom dashboard. The standard tabs (AI Narrative, Profile, Overview) will be populated, and any relevant specialized tabs (Time-Series, Text, Clustering) will automatically appear. Interact with the dropdowns and sliders in each tab to perform deep-dive analyses. πŸ’‘ Future Roadmap & Contributions CognitiveEDA is an evolving platform. We welcome contributions from the community! Potential Future Enhancements:

Geospatial Analysis Module: Automatically detect latitude/longitude or location names and generate map-based visualizations. Interactive HTML Report Export: Export a single, beautiful, and fully interactive HTML file with embedded Plotly charts. Database Connectors: Allow users to connect directly to PostgreSQL, MySQL, or BigQuery. Background Job Processing: For extremely large datasets, allow full analysis to run as a background task with progress updates. Advanced Caching: Implement more sophisticated caching to speed up re-analysis of the same data. How to Contribute Fork the repository.

Create a new branch for your feature (git checkout -b feature/AmazingNewFeature). Commit your changes (git commit -m 'Add some AmazingNewFeature'). Push to the branch (git push origin feature/AmazingNewFeature). Open a Pull Request.

πŸ“„ License This project is licensed under the MIT License - see the LICENSE file for details.