File size: 2,104 Bytes
481b009
 
 
 
 
 
4a0e67c
481b009
 
 
 
 
4a0e67c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
---
title: SQL Generation
emoji: 🦀
colorFrom: red
colorTo: gray
sdk: gradio
sdk_version: 5.9.1
app_file: app.py
pinned: false
license: mit
---

# SQL Generation 🦀

Welcome to the **SQL Generation** Gradio application! This tool leverages advanced machine learning models to assist in generating SQL queries based on natural language inputs. Whether you're a developer, data analyst, or just curious about SQL, this app aims to simplify the process of crafting SQL queries.

## Features

- **Natural Language to SQL**: Convert plain English descriptions into SQL queries.
- **Multiple Datasets**: Trained on diverse datasets to handle various SQL generation tasks.
- **User-Friendly Interface**: Built with Gradio for an intuitive and interactive experience.

## Installation

To run this application locally, ensure you have Python 3.10 or higher installed. Then, install the required dependencies:

```bash
pip install gradio transformers datasets
```
## Usage
After installing the dependencies, you can start the application by running:

```bash
python app.py
```
This will launch a local server. Open your browser and navigate to http://127.0.0.1:7860 to access the interface.

### Datasets Used
The model has been trained on the following datasets:
- b-mc2/sql-create-context: Provides context for SQL query generation.
- TuneIt/o1-python: Offers examples of Python code snippets.
- HuggingFaceFW/fineweb-2: Includes various language models for fine-tuning.
- sentence-transformers/embedding-training-data: Supplies data for training sentence embeddings.

## Model
The application utilizes the distilbert-base-uncased model from Hugging Face, known for its efficiency and performance in natural language processing tasks.

## License
This project is licensed under the MIT License.

## Acknowledgments
- **Gradio** for providing an easy-to-use interface for machine learning models.
- **Hugging Face** for hosting the pre-trained models and datasets.
- **Datasets** for offering a wide range of datasets for training and evaluation.

For more information, refer to the **Gradio** documentation.