bsenst's picture
add xpath und css selector options
9bc8ed7
---
title: Webseiten URL Extraktor
emoji: ๐Ÿ 
colorFrom: indigo
colorTo: pink
sdk: streamlit
sdk_version: 1.40.2
app_file: app.py
pinned: false
short_description: URLs extrahieren
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
## Installation and Setup
1. **Clone the Repository**
Open a terminal and run the following command to clone the repository:
```bash
git clone https://huggingface.co/spaces/datenwerkzeuge/Webseiten-URL-Extraktor
```
2. **Navigate to the Streamlit Application Directory**
Change your directory to the `streamlit` folder where the `app.py` and `requirements.txt` files are located:
```bash
cd Webseiten-URL-Extraktor
```
3. **Create and Activate a Virtual Environment (Optional but Recommended)**
It's a good practice to use a virtual environment to manage dependencies. To create and activate a virtual environment, use the following commands:
```bash
# Create a virtual environment
python -m venv venv
# Activate the virtual environment
# On Windows:
.\venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate
```
4. **Install the Required Packages**
Install the dependencies listed in `requirements.txt`:
```bash
pip install -r requirements.txt
```
5. **Run the Streamlit Application**
To start the Streamlit app, use the following command:
```bash
streamlit run app.py
```
6. **Access the Application**
Once the server starts, you can access the web application by visiting:
```
http://localhost:8501
```