	---
	title: Webseiten URL Extraktor
	emoji: 🐠
	colorFrom: indigo
	colorTo: pink
	sdk: streamlit
	sdk_version: 1.40.2
	app_file: app.py
	pinned: false
	short_description: Extract URLs
	---

	Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

	## Installation and Setup

	1. Clone the Repository
	Open a terminal and run the following command to clone the repository:

	```bash
	git clone https://huggingface.co/spaces/datenwerkzeuge/Webseiten-URL-Extraktor
	```

	2. Navigate to the Repository Directory
	Change into the cloned repository folder, where `app.py` and `requirements.txt` are located:

	```bash
	cd Webseiten-URL-Extraktor
	```

	3. Create and Activate a Virtual Environment (Optional but Recommended)
	A virtual environment keeps this project's dependencies separate from your system-wide Python packages. To create and activate one, use the following commands:

	```bash
	# Create a virtual environment
	python -m venv venv

	# Activate the virtual environment
	# On Windows:
	.\venv\Scripts\activate

	# On macOS/Linux:
	source venv/bin/activate
	```
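To confirm that the activation step took effect, one option is to ask the interpreter itself. The following is a minimal sketch, assuming Python 3.3 or newer and an environment created with `venv`; the helper name `in_virtual_env` is illustrative and not part of this repository.

```python
import sys


def in_virtual_env() -> bool:
    """Return True when the running interpreter belongs to a venv."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtual_env())
```

Save it to a scratch file and run it with the same `python` you intend to use for the app; if it reports `False`, the activation command for your platform has probably not been run in the current shell.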

	4. Install the Required Packages
	Install the dependencies listed in `requirements.txt`:

	```bash
	pip install -r requirements.txt
	```
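After installation, it can help to check that Streamlit actually landed in the active environment. The sketch below assumes Python 3.8 or newer (for `importlib.metadata`) and assumes that `streamlit` is among the installed packages, since the app uses the Streamlit SDK; the contents of `requirements.txt` are not shown here, and the helper name `installed_version` is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("streamlit")
    print("streamlit:", found if found else "not installed")
```

The hosted Space pins `sdk_version: 1.40.2` in its front matter; a local install may resolve to a different version depending on what `requirements.txt` specifies.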

	5. Run the Streamlit Application
	To start the Streamlit app, use the following command:

	```bash
	streamlit run app.py
	```

	6. Access the Application
	Once the server starts, open the web application in your browser. By default, Streamlit serves it at:

	```
	http://localhost:8501
	```
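If the page does not load, a first diagnostic is to check whether anything is listening on that port. This is a rough sketch, assuming the default Streamlit port 8501 on the local machine; the function name `is_port_open` is illustrative only, and a successful connection shows that some process accepts TCP connections there, not that it is this particular app.

```python
import socket


def is_port_open(host: str = "localhost", port: int = 8501, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        # create_connection tries each address the host name resolves to.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and resolution failures.
        return False


if __name__ == "__main__":
    state = "reachable" if is_port_open() else "not reachable"
    print("http://localhost:8501 is", state)
```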