Spaces:
Sleeping
Sleeping
ghadaAlmuaikel
commited on
Commit
•
8fd5977
1
Parent(s):
137b9d5
Update README.md
Browse files
README.md
CHANGED
@@ -15,18 +15,6 @@ Check out the configuration reference at https://huggingface.co/docs/hub/spaces-
|
|
15 |
## Purpose of the Project
|
16 |
This project aims to help job seekers by matching their CVs to job descriptions from major companies using semantic similarity. By analyzing the text content of the CV, the system identifies and presents the top three job descriptions that best match the provided CV. Additionally, it provides translations for CVs that are not in English, as well as the corresponding job descriptions.
|
17 |
|
18 |
-
## Main Files and Their Functions
|
19 |
-
- **project.ipynb**: This is the primary script that runs the project. It includes the following functionalities:
|
20 |
-
- Contains predefined job descriptions from major companies like Google, Amazon, Microsoft, etc.
|
21 |
-
- Detects the language of the input CV.
|
22 |
-
- Translates the CV to English if necessary using Hugging Face’s translation pipeline.
|
23 |
-
- Compares the CV with the job descriptions using semantic similarity through SentenceTransformer.
|
24 |
-
- Outputs the top 3 matching job descriptions and their similarity scores in a bar chart.
|
25 |
-
- Translates job descriptions to the CV’s original language if the input CV is not in English.
|
26 |
-
|
27 |
-
- **Pipeline.ipynb**: This Python notebook demonstrates how the Hugging Face text-to-text pipeline works independently, without the Gradio interface. It walks through using Hugging Face’s translation and semantic similarity tools with hardcoded inputs.
|
28 |
-
|
29 |
-
- **Gradio.ipynb**: This notebook shows how Gradio components work using hardcoded data. It demonstrates the basic setup of a Gradio interface with similar functionalities, but without relying on external inputs or complex workflows.
|
30 |
## Models
|
31 |
### Semantic Similarity Model
|
32 |
The project uses the **SentenceTransformer** model, specifically the `sentence-transformers/all-MiniLM-L6-v2`, to compute semantic similarity between the CV and job descriptions. The model works by converting both the CV and job descriptions into embeddings (vector representations of the text). It then calculates the cosine similarity between these embeddings to determine how closely the texts match in meaning. The higher the cosine similarity score, the more similar the texts are. This allows the system to rank the job descriptions and find the top 3 that best match the content of the provided CV.
|
|
|
15 |
## Purpose of the Project
|
16 |
This project aims to help job seekers by matching their CVs to job descriptions from major companies using semantic similarity. By analyzing the text content of the CV, the system identifies and presents the top three job descriptions that best match the provided CV. Additionally, it provides translations for CVs that are not in English, as well as the corresponding job descriptions.
|
17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
## Models
|
19 |
### Semantic Similarity Model
|
20 |
The project uses the **SentenceTransformer** model, specifically the `sentence-transformers/all-MiniLM-L6-v2`, to compute semantic similarity between the CV and job descriptions. The model works by converting both the CV and job descriptions into embeddings (vector representations of the text). It then calculates the cosine similarity between these embeddings to determine how closely the texts match in meaning. The higher the cosine similarity score, the more similar the texts are. This allows the system to rank the job descriptions and find the top 3 that best match the content of the provided CV.
|