Spaces:
Running
Running
kolumbuslindh
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -56,19 +56,22 @@ The platform uses two primary data sources:
|
|
56 |
|
57 |
### Workflow
|
58 |
|
59 |
-
1. **
|
|
|
|
|
|
|
60 |
|
61 |
- Job data is fetched via the JobStream API and stored in Pinecone after being vectorized.
|
62 |
- Metadata such as job title, description, location, and contact details is extracted.
|
63 |
|
64 |
-
|
65 |
- User-uploaded resumes are vectorized using the same sentence transformer model.
|
66 |
- Pinecone is queried for the top-k most similar job embeddings, which are then displayed to the user alongside their similarity scores.
|
67 |
|
68 |
-
|
69 |
- If a user chooses to leave feedback, by either clicking *Relevant* or *Not Relevant*, the users CV is uploaded to Hopsworks together with the specific ad data, and the selected choice.
|
70 |
|
71 |
-
|
72 |
- Once every seven days, a chrone job on *Github Actions* runs, where the base model is finetuned on the total data stored in the feature store.
|
73 |
|
74 |
---
|
|
|
56 |
|
57 |
### Workflow
|
58 |
|
59 |
+
1. **Flowchart of JobsAI**
|
60 |
+
[JobsAI flowchart structure](https://www.imghippo.com/i/CZk3216mnA.png)
|
61 |
+
|
62 |
+
2. **Data Retrieval**:
|
63 |
|
64 |
- Job data is fetched via the JobStream API and stored in Pinecone after being vectorized.
|
65 |
- Metadata such as job title, description, location, and contact details is extracted.
|
66 |
|
67 |
+
3. **Similarity Search**:
|
68 |
- User-uploaded resumes are vectorized using the same sentence transformer model.
|
69 |
- Pinecone is queried for the top-k most similar job embeddings, which are then displayed to the user alongside their similarity scores.
|
70 |
|
71 |
+
4. **Feature Uploading**:
|
72 |
- If a user chooses to leave feedback, by either clicking *Relevant* or *Not Relevant*, the users CV is uploaded to Hopsworks together with the specific ad data, and the selected choice.
|
73 |
|
74 |
+
5. **Model Training**:
|
75 |
- Once every seven days, a chrone job on *Github Actions* runs, where the base model is finetuned on the total data stored in the feature store.
|
76 |
|
77 |
---
|