Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
LeoWalker
/
jobDescriptionParser
like
2
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
jobDescriptionParser
Ctrl+K
Ctrl+K
3 contributors
History:
25 commits
LeoWalker
turned off "google_domain" after measuring increased failure rate from api call.
c8a9bc1
about 1 year ago
.vscode
re organized the file structure in order to work with building a streamlit app to test the job description extraction.
about 1 year ago
images
added image from draw.io for overall flow
about 1 year ago
notebooks
section extraction to mongodb is working through the notebook but needs to be moved to a function.
about 1 year ago
utils
turned off "google_domain" after measuring increased failure rate from api call.
about 1 year ago
.DS_Store
Safe
6.15 kB
section extraction to mongodb is working through the notebook but needs to be moved to a function.
about 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 year ago
.gitignore
Safe
107 Bytes
updated gitignore to include pycache and removed it from the githistory as its not needed there. data24.csv was updated to some recent jobs but is still usable.
about 1 year ago
README.md
Safe
3.07 kB
updated config in readme
about 1 year ago
jobDetailExtractor.drawio.png
Safe
87 kB
added diagram of the current flow
about 1 year ago
requirements.txt
Safe
1.4 kB
updated requirements to not look at local files
about 1 year ago
sf_recent_jobs.csv
Safe
353 kB
successfully able to extract the description information back into a pandas dataframe. Need to look how to write to the database after every call instead of doing it in such large batches.
about 1 year ago
streamlit_app.py
Safe
5.36 kB
ratio of top
about 1 year ago