Spaces:
Sleeping
Sleeping
Commit History
added diagram of the current flow
37a41c8
section extraction to mongodb is working through the notebook but needs to be moved to a function.
0f180d3
set up to dump to MongoDB instead of PostgreSQL
4fad2af
ratio of top
47cc304
changed shape of top with image
e20829a
removed benefits
616189c
removed city for streamlit app
ede952f
updated gitignore to include pycache and removed it from the githistory as its not needed there. data24.csv was updated to some recent jobs but is still usable.
08ac07d
Remove __pycache__ directory
d130605
successfully able to extract the description information back into a pandas dataframe. Need to look how to write to the database after every call instead of doing it in such large batches.
b78565b
able to parse objects out of the job description, but extracting in mass w/ error handling is a little difficult. Most recent run is in parse_description_test notebook thats able to connect to psql and then extract a sample set of 74 descriptions.
66700bd
had to make it smaller
a21b3b7
15x the height of the text entry
3c1b90e
expanded the text entry box
41334c8
updated requirements to not look at local files
d5cb71d
added requirements.txt for dependency control
8ed23dc
updated config in readme
a1581dc
Resolve merge conflicts
2254e3b
inital commit to spaces
f5aa239
initial commit
8ed6449
re organized the file structure in order to work with building a streamlit app to test the job description extraction.
f7fc876
added image from draw.io for overall flow
0e8c8f5
Initial commit: Google_jobs has the main functions that pulls jobs from google search.
1b96fb3
Initial commit
8f87470
Mishahal Palakuniyil
commited on