Aqsa Kausar commited on
Commit
0de3a7e
·
unverified ·
1 Parent(s): 89cf21f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -50,6 +50,24 @@ You should define your own project by writing at most one page description of th
50
  ### What to deliver
51
  You should deliver your project as a stand alone serverless ML system. You should submit a URL for your service, a zip file containing your code, and a short report (two to three pages) about what you have done, the dataset, your method, your results, and how to run the code. I encourage you to have the README.md for your project in your Github report as the report for your project.
52
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
53
  # Skill Embeddings and Visualization
54
 
55
  We generate embeddings for technical skills listed in .txt files and visualizes their relationships using dimensionality reduction and clustering techniques. The visualizations are created for both 2D and 3D embeddings, and clustering is performed using KMeans to identify groups of similar skills.
 
50
  ### What to deliver
51
  You should deliver your project as a stand alone serverless ML system. You should submit a URL for your service, a zip file containing your code, and a short report (two to three pages) about what you have done, the dataset, your method, your results, and how to run the code. I encourage you to have the README.md for your project in your Github report as the report for your project.
52
 
53
+
54
+ # Job Scraping
55
+
56
+ This component scrapes job descriptions from the LinkedIn Job Search API for Machine Learning, and saves them in text files for further analysis.
57
+
58
+ ## Workflow
59
+
60
+ 1. **API Configuration**:
61
+ - The script uses the `linkedin-job-search-api.p.rapidapi.com` endpoint to fetch job data.
62
+ - API access is authenticated using a RapidAPI key stored as an environment variable `RAPID_API_KEY`.
63
+
64
+ 2. **Data Retrieval**:
65
+ - The script fetches jobs matching the keyword `machine learning`.
66
+ - It retrieves job details including the description, which is saved for further analysis.
67
+
68
+ 3. **Job Description Extraction**:
69
+ - Each job description is saved in a `.txt` file under the `job-postings/<date>` folder.
70
+
71
  # Skill Embeddings and Visualization
72
 
73
  We generate embeddings for technical skills listed in .txt files and visualizes their relationships using dimensionality reduction and clustering techniques. The visualizations are created for both 2D and 3D embeddings, and clustering is performed using KMeans to identify groups of similar skills.