Robzy commited on
Commit
4f274c5
·
1 Parent(s): d5ecb26

project outline done

Browse files
Files changed (3) hide show
  1. README.md +36 -1
  2. extract.py +38 -0
  3. job-ad.txt +0 -0
README.md CHANGED
@@ -1 +1,36 @@
1
- # in-demand
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Compilation of in-demand tech skills
2
+
3
+ # Project overview
4
+
5
+ ## Model: skills extraction model
6
+
7
+ [Model: skills extraction model from HuggingFace](https://huggingface.co/spaces/jjzha/skill_extraction_demo)
8
+
9
+ ## Inference
10
+ 1. Extracting new job abs from Indeed/LinkedIn
11
+ 2. Extract skills from job ads via skills extraction model
12
+
13
+ ## Online training
14
+ Extract ground truth via LLM and few-shot learning.
15
+
16
+ ## Skill compilation
17
+ Save all skills. Make a comprehensive overview by:
18
+
19
+ 1. Embed skills to a vector with an embedding model
20
+ 2. Perform clustering with HDBSCAN
21
+ 2. Visualize clustering with dimensionality reduction (UMAP)
22
+
23
+ Inspiration: [link](https://dylancastillo.co/posts/clustering-documents-with-openai-langchain-hdbscan.html)
24
+
25
+
26
+ ## Project requirements:
27
+
28
+ You should define your own project by writing at most one page description of the project. The proposed project should be approved by the examiner. The project proposal should cover the following headings:
29
+
30
+ ### Problem description: what are the data sources and the prediction problem that you will be building a ML System for?
31
+ ### Tools: what tools you are going to use? In the course we mainly used Decision Trees and PyTorch/Tensorflow, but you are free to explore new tools and technologies.
32
+ ### Data: what data will you use and how are you going to collect it?
33
+ ### Methodology and algorithm: what method(s) or algorithm(s) are you proposing?
34
+ ### What to deliver
35
+ You should deliver your project as a stand alone serverless ML system. You should submit a URL for your service, a zip file containing your code, and a short report (two to three pages) about what you have done, the dataset, your method, your results, and how to run the code. I encourage you to have the README.md for your project in your Github report as the report for your project.
36
+
extract.py ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ About the job
2
+ Grow with us
3
+
4
+ About This Opportunity
5
+
6
+ Ericsson is a world-leading provider of telecommunications equipment and services to mobile and fixed network operators. Over 1,000 networks in more than 180 countries use Ericsson equipment, and more than 40 percent of the world's mobile traffic passes through Ericsson networks. Using innovation to empower people, business and society, Ericsson is working towards the Networked Society: a world connected in real time that will open opportunities to create freedom, transform society and drive solutions to some of our planet’s greatest challenges.
7
+
8
+ Ericsson's 6G vision, first introduced in 2020, remains pivotal for transforming business and society in the 2030s through secure, efficient, and sustainable communication services. As 6G development progresses into a more concrete phase of regulation and standardization we are looking for researchers that would like to join us, co-creating a cyber-physical world
9
+
10
+ Within Ericsson, Ericsson Research develops new communication solutions and standards which have made Ericsson the industry leader in defining five generations of mobile communication. As we gear up for the 6th generation, we would like to fully embrace and utilize cloud native principles, hyperscalers and internal cloud infrastructure in our research. We are now looking for a MLOps research engineer to develop and support our workflows.
11
+
12
+ In this role, you will
13
+
14
+ Contribute to the direction and implementation of ML-based ways of working
15
+ Study, design and develop workflows and solutions for AI based R&D
16
+ Work across internal compute and external cloud platforms
17
+ Working closely with researchers driving 6G standardization
18
+
19
+ Join our Team
20
+
21
+ Qualifications
22
+
23
+ MSc in Data Science or related field, or have equivalent practical experience
24
+ Technical skills and/or professional experience, particularly in:
25
+ Programming in various languages (Python, Go, etc)
26
+ MLOps technologies and tooling (e.g. MLFlow, Kubeflow)
27
+ Dispatching and computational Python packages (Hydra, numpy, TensorFlow, etc.)
28
+ DevOps and CI/CD experience, runner deployment & management, pipeline creation, testing etc. for validating ML-driven code
29
+ Familiarity in the following is a plus:
30
+ ML frameworks (PyTorch, TensorFlow, or Jax)
31
+ Containers technologies (engines, orchestration tools and frameworks such as Docker, Kaniko, Kubernetes, Helm, etc.)
32
+ Cloud ecosystems along with the respective infrastructure, in particular AWS
33
+ Infrastructure management (Ansible, Terraform, etc.)
34
+ Team skills is a necessity. Daily cross-functional collaboration and interaction with other skilled researchers are the basis for our ways of working.
35
+ You should enjoy working with people having diverse backgrounds and competences.
36
+ It is important that you have strong personal drive and a strong focus on the tasks at hand.
37
+ Ability to translate high-level objectives into detailed tasks and actionable steps.
38
+ Location: Luleå, Sweden
job-ad.txt ADDED
File without changes