Spaces: Robzy/jobbert_knowledge_extraction

Robzy committed on Jan 2

Commit 4f274c5 (1 parent: d5ecb26)

project outline done

Files changed (3)

README.md +36 -1
extract.py +38 -0
job-ad.txt +0 -0

README.md CHANGED

	@@ -1 +1,36 @@
-# in-demand
+# Compilation of in-demand tech skills
+# Project overview
+## Model: skills extraction model
+[Model: skills extraction model from HuggingFace](https://huggingface.co/spaces/jjzha/skill_extraction_demo)
+## Inference
+1. Extract new job ads from Indeed/LinkedIn
+2. Extract skills from job ads via skills extraction model
+## Online training
+Extract ground truth via LLM and few-shot learning.
+## Skill compilation
+Save all skills. Make a comprehensive overview by:
+1. Embed each skill as a vector with an embedding model
+2. Perform clustering with HDBSCAN
+3. Visualize the clusters with dimensionality reduction (UMAP)
+Inspiration: [link](https://dylancastillo.co/posts/clustering-documents-with-openai-langchain-hdbscan.html)
+## Project requirements:
+You should define your own project by writing a project description of at most one page. The proposed project should be approved by the examiner. The project proposal should cover the following headings:
+### Problem description: what are the data sources and the prediction problem that you will be building an ML system for?
+### Tools: what tools are you going to use? In the course we mainly used Decision Trees and PyTorch/TensorFlow, but you are free to explore new tools and technologies.
+### Data: what data will you use and how are you going to collect it?
+### Methodology and algorithm: what method(s) or algorithm(s) are you proposing?
+### What to deliver
+You should deliver your project as a standalone serverless ML system. You should submit a URL for your service, a zip file containing your code, and a short report (two to three pages) about what you have done, the dataset, your method, your results, and how to run the code. I encourage you to use the README.md in your project's GitHub repository as the report for your project.
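
Step 2 of the Inference section in the README above (extract skills from job ads via the skills extraction model) could be post-processed roughly as sketched below. This assumes the model is a token classifier that emits one `B`, `I`, or `O` tag per token, which the commit itself does not confirm. The function name `merge_bio_spans` and the example tokens and tags are hypothetical, hand-written for illustration, and are not real model output.

```python
from typing import List


def merge_bio_spans(tokens: List[str], tags: List[str]) -> List[str]:
    """Group tokens tagged B/I into contiguous skill spans (assumed BIO scheme)."""
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")

    spans: List[str] = []
    current: List[str] = []
    for token, tag in zip(tokens, tags):
        if tag == "B":
            # A new span starts; flush any span that is still open.
            if current:
                spans.append(" ".join(current))
            current = [token]
        elif tag == "I" and current:
            # Continue the span that is currently open.
            current.append(token)
        else:
            # "O", or a stray "I" with no open span: close whatever is open.
            if current:
                spans.append(" ".join(current))
            current = []
    if current:
        spans.append(" ".join(current))
    return spans


# Hand-written example, not model output.
tokens = ["Experience", "with", "Docker", "and", "infrastructure", "management"]
tags = ["O", "O", "B", "O", "B", "I"]
print(merge_bio_spans(tokens, tags))  # ['Docker', 'infrastructure management']
```

The merged spans are what the Skill compilation step would then save and embed.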

extract.py ADDED

	@@ -0,0 +1,38 @@

+About the job
+Grow with us
+About This Opportunity
+Ericsson is a world-leading provider of telecommunications equipment and services to mobile and fixed network operators. Over 1,000 networks in more than 180 countries use Ericsson equipment, and more than 40 percent of the world's mobile traffic passes through Ericsson networks. Using innovation to empower people, business and society, Ericsson is working towards the Networked Society: a world connected in real time that will open opportunities to create freedom, transform society and drive solutions to some of our planet’s greatest challenges.
+Ericsson's 6G vision, first introduced in 2020, remains pivotal for transforming business and society in the 2030s through secure, efficient, and sustainable communication services. As 6G development progresses into a more concrete phase of regulation and standardization we are looking for researchers that would like to join us, co-creating a cyber-physical world
+Within Ericsson, Ericsson Research develops new communication solutions and standards which have made Ericsson the industry leader in defining five generations of mobile communication. As we gear up for the 6th generation, we would like to fully embrace and utilize cloud native principles, hyperscalers and internal cloud infrastructure in our research. We are now looking for a MLOps research engineer to develop and support our workflows.
+In this role, you will
+Contribute to the direction and implementation of ML-based ways of working
+Study, design and develop workflows and solutions for AI based R&D
+Work across internal compute and external cloud platforms
+Working closely with researchers driving 6G standardization
+Join our Team
+Qualifications
+MSc in Data Science or related field, or have equivalent practical experience
+Technical skills and/or professional experience, particularly in:
+Programming in various languages (Python, Go, etc)
+MLOps technologies and tooling (e.g. MLFlow, Kubeflow)
+Dispatching and computational Python packages (Hydra, numpy, TensorFlow, etc.)
+DevOps and CI/CD experience, runner deployment & management, pipeline creation, testing etc. for validating ML-driven code
+Familiarity in the following is a plus:
+ML frameworks (PyTorch, TensorFlow, or Jax)
+Containers technologies (engines, orchestration tools and frameworks such as Docker, Kaniko, Kubernetes, Helm, etc.)
+Cloud ecosystems along with the respective infrastructure, in particular AWS
+Infrastructure management (Ansible, Terraform, etc.)
+Team skills is a necessity. Daily cross-functional collaboration and interaction with other skilled researchers are the basis for our ways of working.
+You should enjoy working with people having diverse backgrounds and competences.
+It is important that you have strong personal drive and a strong focus on the tasks at hand.
+Ability to translate high-level objectives into detailed tasks and actionable steps.
+Location: Luleå, Sweden
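
The README's Online training section proposes extracting ground truth via an LLM and few-shot learning. A minimal sketch of how such a prompt might be assembled from lines of the job ad above is shown here. The function name `build_few_shot_prompt`, the instruction wording, and the skill labels attached to the example lines are illustrative assumptions. No LLM is called; the sketch only builds the prompt string.

```python
from typing import List, Sequence, Tuple


def build_few_shot_prompt(
    examples: Sequence[Tuple[str, List[str]]], query: str
) -> str:
    """Build a few-shot prompt: an instruction, labeled examples, then the query."""
    parts = [
        "Extract the technical skills mentioned in each sentence "
        "as a comma-separated list."
    ]
    for sentence, skills in examples:
        parts.append(f"Sentence: {sentence}\nSkills: {', '.join(skills)}")
    # The final block is left open so the LLM completes the skill list.
    parts.append(f"Sentence: {query}\nSkills:")
    return "\n\n".join(parts)


# Sentences are taken from the job ad; the labels are hand-written assumptions.
examples = [
    ("Programming in various languages (Python, Go, etc)", ["Python", "Go"]),
    ("MLOps technologies and tooling (e.g. MLFlow, Kubeflow)", ["MLFlow", "Kubeflow"]),
]
prompt = build_few_shot_prompt(
    examples, "Infrastructure management (Ansible, Terraform, etc.)"
)
print(prompt)
```

The LLM's completion for the final `Skills:` line could then be parsed back into a list and stored as a label for that sentence.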

job-ad.txt ADDED

File without changes