Abstract
Recent approaches in skill-to-surface-form matching, employing synthetic training data for classification or similarity model training, have shown promising results, eliminating the need for time-consuming and expensive annotation. However, previous datasets have limitations, such as featuring only one skill per sentence and generally comprising short sentences. This paper introduces JobSkape, a framework to generate synthetic data that resembles real-world job postings, specifically designed to enhance skill-to-taxonomy matching. Within this framework, we create SkillSkape, a comprehensive open-source synthetic dataset of job postings tailored for skill-matching tasks. We introduce several offline metrics that show our dataset is more diverse, realistic, and follows a higher quality based on similarities. Additionally, we present a multi-step pipeline utilizing large language models (LLMs), benchmarking against supervised methodologies. We outline that the performances are comparable and that each method can be used for different use cases.
| Original language | English |
|---|---|
| Title of host publication | 1st Workshop on Natural Language Processing for Human Resources |
| Number of pages | 16 |
| Publisher | Association for Computational Linguistics |
| Publication date | Mar 2024 |
| Pages | 43–58 |
| Publication status | Published - Mar 2024 |
| Event | Natural Language Processing for Human Resources workshop - St. Julians, Malta Duration: 22 Mar 2024 → 22 Mar 2024 Conference number: 1 https://megagon.ai/nlp4hr-2024/ https://megagon.ai/workshops/nlp4hr-2024/ https://openreview.net/group?id=eacl.org/EACL/2024/Workshop/NLP4HR#tab-recent-activity |
Workshop
| Workshop | Natural Language Processing for Human Resources workshop |
|---|---|
| Number | 1 |
| Country/Territory | Malta |
| City | St. Julians |
| Period | 22/03/2024 → 22/03/2024 |
| Internet address |
Keywords
- Skill-to-surface-form matching
- Synthetic training data
- Job postings
- Skill-matching tasks
- Large language models (LLMs)
Fingerprint
Dive into the research topics of 'JobSkape: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver