Abstract
We extensively analyse the correlations and drawbacks of conventionally employed evaluation metrics for word segmentation. Unlike in standard information retrieval, precision favours under-splitting systems and therefore can be misleading in word segmentation. Overall, based on both theoretical and experimental analysis, we propose that precision should be excluded from the standard evaluation metrics and that the evaluation score obtained by using only recall is sufficient and better correlated with the performance of word segmentation systems.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 2: Short Papers) |
| Number of pages | 5 |
| Publication date | 1 Dec 2017 |
| Pages | 86–90 |
| ISBN (Print) | 978-1-948087-01-8 |
| Publication status | Published - 1 Dec 2017 |
| Externally published | Yes |
Keywords
- Word segmentation
- Evaluation metrics
- Recall-based evaluation
- Under-splitting
- Information retrieval
Fingerprint
Dive into the research topics of 'Recall is the Proper Evaluation Metric for Word Segmentation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver