Abstract
Languages with logographic writing systems present a difficulty for traditional character-level models. Leveraging the subcharacter information was recently shown to be beneficial for a number of intrinsic and extrinsic tasks in Chinese. We examine whether the same strategies could be applied for Japanese, and contribute a new analogy dataset for this language.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Workshop on the Relevance of Linguistic Structure in Neural Architectures for NLP |
| Number of pages | 10 |
| Place of Publication | Melbourne, Australia |
| Publisher | Association for Computational Linguistics |
| Publication date | 2018 |
| Pages | 28-37 |
| Publication status | Published - 2018 |
Keywords
- Logographic writing systems
- Character-level models
- Subcharacter information
- Chinese language processing
- Japanese language analogies
Fingerprint
Dive into the research topics of 'Subcharacter Information in Japanese Embeddings: When Is It Worth It?'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver