Abstract
Abusive phenomena are commonplace in language on the web. The scope of recognizing abusive language is broad, covering many behaviors and forms of expression. This work addresses automatic detection of abusive language in Russian. The lexical, grammatical and morphological diversity of Russian language present potential difficulties for this task, which is addressed using a variety of machine learning approaches. Finally, competitive performance is reached over multiple domains for this investigation into automatic detection of abusive language in Russian.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing |
Forlag | Association for Computational Linguistics |
Publikationsdato | 20 apr. 2021 |
Sider | 20-25 |
Status | Udgivet - 20 apr. 2021 |
Emneord
- Abusive language detection
- Russian language processing
- Machine learning
- Lexical diversity
- Morphological analysis