ITU

Matching Theory and Data with Personal-ITY: What a Corpus of Italian YouTube Comments Reveals About Personality

Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

View graph of relations

As a contribution to personality detection in languages other than English, we rely on distant supervision to create Personal-ITY, a novel corpus of YouTube comments in Italian, where authors are labelled with personality traits. The traits are derived from one of the mainstream personality theories in psychology research, named MBTI. Using personality prediction experiments, we (i) study the task of personality prediction in itself on our corpus as well as on TWISTY, a Twitter dataset also annotated with MBTI labels; (ii) carry out an extensive, in-depth analysis of the
features used by the classifier, and view them specifically under the light of the original theory that we used to create the corpus in the first place. We observe that no single model is best at personality detection, and that while some traits are easier than others to detect, and also to match back to theory, for other, less frequent traits the picture is much more blurred.
Original languageEnglish
Title of host publicationProceedings of the Third Workshop on Computational Modeling of People's Opinions, Personality, and Emotion's in Social Media
PublisherAssociation for Computational Linguistics
Publication date2020
Pages11-22
Publication statusPublished - 2020
EventPEOPLES - THIRD WORKSHOP ON COMPUTATIONAL MODELING OF PEOPLE’S OPINIONS, PERSONALITY, AND EMOTIONS IN SOCIAL MEDIA - Barcelona, Barcelona, Spain
Duration: 14 Sep 2020 → …
https://peopleswksh.github.io/

Workshop

WorkshopPEOPLES - THIRD WORKSHOP ON COMPUTATIONAL MODELING OF PEOPLE’S OPINIONS, PERSONALITY, AND EMOTIONS IN SOCIAL MEDIA
LocationBarcelona
LandSpain
ByBarcelona
Periode14/09/2020 → …
Internetadresse

ID: 85513010