Personal-ITY: A Novel YouTube-based Corpus for Personality Prediction in Italian

Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

We present a novel corpus for personality prediction in Italian, containing a
larger number of authors and a different genre compared to previously available
resources. The corpus is built exploiting Distant Supervision, assigning Myers-
Briggs Type Indicator (MBTI) labels to YouTube comments, and can lend itself to
a variety of experiments. We report on preliminary experiments on Personal-ITY,
which can serve as a baseline for future work, showing that some types are easier
to predict than others, and discussing the perks of cross-dataset prediction.
Original language: English
Title of host publication: Seventh Italian Conference on Computational Linguistics
Publisher: Association for Computational Linguistics
Publication date: 2020
Publication status: Published - 2020
Externally published: Yes
Event: Seventh Italian Conference on Computational Linguistics - Bologna, Italy
Duration: 1 Mar 2021 - 3 Mar 2021


Conference: Seventh Italian Conference on Computational Linguistics

ID: 85513053