Título: | TxPI-u: A resource for personality identification of undergraduates |
Autor(es): | RAMIREZ DE LA ROSA, ADRIANA GABRIELA VILLATORO TELLO, ESAU JIMENEZ SALAZAR, HECTOR |
Temas: | Recurso lingüístico Identificación de personalidad Elaboración de perfiles de autor Procesamiento natural del lenguaje |
Fecha: | 2018 |
Editorial: | Nueva York : Cornell University |
Citation: | arXiv.org Cornell University 2018 |
Resumen: | Resources such as labeled corpora are necessary to train automatic models within the natural language processing (NLP) field. Historically, a large number of resources regarding a broad number of problems are available mostly in English. One of such problems is known as Personality Identification where based on a psychological model (e.g. The Big Five Model), the goal is to find the traits of a subject’s personality given, for instance, a text written by the same subject. In this paper we introduce a new corpus in Spanish called Texts for Personality Identification (TxPI). This corpus will help to develop models to automatically assign a personality trait to an author of a text document. Our corpus, TxPI-u, contains information of 416 Mexican undergraduate students with some demographics information such as, age, gender, and the academic program they are enrolled. Finally, as an additional contribution, we present a set of baselines to provide a comparison scheme for further research. |
URI: | http://ilitia.cua.uam.mx:8080/jspui/handle/123456789/885 |
Aparece en las colecciones: | Artículos |
Fichero | Descripción | Tamaño | Formato | |
---|---|---|---|---|
TxPI-u A Resource for Personality Identification.pdf | 659.57 kB | Adobe PDF | Visualizar/Abrir |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.