Overview of PoliticES at IberLEF 2023Political Ideology Detection in Spanish Texts

  1. Ureña López, Luis Alfonso
  2. Valencia García, Rafael
  3. García Díaz, José Antonio
  4. Jiménez Zafra, Salud M.
  5. Martín Valdivia, María Teresa
  6. García Sánchez, Francisco
Revista:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Año de publicación: 2023

Número: 71

Páginas: 409-416

Tipo: Artículo

Otras publicaciones en: Procesamiento del lenguaje natural

Resumen

Este artículo describe PoliticES 2023, una tarea organizada dentro del taller IberLEF 2023 en el marco de la 39 edición del Congreso Internacional de la Sociedad Española para el Procesamiento del Lenguaje Natural. Esta segunda edición de la tarea comparte el objetivo de la primera edición de PoliticES, extraer la ideología política y otros rasgos psicográficos y demográficos de usuarios en redes sociales. Las novedades son que este año los rasgos se extraen de clústers de textos de usuarios que comparten los mismos rasgos y que se ha incluido celebridades como tipo de profesión. Esta edición ha atraído a 43 equipos, de los cuales 11 enviaron resultados y 8 presentaron artículos describiendo sus sistemas. La mayoría de los participantes propusieron enfoques basados en Transformers, pero también otros utilizaron algoritmos tradicionales de aprendizaje automático.

Referencias bibliográficas

  • Acosta-Pacheco, A.-M., D.-P. De-La- Cruz-Sierra, H. Gu, J.-M. Suárez- Bautista, J. Hernández-Espinoza, M.-G. Hernández-Lom, L.-R. Merino-Vázquez, and O. Juárez Gambino. 2023. Par- ticipation of ESCOM’s NLP group at PoliticES-IberLEF2023: Voting Ensemble and basic Machine Learning methods ap- plied to political ideology. In Proceedings of the Iberian Languages Evaluation Fo- rum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • Ahuir, V., L.-F. Hurtado, F. García- Granada, and Sanchis. 2023. ELiRF- VRAIN at PoliticES-IberLEF2023: Deal- ing with Long Texts in Transformer-based Systems for User Profiling. In Proceedings of the Iberian Languages Evaluation Fo- rum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • Barbieri, F., L. Espinosa Anke, and J. Camacho-Collados. 2022. XLM-T: Multilingual language models in Twit- ter for sentiment analysis and beyond. In Proceedings of the Thirteenth Lan- guage Resources and Evaluation Confer- ence, pages 258–266, Marseille, France, June. European Language Resources As- sociation.
  • Baumgaertner, B., J. E. Carlisle, and F. Just- wan. 2018. The influence of political ide- ology and trust on willingness to vacci- nate. PloS one, 13(1):e0191728.
  • Brandon, S., M. M. Del-Toro Carballo, M. S. Arias Fernández, and I. Segura Bedmar. 2023. UC3M at PoliticEs 2023: Ap- plying The Basics. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Nat- ural Language Processing (SEPLN 2023), CEUR-WS.org.
  • Cabrera-Pineda, H., E. S. Tellez, and S. Miranda. 2023. INFOTEC-LaBD at PoliticES-IberLEF2023: Explainable Non-Linear Low-Dimensional Projections. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co- located with the 39th Conference of the Spanish Society for Natural Language Pro- cessing (SEPLN 2023), CEUR-WS.org.
  • Cañete, J., G. Chaperon, R. Fuentes, J.-H. Ho, H. Kang, and J. Pérez. 2020. Span- ish pre-trained bert model and evaluation data. Pml4dc at iclr, 2020:1–10.
  • Conneau, A., K. Khandelwal, N. Goyal, V. Chaudhary, G. Wenzek, F. Guzmán, é. Grave, M. Ott, L. Zettlemoyer, and V. Stoyanov. 2020. Unsupervised cross- lingual representation learning at scale. In Proceedings of the 58th Annual Meeting of the Association for Computational Lin- guistics, pages 8440–8451.
  • Devlin, J., M.-W. Chang, K. Lee, and K. Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for lan- guage understanding. In Proceedings of the 2019 Conference of the North Amer- ican Chapter of the Association for Com- putational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota, June. Association for Compu- tational Linguistics.
  • Fatke, M. 2017. Personality traits and po- litical ideology: A first global assessment. Political Psychology, 38(5):881–899.
  • Fernandez de Landam, J. and R. Agerri. 2023. HiTZ-IXA at PoliticES- IberLEF2023: Document and Sentence Level Text Representations for Demo- graphic Characteristics and Political Ideology Detection. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • García-Díaz, J. A., á. Almela, G. Alcaraz- Mármol, and R. Valencia-García. 2020. UMUCorpusClassifier: Compilation and evaluation of linguistic corpus for Nat- ural Language Processing tasks. Proce- samiento del Lenguaje Natural, 65(0):139– 142.
  • García-Díaz, J. A., R. Colomo-Palacios, and R. Valencia-García. 2022. Psychographic traits identification based on political ideology: An author analysis study on spanish politicians’ tweets posted in 2020. Future Generation Computer Systems, 130:59–74.
  • García-Díaz, J. A., S. M. Jiménez Zafra, M. T. Martín Valdivia, F. García-Sánchez, L. A. Ureña López, and R. Valencia García. 2022. Overview of politices 2022: Spanish author profiling for political ideology. Procesamiento del Lenguaje Natural.
  • Gutiérrez Fandiño, A., J. Armengol Estapé, M. Pamies, J. Llop Palao, J. Silveira Ocampo, C. Pio Carrino, C. Armentano Oller, C. Rodriguez Penagos, A. Gonzalez Agirre, and M. Villegas. 2022. MarIA: Spanish language models. Procesamiento del Lenguaje Natural, 68.
  • He, P., J. Gao, and W. Chen. 2021. Debertav3: Improving deberta using electra-style pre-training with gradientdisentangled embedding sharing. arXiv preprint arXiv:2111.09543.
  • Jiménez-Zafra, S. M., F. Rangel, and M. Montes-y Gómez. 2023. Overview of IberLEF 2023: Natural Language Processing Challenges for Spanish and other Iberian Languages. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • López- ávila, P. E., A. B. García-Gutiérrez, P. A. Gallegos- ávila, R. Aranda, and M. A. álvarez Carmona. 2023. Dataverse at PoliticES-IberLEF2023. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • McInnes, L., J. Healy, N. Saul, and L. Großberger. 2018. Umap: Uniform manifold approximation and projection. Journal of Open Source Software, 3(29):861.
  • Pan, R., C. Caparrós-Laiz, and A. Almela. 2023. UMUTeam at PoliticESIberLEF2023: Evaluating Transformers for Detecting Political Ideology in Spanish Texts. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • Rodríguez-García, M. A. 2023. URJCTeam at PoliticES-IberLEF2023: Political Ideology Detection Using Hybrid Architecture. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEURWS.org.
  • Sanh, V., L. Debut, J. Chaumond, and T. Wolf. 2019. Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108.
  • Verhulst, B., L. J. Eaves, and P. K. Hatemi. 2012. Correlation not causation: The relationship between personality traits and political ideologies. American journal of political science, 56(1):34–51.
  • Villa-Cueva, E., I. González-Franco, F. Sanchez-Vega, and A. P. LópezMonroy. 2022. Nlp-cimat at politices 2022: Politibeto, a domain-adapted transformer for multi-class political author profiling. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruna, Spain.