Achieved results

Scientific publication Efficient Argument Classification with Compact Language Models and ChatGPT-4 Refinements. This paper presents comparative studies between a few deep learning-based models in argument mining. The work concentrates on argument classification. The research was done on a wide spectrum of datasets (Args.me, UKP, US2016). The main novelty of this paper is the ensemble model which is based on BERT architecture and ChatGPT-4 as fine tuning model. The presented results show that BERT+ChatGPT-4 outperforms the rest of the models including other Transformer-based and LSTM-based models. The observed improvement is, in most cases, greater than 10%. The presented analysis can provide crucial insights into how the models for argument classification should be further improved. Additionally, it can help develop a prompt-based algorithm to eliminate argument classification errors.
Cite as: Pietron, M., Olszowski, R., Gomułka, J. (2024). Efficient Argument Classification with Compact Language Models and ChatGPT-4 Refinements. In: Nguyen, N.T., et al. Computational Collective Intelligence. ICCCI 2024. Lecture Notes in Computer Science, vol 14810. Springer, Cham. https://doi.org/10.1007/978-3-031-70816-9_20.

Planned results

Scientific Publication on Comparative Research in Argument Classification Using Language Models. This scientific publication presents a comparative study on argument classification using language models. The analysis examines diverse datasets sourced from well-established research projects, including Args.me, UKP, and US2016. The study incorporates various models such as GPT-4, GPT-4o, Meta-LLAMA, and DeepSeek, as well as an evaluation of different prompting strategies. The publication discusses the most common errors observed across all tested models and highlights key differences between them. Furthermore, the paper investigates widely used inference algorithms based on prompt engineering techniques and introduces a hybrid model composed of selected LLMs, rephrased queries, and a voting module. This new model improves argument recognition accuracy, although it still does not achieve full precision. According to the authors, this study represents the first extensive analysis of the mentioned datasets using LLMs and prompt-based algorithms. Additionally, the paper identifies the limitations of existing prompt engineering approaches in argument analysis and suggests directions for their further improvement.
Scientific Publication on Argument Theory and Annotation Policy. This publication explores theoretical approaches to argumentation and challenges related to annotation policies in argument analysis. It provides a comprehensive review of argument definitions, examining various theoretical perspectives and practical difficulties associated with their application, particularly in Argument Mining (AM). In the first section, the authors discuss existing concepts of argumentation found in the academic literature. The review covers classical approaches in logic and critical thinking, rhetorical and legal perspectives, as well as a philosophical perspective inspired by Wittgenstein, where argumentation is viewed as a form of storytelling. Additionally, the paper analyzes argumentation within debates, contrasting the agonistic theory with deliberative models, and examines argumentation in linguistic contexts. The second part of the paper focuses on the challenges of automatic argument recognition in online discussions and existing annotated corpora used in AM research. The study then introduces a practical set of criteria for argument identification, aiming to develop an operational definition of an argument in a regulatory sense rather than a purely descriptive one. Beginning with a broad approach (e.g., Argumentative Discursive Unit; ADU), the authors propose a gradual refinement and justification of stricter regulations to enhance argument detection in both theoretical research and empirical applications.
Argument Database. As a key outcome of our research project, we develop a comprehensive corpus of texts on public issues, annotated in terms of argument-premise relationships and the multithreaded structures of various debate models. In line with our project’s objectives, the corpus will contain approximately 30,000 records in Polish and another 30,000 in English, facilitating effective machine learning for Argument Mining. The collected data, stored in a database in JSON format, will be continuously updated to reflect the most current topics of public debate that emerge throughout the project's duration. Given the rapidly changing nature of public discourse, ongoing data collection and annotation represent a valuable contribution to the field. Additionally, knowledge graphs will be generated using Argument Extraction Models to map interrelations between arguments within the corpus and to identify argumentation patterns across different debate models. These efforts will include the organization of hierarchical categories and relationships, forming formal ontologies that enhance the structured representation of argumentation. Argument database will be developed using Semantic Web Technology, accompanied by a web-based interface, ensuring accessibility and usability for researchers and AI-driven applications.
Two additional scientific publications discussing the results of the project.