2024
Autores
de Arriba Pérez, F; García Méndez, S; Leal, F; Malheiro, B; Burguillo Rial, JC;
Publicação
Lecture Notes in Networks and Systems
Abstract
The latest technological advances drive the emergence of countless real-time data streams fed by users, sensors, and devices. These data sources can be mined with the help of predictive and classification techniques to support decision-making in fields like e-commerce, industry or health. In particular, stream-based classification is widely used to categorise incoming samples on the fly. However, the distribution of samples per class is often imbalanced, affecting the performance and fairness of machine learning models. To overcome this drawback, this paper proposes Bplug, a balancing plug-in for stream-based classification, to minimise the bias introduced by data imbalance. First, the plug-in determines the class imbalance degree and then synthesises data statistically through non-parametric kernel density estimation. The experiments, performed with real data from Wikivoyage and Metro of Porto, show that Bplug maintains inter-feature correlation and improves classification accuracy. Moreover, it works both online and offline. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
2024
Autores
Sanmartín, AP; Arriba Pérez, Fd; Méndez, SG; Burguillo, JC; Leal, F; Malheiro, B;
Publicação
Good Practices and New Perspectives in Information Systems and Technologies - WorldCIST 2024, Volume 1, Lodz, Poland, 26-28 March 2024.
Abstract
This work applies Natural Language Processing (NLP) techniques, specifically transformer models, for the emotional evaluation of open-ended responses. Today’s powerful advances in transformer architecture, such as ChatGPT, make it possible to capture complex emotional patterns in language. The proposed transformer-based system identifies the emotional features of various texts. The research employs an innovative approach, using prompt engineering and existing context, to enhance the emotional expressiveness of the model. It also investigates spaCy’s capabilities for linguistic analysis and the synergy between transformer models and this technology. The results show a significant improvement in emotional detection compared to traditional methods and tools, highlighting the potential of transformer models in this domain. The method can be implemented in various areas, such as emotional research or mental health monitoring, creating a much richer and complete user profile. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
2024
Autores
de Arriba-Pérez, F; García-Méndez, S; Leal, F; Malheiro, B; Burguillo, JC;
Publicação
MACHINE LEARNING
Abstract
Social media platforms enable the rapid dissemination and consumption of information. However, users instantly consume such content regardless of the reliability of the shared data. Consequently, the latter crowdsourcing model is exposed to manipulation. This work contributes with an explainable and online classification method to recognize fake news in real-time. The proposed method combines both unsupervised and supervised Machine Learning approaches with online created lexica. The profiling is built using creator-, content- and context-based features using Natural Language Processing techniques. The explainable classification mechanism displays in a dashboard the features selected for classification and the prediction confidence. The performance of the proposed solution has been validated with real data sets from Twitter and the results attain 80% accuracy and macro F-measure. This proposal is the first to jointly provide data stream processing, profiling, classification and explainability. Ultimately, the proposed early detection, isolation and explanation of fake news contribute to increase the quality and trustworthiness of social media contents.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.