Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by Benedita Malheiro

2024

Balancing Plug-In for Stream-Based Classification

Authors
de Arriba Pérez, F; García Méndez, S; Leal, F; Malheiro, B; Burguillo Rial, JC;

Publication
Lecture Notes in Networks and Systems

Abstract
The latest technological advances drive the emergence of countless real-time data streams fed by users, sensors, and devices. These data sources can be mined with the help of predictive and classification techniques to support decision-making in fields like e-commerce, industry or health. In particular, stream-based classification is widely used to categorise incoming samples on the fly. However, the distribution of samples per class is often imbalanced, affecting the performance and fairness of machine learning models. To overcome this drawback, this paper proposes Bplug, a balancing plug-in for stream-based classification, to minimise the bias introduced by data imbalance. First, the plug-in determines the class imbalance degree and then synthesises data statistically through non-parametric kernel density estimation. The experiments, performed with real data from Wikivoyage and Metro of Porto, show that Bplug maintains inter-feature correlation and improves classification accuracy. Moreover, it works both online and offline. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

2024

Emotional Evaluation of Open-Ended Responses with Transformer Models

Authors
Sanmartín, AP; Arriba Pérez, Fd; Méndez, SG; Burguillo, JC; Leal, F; Malheiro, B;

Publication
Good Practices and New Perspectives in Information Systems and Technologies - WorldCIST 2024, Volume 1, Lodz, Poland, 26-28 March 2024.

Abstract
This work applies Natural Language Processing (NLP) techniques, specifically transformer models, for the emotional evaluation of open-ended responses. Today’s powerful advances in transformer architecture, such as ChatGPT, make it possible to capture complex emotional patterns in language. The proposed transformer-based system identifies the emotional features of various texts. The research employs an innovative approach, using prompt engineering and existing context, to enhance the emotional expressiveness of the model. It also investigates spaCy’s capabilities for linguistic analysis and the synergy between transformer models and this technology. The results show a significant improvement in emotional detection compared to traditional methods and tools, highlighting the potential of transformer models in this domain. The method can be implemented in various areas, such as emotional research or mental health monitoring, creating a much richer and complete user profile. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

2024

Exposing and explaining fake news on-the-fly

Authors
de Arriba-Pérez, F; García-Méndez, S; Leal, F; Malheiro, B; Burguillo, JC;

Publication
MACHINE LEARNING

Abstract
Social media platforms enable the rapid dissemination and consumption of information. However, users instantly consume such content regardless of the reliability of the shared data. Consequently, the latter crowdsourcing model is exposed to manipulation. This work contributes with an explainable and online classification method to recognize fake news in real-time. The proposed method combines both unsupervised and supervised Machine Learning approaches with online created lexica. The profiling is built using creator-, content- and context-based features using Natural Language Processing techniques. The explainable classification mechanism displays in a dashboard the features selected for classification and the prediction confidence. The performance of the proposed solution has been validated with real data sets from Twitter and the results attain 80% accuracy and macro F-measure. This proposal is the first to jointly provide data stream processing, profiling, classification and explainability. Ultimately, the proposed early detection, isolation and explanation of fake news contribute to increase the quality and trustworthiness of social media contents.

  • 23
  • 23