2023
Autores
Muhammad, SH; Abdulmumin, I; Ayele, AA; Ousidhoum, N; Adelani, DI; Yimam, SM; Ahmad, IS; Beloucif, M; Mohammad, SM; Ruder, S; Hourrane, O; Jorge, A; Brazdil, P; António Ali, FDM; David, D; Osei, S; Bello, BS; Lawan, FI; Gwadabe, T; Rutunda, S; Belay, TD; Messelle, WB; Balcha, HB; Chala, SA; Gebremichael, HT; Opoku, B; Arthur, S;
Publicação
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023
Abstract
Africa is home to over 2,000 languages from more than six language families and has the highest linguistic diversity among all continents. These include 75 languages with at least one million speakers each. Yet, there is little NLP research conducted on African languages. Crucial to enabling such research is the availability of high-quality annotated datasets. In this paper, we introduce AfriSenti, a sentiment analysis benchmark that contains a total of >110,000 tweets in 14 African languages (Amharic, Algerian Arabic, Hausa, Igbo, Kinyarwanda, Moroccan Arabic, Mozambican Portuguese, Nigerian Pidgin, Oromo, Swahili, Tigrinya, Twi, Xitsonga, and Yorùbá) from four language families. The tweets were annotated by native speakers and used in the AfriSenti-SemEval shared task 1. We describe the data collection methodology, annotation process, and the challenges we dealt with when curating each dataset. We further report baseline experiments conducted on the different datasets and discuss their usefulness. ©2023 Association for Computational Linguistics.
2023
Autores
Ribeiro, RP; Mastelini, SM; Davari, N; Aminian, E; Veloso, B; Gama, J;
Publicação
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II
Abstract
Predictive Maintenance applications are increasingly complex, with interactions between many components. Black-box models are popular approaches due to their predictive accuracy and are based on deep-learning techniques. This paper presents an architecture that uses an online rule learning algorithm to explain when the black-box model predicts rare events. The system can present global explanations that model the black-box model and local explanations that describe why the black-box model predicts a failure. We evaluate the proposed system using four real-world public transport data sets, presenting illustrative examples of explanations.
2023
Autores
Tome, ES; Ribeiro, RP; Veloso, B; Gama, J;
Publicação
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II
Abstract
An online data-driven predictive maintenance approach for railway switches using data logs obtained from the interlocking system of the railway infrastructure is proposed in this paper. The proposed approach is detailed described and consists of a two-phase process: anomaly detection and remaining useful life prediction. The approach is applied to and validated in a real case study, the Metro do Porto, from which seven months of data is available. The approach has been revealed to be satisfactory in detecting anomalies. The results open the possibilities for further studies and validation with a more extensive dataset on the remaining useful life prediction.
2023
Autores
Davari, N; Veloso, B; Ribeiro, RP; Gama, J;
Publicação
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II
Abstract
The demand for high-performance solutions for anomaly detection and forecasting fault events is increasing in the industrial area. The detection and forecasting faults from time-series data are one critical mission in the Internet of Things (IoT) data mining. The classical fault detection approaches based on physical modelling are limited to some measurable output variables. Accurate physical modelling of vehicle dynamics requires substantial prior information about the system. On the other hand, data-driven modelling techniques accurately represent the system's dynamic from data collection. Experimental results on large-scale data sets from Metro do Porto subsystems verify that our method performs high-quality fault detection and forecasting solutions. Also, health indicator obtained from the principal component analysis of the forecasting solution is applied to predict the remaining useful life.
2023
Autores
Koprinska, I; Mignone, P; Guidotti, R; Jaroszewicz, S; Fröning, H; Gullo, F; Ferreira, PM; Roqueiro, D; Ceddia, G; Nowaczyk, S; Gama, J; Ribeiro, RP; Gavaldà, R; Masciari, E; Ras, ZW; Ritacco, E; Naretto, F; Theissler, A; Biecek, P; Verbeke, W; Schiele, G; Pernkopf, F; Blott, M; Bordino, I; Danesi, IL; Ponti, G; Severini, L; Appice, A; Andresini, G; Medeiros, I; Graça, G; Cooper, LAD; Ghazaleh, N; Richiardi, J; Miranda, DS; Sechidis, K; Canakoglu, A; Pidò, S; Pinoli, P; Bifet, A; Pashami, S;
Publicação
PKDD/ECML Workshops (1)
Abstract
2023
Autores
Koprinska, I; Mignone, P; Guidotti, R; Jaroszewicz, S; Fröning, H; Gullo, F; Ferreira, PM; Roqueiro, D; Ceddia, G; Nowaczyk, S; Gama, J; Ribeiro, RP; Gavaldà, R; Masciari, E; Ras, ZW; Ritacco, E; Naretto, F; Theissler, A; Biecek, P; Verbeke, W; Schiele, G; Pernkopf, F; Blott, M; Bordino, I; Danesi, IL; Ponti, G; Severini, L; Appice, A; Andresini, G; Medeiros, I; Graça, G; Cooper, LAD; Ghazaleh, N; Richiardi, J; Miranda, DS; Sechidis, K; Canakoglu, A; Pidò, S; Pinoli, P; Bifet, A; Pashami, S;
Publicação
PKDD/ECML Workshops (2)
Abstract
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.