2018
Authors
Oroszlanyova, M; Lopes, CT; Nunes, S; Ribeiro, C;
Publication
2018 13TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI)
Abstract
The quality of consumer-oriented health information on the Web is usually assessed through the medical certification of websites. These tools are built upon quality indicators but, so far, no standard set of indicators has been defined. The objective of the present study is to explore the popularity of specific document features and their influence on the quality of health web documents, using HON code as ground truth. A set of top-ranked health documents retrieved from a major search engine was characterized in a univariate analysis, and then used in a bivariate analysis to seek features that affect documents' quality. The univariate analysis provides insights into the characteristics of the overall population of the health web documents. The bivariate analysis reveals strong relations between documents' quality and a set of features (namely split content, videos, images, advertisements, English language) that are potential quality indicators. We characterized health web documents and identified specific document features that can be used to assess whether the information in such documents is trustworthy. The main contribution of this work is to provide other features as candidate indicators of quality. Non-health professionals can use these indicators in automatic and manual assessments of health content.
2017
Authors
Karimova, Y; Castro, JA; da Silva, JR; Pereira, N; Rodrigues, J; Ribeiro, C;
Publication
Int. J. Metadata Semant. Ontologies
Abstract
Metadata puts research data in their context, making data intelligible and apt to sustain technology evolution and to be reused, in compliance with the FAIR principles. The workflow proposed in this work includes metadata generation in the context of research projects, created with the Dendro platform, and metadata originated in the interaction of people with the deposited data, created with the B2NOTE service from EUDAT. In our experiments, datasets are prepared with Dendro, taking into consideration general-purpose descriptors and domain-specific ones, then transparently deposited in B2SHARE. After publication, B2NOTE provides an environment where authors, other researchers, and any interested party can enrich the description with less formal comments, tags or keywords. This work contributes with (a) a set of use cases in several domains, (b) details on the descriptors used by authors in each case, and (c) reflections on the use of data after publication, using the B2NOTE contributions. © Copyright 2017 Inderscience Enterprises Ltd.
2018
Authors
Oroszlanyova, M; Lopes, CT; Nunes, S; Ribeiro, C;
Publication
INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL
Abstract
Introduction. The concept and study of relevance has been a central subject in information science. Although research in information retrieval has been focused on topical relevance, other kinds of relevance are also important and justify further study. Motivational relevance is typically inferred by criteria such as user satisfaction and success. Method. Using an existing dataset composed by an annotated set of health Web documents assessed for relevance and comprehension by a group of users, we build a multivariate prediction model for the motivational relevance of search sessions. Analysis. The analysis was based on lasso variable selection, followed by model selection using multiple logistic regression. Results. We have built two regression models; the full model, which considers all variables of the dataset, has a lower estimated prediction error than the reduced model, which contains the statistically-significant variables from the full model. The higher values of evaluation metrics, including accuracy, specificity and sensitivity in the full model support this finding. The full model has an accuracy of 91.94%, and is better at predicting motivational relevance. Conclusions. Our findings suggest features that can be considered by search engines to estimate motivational relevance, to be used in addition to topical relevance. Among these features, a high level of success in Web search and in health information search on social networks and chats are some of the most influencing user features. This shows that users with higher computer literacy might feel more satisfied and successful after completing the search tasks. In terms of task features, the results suggest that users with clearer goals feel more successful. Moreover, results show that users would benefit from the help of the system in clarifying the retrieved documents.
2015
Authors
Rahman, AU; Muzammal, M; David, G; Ribeiro, C;
Publication
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS
Abstract
In many institutions relational databases are used as a tool for managing information related to day to day activities. Institutions may be required to keep the information stored in relational databases accessible because of many reasons including legal requirements and institutional policies. However, the evolution in technology and change in users with the passage of time put the information stored in relational databases in danger. In the long term the information may become inaccessible when the operating system, database management system or the application software is not available any more or the contextual information not stored in the database may be lost thus affecting the authenticity and understandability of the information. This paper presents an approach for preserving relational databases for the long-term. The proposal involves migrating a relational database to a dimensional model which is simple to understand and easy to write queries against. Practical transformation rules are developed by carrying out multiple case studies. One of the case studies is presented as a running example in the paper. Systematic implementation of the rules ensures no loss of information in the process except for the unwanted details. The database preserved using the approach is converted to an open format but may be reloaded to a database management system in the long-term.
2018
Authors
Méndez, E; Crestani, F; Ribeiro, C; David, G; Lopes, JC;
Publication
TPDL
Abstract
2018
Authors
Lopes, CT; Ribeiro, C;
Publication
Experimental IR Meets Multilinguality, Multimodality, and Interaction - 9th International Conference of the CLEF Association, CLEF 2018, Avignon, France, September 10-14, 2018, Proceedings
Abstract
Health information is highly sought on the Web by users that naturally have different levels of expertise in the topics they search for. Assisting users with query formulation is important when users are searching for topics about which they have little knowledge or familiarity. To assist users with health query formulation, we developed a query suggestion system that provides alternative queries combining Portuguese and English language with lay and medico-scientific terminology. Here, we analyze how this system affects the precision of search sessions. Results show that a system providing these suggestions tends to perform better than a system without them. On specific groups of users, clicking on suggestions has positive effects on precision while using them as sources of new terms has the opposite effect. This suggests that a personalized suggestion system might have a good impact on precision. © Springer Nature Switzerland AG 2018.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.