Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by João Aguiar Castro

2017

A comparison of research data management platforms: architecture, flexible metadata and interoperability

Authors
Amorim, RC; Castro, JA; da Silva, JR; Ribeiro, C;

Publication
UNIVERSAL ACCESS IN THE INFORMATION SOCIETY

Abstract
Research data management is rapidly becoming a regular concern for researchers, and institutions need to provide them with platforms to support data organization and preparation for publication. Some institutions have adopted institutional repositories as the basis for data deposit, whereas others are experimenting with richer environments for data description, in spite of the diversity of existing workflows. This paper is a synthetic overview of current platforms that can be used for data management purposes. Adopting a pragmatic view on data management, the paper focuses on solutions that can be adopted in the long tail of science, where investments in tools and manpower are modest. First, a broad set of data management platforms is presented-some designed for institutional repositories and digital libraries-to select a short list of the more promising ones for data management. These platforms are compared considering their architecture, support for metadata, existing programming interfaces, as well as their search mechanisms and community acceptance. In this process, the stakeholders' requirements are also taken into account. The results show that there is still plenty of room for improvement, mainly regarding the specificity of data description in different domains, as well as the potential for integration of the data management platforms with existing research management tools. Nevertheless, depending on the context, some platforms can meet all or part of the stakeholders' requirements.

2017

Description + annotation: semantic data publication workflow with Dendro and B2NOTE

Authors
Karimova, Y; Castro, JA; da Silva, JR; Pereira, N; Rodrigues, J; Ribeiro, C;

Publication
Int. J. Metadata Semant. Ontologies

Abstract
Metadata puts research data in their context, making data intelligible and apt to sustain technology evolution and to be reused, in compliance with the FAIR principles. The workflow proposed in this work includes metadata generation in the context of research projects, created with the Dendro platform, and metadata originated in the interaction of people with the deposited data, created with the B2NOTE service from EUDAT. In our experiments, datasets are prepared with Dendro, taking into consideration general-purpose descriptors and domain-specific ones, then transparently deposited in B2SHARE. After publication, B2NOTE provides an environment where authors, other researchers, and any interested party can enrich the description with less formal comments, tags or keywords. This work contributes with (a) a set of use cases in several domains, (b) details on the descriptors used by authors in each case, and (c) reflections on the use of data after publication, using the B2NOTE contributions. © Copyright 2017 Inderscience Enterprises Ltd.

2018

Research Data Management Tools and Workflows: Experimental Work at the University of Porto

Authors
Ribeiro, C; Rocha da Silva, J; Aguiar Castro, J; Carvalho Amorim, R; Correia Lopes, J; David, G;

Publication
IASSIST Quarterly

Abstract
Research datasets include all kinds of objects, from web pages to sensor data, and originate in every domain. Concerns with data generated in large projects and well-funded research areas are centered on their exploration and analysis. For data in the long tail, the main issues are still how to get data visible, satisfactorily described, preserved, and searchable. Our work aims to promote data publication in research institutions, considering that researchers are the core stakeholders and need straightforward workflows, and that multi-disciplinary tools can be designed and adapted to specific areas with a reasonable effort. For small groups with interesting datasets but not much time or funding for data curation, we have to focus on engaging researchers in the process of preparing data for publication, while providing them with measurable outputs. In larger groups, solutions have to be customized to satisfy the requirements of more specific research contexts. We describe our experience at the University of Porto in two lines of enquiry. For the work with long-tail groups we propose general-purpose tools for data description and the interface to multi-disciplinary data repositories. For areas with larger projects and more specific requirements, namely wind infrastructure, sensor data from concrete structures and marine data, we define specialized workflows. In both cases, we present a preliminary evaluation of results and an estimate of the kind of effort required to keep the proposed infrastructures running.  The tools available to researchers can be decisive for their commitment. We focus on data preparation, namely on dataset organization and metadata creation. For groups in the long tail, we propose Dendro, an open-source research data management platform, and explore automatic metadata creation with LabTablet, an electronic laboratory notebook. For groups demanding a domain-specific approach, our analysis has resulted in the development of models and applications to organize the data and support some of their use cases. Overall, we have adopted ontologies for metadata modeling, keeping in sight metadata dissemination as Linked Open Data.

2018

Research data management in the field of Ecology: An overview

Authors
Alves, C; Castro, JA; Ribeiro, C; Honrado, JP; Lomba, A;

Publication
Proceedings of the International Conference on Dublin Core and Metadata Applications

Abstract
The diversity of research topics and resulting datasets in the field of Ecology (the scientific study of ecological systems and their biodiversity) has grown in parallel with developments in research data management. Based on a meta-analysis performed on 93 scientific references, this paper presents a comprehensive overview of the use of metadata tools in the Ecology domain through time. Overall, 40 metadata tools were found to be either referred or used by the research community from 1997 to 2018. In the same period, 50 different initiatives in ecology and biodiversity research were conceptualized and implemented to promote effective data sharing in the community. A relevant concern that stems from this analysis is the need to establish simple methods to promote data interoperability and reuse, so far limited by the production of metadata according to different standards. With this study, we also highlight challenges and perspectives in research data management in the domain of Ecology towards best practice guidelines.

2019

Data Deposit in a CKAN Repository: A Dublin Core-Based Simplified Workflow

Authors
Karimova, Y; Castro, JA; Ribeiro, C;

Publication
Digital Libraries: Supporting Open Science - 15th Italian Research Conference on Digital Libraries, IRCDL 2019, Pisa, Italy, January 31 - February 1, 2019, Proceedings

Abstract
Researchers are currently encouraged by their institutions and the funding agencies to deposit data resulting from projects. Activities related to research data management, namely organization, description, and deposit, are not obvious for researchers due to the lack of knowledge on metadata and the limited data publication experience. Institutions are looking for solutions to help researchers organize their data and make them ready for publication. We consider here the deposit process for a CKAN-powered data repository managed as part of the IT services of a large research institute. A simplified data deposit process is illustrated here by means of a set of examples where researchers describe their data and complete the publication in the repository. The process is organised around a Dublin Core-based dataset deposit form, filled by the researchers as preparation for data deposit. The contacts with researchers provided the opportunity to gather feedback about the Dublin Core metadata and the overall experience. Reflections on the ongoing process highlight a few difficulties in data description, but also show that researchers are motivated to get involved in data publication activities.

2019

Hands-On Data Publishing with Researchers: Five Experiments with Metadata in Multiple Domains

Authors
Rodrigues, J; Castro, JA; da Silva, JR; Ribeiro, C;

Publication
Digital Libraries: Supporting Open Science - 15th Italian Research Conference on Digital Libraries, IRCDL 2019, Pisa, Italy, January 31 - February 1, 2019, Proceedings

Abstract
The current requirements for open data in the EU are increasing the awareness of researchers with respect to data management and data publication. Metadata is essential in research data management, namely on data discovery and reuse. Current practices tend to either leave metadata definition to researchers, or to assign their creation to curators. The former typically results in ad-hoc descriptors, while the latter follows standards but lacks specificity. In this exploratory study, we adopt a researcher-curator collaborative approach in five data publication cases, involving researchers in data description and discussing the use of both generic and domain-oriented metadata. The study shows that researchers working on familiar datasets can contribute effectively to the definition of metadata models, in addition to the actual metadata creation. The cases also provide preliminary evidence of cross-disciplinary descriptor use. Moreover, the interaction with curators highlights the advantages of data management, making researchers more open to participate in the corresponding tasks. © Springer Nature Switzerland AG 2019.

  • 3
  • 4