IRTUM – Institutional Repository of the Technical University of Moldova

A comprehensive assessment of sequence read archive metadata completeness

Show simple item record

dc.contributor.author BAS, Albert
dc.contributor.author MUNTEANU, Viorel
dc.date.accessioned 2024-10-16T11:19:02Z
dc.date.available 2024-10-16T11:19:02Z
dc.date.issued 2024
dc.identifier.citation BAS, Albert and Viorel MUNTEANU. A comprehensive assessment of sequence read archive metadata completeness. In: Conferinţa tehnico-ştiinţifică a studenţilor, masteranzilor şi doctoranzilor = Technical Scientific Conference of Undergraduate, Master and PhD Students, Universitatea Tehnică a Moldovei, 27-29 martie 2024. Chișinău, 2024, vol. 2, pp. 1040-1045. ISBN 978-9975-64-458-7, ISBN 978 9975-64-460-0 (Vol. 2). en_US
dc.identifier.isbn 978-9975-64-458-7
dc.identifier.isbn 978 9975-64-460-0
dc.identifier.uri http://repository.utm.md/handle/5014/28097
dc.description.abstract Recent advances in high-throughput sequencing technologies have enabled the collection and sharing of a vast amount of omics data, along with its associated metadata. Enhancing the availability of this metadata is crucial to ensure the reusability and reproducibility of raw data, as well as for facilitating novel biomedical discoveries through efficient data reuse. In this study, we performed a comprehensive assessment of metadata completeness by analyzing over 26,000,000 experiments shared in the Sequence Read Archive (SRA) from 2008 to 2023. Our results show that the countries of Central Europe, the USA and China show dominance in generating sequencing data, corresponding to 45%, 16% and correspondingly 8% of total data in the SRA repository, the most frequently used platform is ILLUMINA (90%). Identified that some of the metadata contains inconsistencies in completeness: the absence of temporary identifiers (5.2%), the lack of assigned TaxonomyID (5%), and the absence of library strategy (8%). Our results highlight the urgent need for improved metadata sharing practices and the standardization of reporting. en_US
dc.language.iso en en_US
dc.publisher Universitatea Tehnică a Moldovei en_US
dc.relation.ispartofseries Conferinţa tehnico-ştiinţifică a studenţilor, masteranzilor şi doctoranzilor = Technical Scientific Conference of Undergraduate, Master and PhD Students: Chişinău, 27-29 martie 2024. Vol. 2;
dc.rights Attribution-NonCommercial-NoDerivs 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/us/ *
dc.subject metadata en_US
dc.subject data reusability en_US
dc.subject Sequence Read Archive en_US
dc.subject sequencing en_US
dc.title A comprehensive assessment of sequence read archive metadata completeness en_US
dc.type Article en_US


Files in this item

The following license files are associated with this item:

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 United States Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 United States

Search DSpace


Browse

My Account