2022 Preprint
Unifying the Identification of Biomedical Entities with the Bioregistry
Abstract: The standardized identification of biomedical entities is a cornerstone of interoperability, reuse, and data integration in the life sciences. Several registries have been developed to catalog resources maintaining identifiers for biomedical entities such as small molecules, proteins, cell lines, and clinical trials. However, existing registries have struggled to provide sufficient coverage and metadata standards that meet the evolving needs of modern life sciences researchers. Here, we introduce the Bioregist…
View published versions
Search citation statements
Paper Sections
Select...
5
1
1
0
Citation Types
0
7
0
0
Year Published
2022
2025
Publication Types
Select...
4
1
1
Relationship
1
5
Authors
Journals
Cited by 6 publications
(7 citation statements)
References 70 publications
0
7
0
0
“…by assigning more precise predicates and including provenance information). Second, we hope to see these resources converging on external standards for the syntax and semantics used to communicate the entities and predicates appearing in mappings, such as the Bioregistry (Hoyt et al ., 2022c) in order to improve interoperability. Third, we hope to see large-scale efforts to aggregate, store, and redistribute mappings with more general scope than existing mapping services.…”
Section: Discussionmentioning
confidence: 99%
“…by assigning more precise predicates and including provenance information). Second, we hope to see these resources converging on external standards for the syntax and semantics used to communicate the entities and predicates appearing in mappings, such as the Bioregistry (Hoyt et al ., 2022c) in order to improve interoperability. Third, we hope to see large-scale efforts to aggregate, store, and redistribute mappings with more general scope than existing mapping services.…”
Section: Discussionmentioning
confidence: 99%
“…The first step of the cycle comprises retrieval and preprocessing of target identifier resources, including any existing mappings between the resources. We automate this process for ontologies by using the Bioregistry (Hoyt et al ., 2022c) to locate the ontology (i.e., with a URL) and ROBOT (Jackson et al ., 2019) to parse it. Similarly, we use custom automated preprocessing workflows in PyOBO (Hoyt et al ., 2022b) for other identifier resource types (e.g., databases like HGNC).…”
Section: Methodsmentioning
confidence: 99%
“…Additionally, some PX resources have their own identifiers for datasets, that can also be used in parallel to the PXD identifiers. Furthermore, Digital Object Identifiers (DOIs) can also be issued for ‘Complete’ submissions (see below for more details about submission types) and PXD identifiers are resolved by the identifier resolution services identifiers.org ( 15 ) and Bioregistry ( 16 ). In terms of data license, all PX resources moved to a default Creative Commons CC0 license as the basis in 2020.…”
Section: Current Px Data Workflow and Implementation Of Psi Data Stan...mentioning
confidence: 99%
“…Where possible, SeMRA wraps preexisting parsers for standard representations. For instance, SeMRA reads mappings from ontologies in OBO format by wrapping the PyOBO Python package [29]. Similarly, SeMRA reads mappings from ontologies in the OWL and OBO Graph JSON formats using the Bioontologies Python package [25].…”
Section: Sourcesmentioning
confidence: 99%
