Ciências NaturaisInglêsJekyll

Biopragmatics

Unraveling complex biology with biological knowledge graphs. Content licensed under CC BY 4.0.
Pagina inicialFeed AtomMastodon
language
SSSOMSemantic MappingsKnowledge GraphsInglês
Publicados
Autor Charles Tapley Hoyt

Data and knowledge originating from heterogeneous sources often use heterogeneous controlled vocabularies and/or ontologies for annotating named entities. Semantic mappings are essential towards resolving these discrepancies and integrating in a coherent way.

SSSOMSKOSSemantic MappingsMappingsInteroperabilityInglês
Publicados
Autor Charles Tapley Hoyt

JSKOS (JSON for Knowledge Organization Systems) is a JSON-based data model for representing terminologies, thesauri, classifications, and other semantic artifacts. Like the Simple Standard for Sharing Ontological Mappings (SSSOM), it can also encode semantic mappings. This post is about developing and implementing a crosswalk between them in the sssom-pydantic Python package.

SSSOMWikidataSKOSSemantic MappingsMappingsInglês
Publicados
Autor Charles Tapley Hoyt

At the 4th Ontologies4Chem Workshop in Limburg an der Lahn, I proposed an initial crosswalk between the Simple Standard for Sharing Ontological Mappings (SSSOM) and the Wikidata semantic mapping data model. This post describes the motivation for this proposal and the concrete implementation I’ve developed in sssom-pydantic.

LinkMLBioregistryPrefix MapsCURIEsURIsInglês
Publicados
Autor Charles Tapley Hoyt

LinkML enables defining data models and data schemas in YAML informed by semantic web best practices. As such, each definition includes a prefix map. Similarly to my previous posts on validating the prefix maps appearing in Turtle files and in unfamiliar SPARQL endpoints, this post showcases describes a new extension to the Bioregistry that validates prefix maps in LinkML definitions.

Named Entity RecognitionText MiningNatural Language ProcessingNamed Entity NormalizationMedical Subject HeadingsInglês
Publicados
Autor Charles Tapley Hoyt

Annotating the literature with mentions of key concepts from a given domain is often the first step towards extracting more substantial structured knowledge. This can be challenging, as it typically encompasses acquiring and processing the relevant literature and ontologies then installing and applying difficult-to-use named entity recognition (NER) workflows. This post highlights software components I’ve implemented to simplify this workflow.

LinkMLBioregistryPrefix MapsCURIEsURIsInglês
Publicados
Autor Charles Tapley Hoyt

I recently attended the 4th BioHackathon Germany hosted by the German Network for Bioinformatics Infrastructure (de.NBI). I participated in the project On the Path to Machine-actionable Training Materials in order to improve the interoperability between DALIA, TeSS, mTeSS-X, and Schema.org. This post gives a summary of the activities leading up to the hackathon and the results of our happy hacking.

BioPortalOntoPortalSSSOMCiências NaturaisInglês
Publicados
Autor Charles Tapley Hoyt

Earlier this week, a question was asked on OBO Foundry Slack on where to find semantic mappings to terms in the Systematized Nomenclature of Medicine - Clinical Terms (SNOMED-CT). While some are available in the SeMRA Disease Mappings Database, there are many more available within BioPortal, which has access to the entire SNOMED-CT source data and has produced semantic mapping predictions using LOOM.

OntologyOWLGenesCiências NaturaisInglês
Publicados
Autor Charles Tapley Hoyt

This is the first of a two-part post about encoding databases as ontologies. In this post, I give a background on the problems in biocuration that led me to start encoding databases as ontologies, the software I have written to do it, and the repository I have created to store the resulting artifacts in a FAIR, open, and sustainable way.

OntologyOWLGenesHGNCCiências NaturaisInglês
Publicados
Autor Charles Tapley Hoyt

This is the second of a two-part post about encoding databases as ontologies. In the first part, I gave a background on how I started working on this problem and the software stack I developed along the way. In this post, I explain the philosophy and design about how I encoded the HGNC (HUGO Gene Nomenclature Committee) database as an ontology using PyOBO.

Knowledge GraphsSparqlChemistryCultureCultural HeritageCiências NaturaisInglês
Publicados
Autor Charles Tapley Hoyt

At the sixth NFDI4Chem consortium meeting, Torsten Schrade from the NFDI4Culture consortium gave a lovely and whimsical talk entitled A Data Alchemist’s Journey through NFDI which explored ways that we might federate and jointly query both consortia’s knowledge via their respective SPARQL endpoints. He proposed a toy example in which he linked paintings depicting alchemists trying to make gold to compounds containing gold.