Scienze naturaliIngleseJekyll

Biopragmatics

Unraveling complex biology with biological knowledge graphs. Content licensed under CC BY 4.0.
Pagina inizialeAtom ForaggioMastodon
language
CURIEURIURNIRIIdentifiersScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

The Bioregistry is a database and toolchain for standardization of prefixes, CURIEs, and URIs that appear in linked (open) data. While I created it in 2019 as a component of PyOBO in order to support parsing database cross-references appearing in biomedical ontologies, it has since become an independent project with a community-driven governance model and much broader applications. This post is a first attempt to quantify its usage and impact.

BiomarkerSemantic SpacesBioregistryBiomarkerKBScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

The Bioregistry is a community-driven registry of semantic spaces and their metadata. When I learned about BiomarkerKB at the International Society for Biocuration’s 18th Annual International Biocuration Conference, I was excited to curate new records (and prefixes) in the Bioregistry to cover BiomarkerKB’s semantic spaces on biomarkers.

OntologyEmbeddingsBertSbertSimilarityScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

The Ontology Lookup Service (OLS) is now indexing dense embeddings for ontology terms constructed from term labels, synonyms, and descriptions using LLMs. I maintain a Python client library for the OLS (ols-client) and was recently asked to implement a wrapper to the OLS’s API endpoint that exposes these embeddings.

Ontology MergingSemantic WebSemantic MappingsBioinformaticsOntologiesScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

Assembling and inferring missing semantic mappings is a timely problem in biomedical data and knowledge integration. I’ve been developing the Semantic Mapping Assembler and Reasoner (SeMRA) as a generic toolkit for this. In this blog post, I highlight its inference capabilities. SeMRA implements the chaining and inference rules described in the SSSOM specification.

PythonMypyStatic TypingScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

Following the theme of my previous two posts, I’ve run into another typing conundrum where I want to unpack a pre-existing Callable into a class with Generic[P, T] where P is a parameter specification type (i.e. ParamsSpec) After figuring out the right way to declare a generic featuring a ParamSpec, I updated the class-resolver package to use the shiny new (and more accurate) annotations.

PythonMypyStatic TypingScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

I’ve been working on applying strict static typing to my Python package class-resolver and ran into an interesting way of using generics in combination with parameter specification variables (i.e., ParamSpecs). Normally, if you want to type annotate a function, you use the Callable, which works like the following: from collections.abc import Callable #: the [int] represents a function that takes in a single integer, #: and returns a single

PythonMypyStatic TypingScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

This post describes an issue I’ve had with writing correct types when using PEP-696 defaults in typing.TypeVar. I posted the exploration in a companion repository on GitHub. The motivation behind this comes from my work in biomedical data integration and the semantic web.

ChEBIChEMBLUBERONExperimental Factor OntologyEFOScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

ChEMBL periodically curates clinical trial information into its DRUG_INDICATION table. However, there’s some weird inconsistencies in the way it references disease concepts in external vocabularies. This blog post is an exploration of that table.

BooksScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

Here’s the books I read in 2024. If I were Dudley Dursley, I’d be very upset that I read one fewer new book than in 2023. But then, I’d remember that I re-read a lot of Cosmere in 2024 to prepare for Wind and Truth , which was great.

WikidataBibliometricsOpen DataScienze naturaliInglese
Pubblicato
Autore Charles Tapley Hoyt

I was working on making data about scholarly conferences more FAIR and a big question crossed my mind: what are all the conference venues? This post is about some queries I wrote for Wikidata, data issues I found, and a few drive-by curations that I did while looking for an answer, and my ideas for the future.