Rogue Scholar

Ontology MergingSemantic WebSemantic MappingsBioinformaticsOntologiesSciences naturellesAnglais

Inference over Semantic Mappings with SeMRA

Publié 28 avril 2025

Auteur Charles Tapley Hoyt

Assembling and inferring missing semantic mappings is a timely problem in biomedical data and knowledge integration. I’ve been developing the Semantic Mapping Assembler and Reasoner (SeMRA) as a generic toolkit for this. In this blog post, I highlight its inference capabilities. SeMRA implements the chaining and inference rules described in the SSSOM specification.

PythonMypyStatic TypingSciences naturellesAnglais

I wish I could unpack Callables in Python type annotations

https://doi.org/10.59350/tcz2x-n4d84

Publié 23 avril 2025

Auteur Charles Tapley Hoyt

Following the theme of my previous two posts, I’ve run into another typing conundrum where I want to unpack a pre-existing Callable into a class with Generic[P, T] where P is a parameter specification type (i.e. ParamsSpec) After figuring out the right way to declare a generic featuring a ParamSpec, I updated the class-resolver package to use the shiny new (and more accurate) annotations.

PythonMypyStatic TypingSciences naturellesAnglais

Using ParamSpec with Python Generics

https://doi.org/10.59350/a9srr-an019

Publié 22 avril 2025

Auteur Charles Tapley Hoyt

I’ve been working on applying strict static typing to my Python package class-resolver and ran into an interesting way of using generics in combination with parameter specification variables (i.e., ParamSpecs). Normally, if you want to type annotate a function, you use the Callable, which works like the following: from collections.abc import Callable #: the [int] represents a function that takes in a single integer, #: and returns a single

PythonMypyStatic TypingSciences naturellesAnglais

A dilemma with PEP-696 default generics when using optional static typing in Python

https://doi.org/10.59350/3zq9w-my741

Publié 19 avril 2025

Auteur Charles Tapley Hoyt

This post describes an issue I’ve had with writing correct types when using PEP-696 defaults in typing.TypeVar. I posted the exploration in a companion repository on GitHub. The motivation behind this comes from my work in biomedical data integration and the semantic web.

ChEBIChEMBLUBERONExperimental Factor OntologyEFOSciences naturellesAnglais

The EFO_ID column in ChEMBL’s drug indications table isn’t what you think it is

https://doi.org/10.59350/mmrpx-qda35

Publié 17 avril 2025

Auteur Charles Tapley Hoyt

ChEMBL periodically curates clinical trial information into its DRUG_INDICATION table. However, there’s some weird inconsistencies in the way it references disease concepts in external vocabularies. This blog post is an exploration of that table.

Clinical TrialsClinicalTrials.govOntologiesOBIChEBISciences naturellesAnglais

Data Modeling and Integration with Clinical Trials

https://doi.org/10.59350/jkdtn-kgs07

Publié 23 janvier 2025

Auteur Charles Tapley Hoyt

I’ve recently worked with clinical studies from ClinicalTrials.gov and other international registries.

BooksSciences naturellesAnglais

Books I Read in 2024

https://doi.org/10.59350/psy2m-adk43

Publié 18 janvier 2025

Auteur Charles Tapley Hoyt

Here’s the books I read in 2024. If I were Dudley Dursley, I’d be very upset that I read one fewer new book than in 2023. But then, I’d remember that I re-read a lot of Cosmere in 2024 to prepare for Wind and Truth , which was great.

WikidataBibliometricsOpen DataSciences naturellesAnglais

Exploring Event Venues in Wikidata

https://doi.org/10.59350/53dah-9vf82

Publié 17 janvier 2025

Auteur Charles Tapley Hoyt

I was working on making data about scholarly conferences more FAIR and a big question crossed my mind: what are all the conference venues? This post is about some queries I wrote for Wikidata, data issues I found, and a few drive-by curations that I did while looking for an answer, and my ideas for the future.

FundingOpen SourceSciences naturellesAnglais

Notes on Open Source Funding

https://doi.org/10.59350/eckhy-09r58

Publié 3 décembre 2024

Auteur Charles Tapley Hoyt

This stub post contains my notes about funding for open source software. It doesn’t follow a story like a lot of my posts, and is more like an ever-evolving notes sheet.

ReadingAutomationSciences naturellesAnglais

Downloading Audio from Soundcloud

https://doi.org/10.59350/683zj-mfk55

Publié 3 décembre 2024

Auteur Charles Tapley Hoyt

Brandon Sanderson has been releasing a few chapters a week of his upcoming novel, Wind and Truth, on his publisher’s website leading up to its December 6 ^th release. This includes the audiobook chapters, but they’re posted to Soundcloud and there’s no good way to listen at 1.6x speed. This post is a note sheet on how to download audio from Soundcloud and prepare it for my audiobook reader.

PythonPackagingCookiecutterDocumentationSciences naturellesAnglais

Dependency Groups and ReadTheDocs

https://doi.org/10.59350/3v5dd-78w25

Publié 19 novembre 2024

Auteur Charles Tapley Hoyt

PEP 735 introduced dependency groups in packaging metadata, which are complementary to optional dependencies in that they might not correspond to features in the package, but rather be something like development or release dependencies. I am slowly working towards updating my cookiecutter template cookiecutter-snekpack to use PEP 735. So far, uv and tox have released support - all that’s left is ReadTheDocs.

Biopragmatics

Inference over Semantic Mappings with SeMRA

I wish I could unpack Callables in Python type annotations

Using ParamSpec with Python Generics

A dilemma with PEP-696 default generics when using optional static typing in Python

The EFO_ID column in ChEMBL’s drug indications table isn’t what you think it is

Data Modeling and Integration with Clinical Trials

Books I Read in 2024

Exploring Event Venues in Wikidata

Notes on Open Source Funding

Downloading Audio from Soundcloud

Dependency Groups and ReadTheDocs