Ciências NaturaisInglêsHugo

Donny Winston

Donny Winston
Made as simple as possible, but not simpler.
Pagina inicialFeed AtomMastodon
language
Ciências NaturaisInglês
Publicados

Inference based on semantic retrieval is more robust than inference based on syntactic parsing. In order to be authoritative, identifiers should be assigned as early as practicable in the creation process, but minting is not binding. Identifier resolution delays binding; identifier structures induce binding. Moral: Structure identifiers late (or never) in the minting process.

Ciências NaturaisInglês
Publicados

Developers often resort to shotgun parsing : scattering data checks and fallback values in various places throughout the system’s main logic. 1 The habit of scattering parser-like behaviour throughout an application’s code and the resulting inconsistencies in data handling can often lead not just to annoying complications and bugs, but also security vulnerabilities. 2 This is about reading data.

Ciências NaturaisInglês
Publicados

This week on Machine-Centric Science, I interviewed Shreyas Cholia, currently at the Lawrence Berkeley National Laboratory in Berkeley, California. Topics we spoke about included: data lifecycles, edge computing for data firehoses, provenance, standards, broad versus detailed domain vocabularies, scope for common APIs, and identifier leveling.

Ciências NaturaisInglês
Publicados

Arrangements of bits have structure just like arrangements of atoms have structure. Interoperability is about aligning structure. Processing, properties, performance – if their characterization can be repeated, they have information structure.

Ciências NaturaisInglês
Publicados

“My data model is JSON”. JSON is not a data model. JSON has no semantics in the context of information systems; JSON defines neither how data “behaves” nor how machines can compute with it. “My data is just JSON”. Your data is never just JSON; you always impose external semantics. “JSON is easy to understand”. What does the field "harrastukset" mean?

Ciências NaturaisInglês
Publicados

Whether deliberate 1 or inevitable 2 , the hourglass architecture of the Internet supports a great diversity of applications implemented using a great diversity of supporting services: Could there be a minimal “spanning layer” protocol for FAIR-principled 3 applications and services? The FAIR Digital Object (FDO) has emerged as a conceptual nexus for consideration of such a protocol.

Ciências NaturaisInglês
Publicados

Validation is about preconditions for operation. It may be useful to separate preconditions into three subtypes: syntax, semantics, and pragmatics. 1 Syntax : Rules about what’s grammatically well-formed. Example: A CalculateAqueousStability command may have a set of atomic-composition pairs and a set of ion-concentration pairs.

Ciências NaturaisInglês
Publicados

Content classification is the most fundamental form of holistic content understanding. It helps make your resources findable (F2) and connects them to other resources (I3). Content understanding represents each piece of content in the index. Relevance of content is a function of query and content understanding. Query understanding represents each search query as a search intent.