Rogue Scholar

DggsFrankenplaceGridSearchComputer and Information Sciences

Frankenplace, geospatial search, and discrete global grid systems

Published May 28, 2019

Quick note on Frankenplace, a cool search tool that displays the geographic distribution of documents that match the user's query as a heatmap. Details of how the tool works are given in: At the heart of the method is a discrete global grid that divides the world up into small areas of the same size.

DBpediaKnowledge GraphNatural LanguageWikidataWikipediaComputer and Information Sciences

Ozymandias meets Wikipedia, with notes on natural language generation

https://doi.org/10.59350/wa20b-ygh11

Published May 28, 2019

Author Roderic Page

I've tweaked Ozymandias to now include short natural language summaries (snippets) for various taxa. This makes the output a little more friendly and informative. For example, here's a snippet from the page on Cephalodesmius , a dung beetle that makes its own dung. These snippets come from Wikipedia, well actually, from the DBpedia project.

Computer and Information Sciences

Ozymandias: A biodiversity knowledge graph published in PeerJ

https://doi.org/10.59350/r65vb-dsg51

Published April 10, 2019

Author Roderic Page

My paper "Ozymandias: A biodiversity knowledge graph" has been published in PeerJ https://doi.org/10.7717/peerj.6739 The paper describes my entry in GBIF's 2018 Ebbe Nielsen Challenge, which you can explore here. I tweeted about its publication yesterday, and got some interesting responses (and lots of retweets, thanks to everyone for those). Carl Boettiger (@cboettig) asked where the triples were, as did Kingsley Uyi Idehen (@kidehen). Doh!

Computer and Information Sciences

Where is the damned collection? Wikidata, GrBio, and a global list of all natural history collections

https://doi.org/10.59350/xgztb-tq056

Published March 24, 2019

Author Roderic Page

One of the things the biodiversity informatics community has struggled to do is come up with a list of all natural history collections (Taylor, 2016). Most recently GrBio attempted to do this, and appealed for community help to curate the list (Schindel et al., 2016), but this did not emerge, and at the time of writing GrBio is moribund.

ChallengeKnowledge GraphOzymandiasComputer and Information Sciences

Ozymandias: A biodiversity knowledge graph available as a preprint on Biorxiv

https://doi.org/10.59350/jw07y-48b02

Published December 5, 2018

Author Roderic Page

I've written up my entry for the 2018 GBIF Challenge ("Ozymandias") and posted a preprint on Biorxiv (https://www.biorxiv.org/content/early/2018/12/04/485854). The DOI is https://doi.org/10.1101/485854 which, last time I checked, still needs to be registered. The abstract appears below. I'll let the preprint sit there for a little while before I summon the enthusiasm to revisit it, tidy it up, and submit it for publication.

BioRxivGBIFGenbankGeocodingSpecimen CodesComputer and Information Sciences

Geocoding genomic databases using GBIF

https://doi.org/10.59350/35kwk-1ty15

Published November 15, 2018

Author Roderic Page

I've put a short note up on bioRxiv about ways to geocode nucleotide sequences in databases such as GenBank. The preprint is "Geocoding genomic databases using GBIF" https://doi.org/10.1101/469650.

DifferenceTaxonomic ConceptTaxonomyVersion ControlComputer and Information Sciences

Taxonomic publications as patch files and the notion of taxonomic concepts

https://doi.org/10.59350/p9bbs-96669

Published October 25, 2018

Author Roderic Page

There's a slow-burning discussion on taxonomic concepts on Github that I am half participating in. As seems inevitable in any discussion of taxonomy, there's a lot of floundering about given that there's lots of jargon - much of it used in different ways by different people - and people are coming at the problem from different perspectives. In one sense, taxonomy is pretty straightforward.

Computer and Information Sciences

Specimens, collections, researchers, and publications: towards social and citation graphs for natural history collections

https://doi.org/10.59350/mkn7d-ayc87

Published October 24, 2018

Author Roderic Page

Being in Ottawa last week for a hackathon meant I could catch up with David Shorthouse (@dpsSpiders. David has been doing some neat work on linking specimens to identifiers for researchers, such as ORCIDs, and tracking citations of specimens in the literature. David's Bloodhound tool processes lots of GBIF data for occurrences with names of those who collected or identified specimens.

DockerWikibaseWikidataComputer and Information Sciences

Ottawa Ecobiomics hackathon: graph databases and Wikidata

https://doi.org/10.59350/f7n4w-46n82

Published October 24, 2018

Author Roderic Page

I spent last week in Ottawa at a "Ecobiomics" hackathon organised by Joel Sachs. Essentially we spent a week exploring the application of linked data to various topics in biodiversity, with an emphasis on looking at working examples.

ChallengeGBIFOzymandiasComputer and Information Sciences

GBIF Ebbe Nielsen Challenge update

https://doi.org/10.59350/n8xys-sre19

Published October 24, 2018

Author Roderic Page

Quick note to express my delight and surprise that my entry for the 2018 GBIF Ebbe Nielsen Challenge come in joint first! My entry was Ozymandias - a biodiversity knowledge graph which built upon data from sources such as ALA, AFD, BioStor, CrossRef, ORCID), Wikispecies, and BLR.

Computer and Information Sciences

Guest post - Quality paralysis: a biodiversity data disease

https://doi.org/10.59350/64xvq-vvh21

Published September 11, 2018

Author Roderic Page

The following is a guest post by Bob Mesibov. In 2005, GBIF released Arthur Chapman's Principles of Data Quality and Principles and Methods of Data Cleaning: Primary Species and Species-Occurrence Data as freely available electronic publications. Their impact on museums and herbaria has been minimal.

iPhylo

Frankenplace, geospatial search, and discrete global grid systems

Ozymandias meets Wikipedia, with notes on natural language generation

Ozymandias: A biodiversity knowledge graph published in PeerJ

Where is the damned collection? Wikidata, GrBio, and a global list of all natural history collections

Ozymandias: A biodiversity knowledge graph available as a preprint on Biorxiv

Geocoding genomic databases using GBIF

Taxonomic publications as patch files and the notion of taxonomic concepts

Specimens, collections, researchers, and publications: towards social and citation graphs for natural history collections

Ottawa Ecobiomics hackathon: graph databases and Wikidata

GBIF Ebbe Nielsen Challenge update

Guest post - Quality paralysis: a biodiversity data disease