Rogue Scholar

Published March 7, 2014

Author Roderic Page

As part of a project exploring GBIF data I've been playing with displaying GBIF data on Google Maps.

Classification, Error, GBIF, Liverwort, Computer and Information Sciences

GBIF liverwort taxonomy broken

https://doi.org/10.59350/cq66n-7mc69

Published March 3, 2014

Author Roderic Page

A quick note to myself to document a problem with the GBIF classification of liverworts (I've created issue POR-1879 for this). While building a new tool to browse GBIF data I ran into a problem that the taxon "Jungermanniales" popped up in two different places in the GBIF classification, which broke a graphical display widget I was using.
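The kind of check that would flag this problem can be sketched in a few lines of Python. This is only an illustration: the `find_duplicate_taxa` helper and the `(name, parent)` rows are hypothetical, and the parent names are placeholders rather than real GBIF data.

```python
from collections import defaultdict


def find_duplicate_taxa(rows):
    """Return {name: sorted parent names} for names filed under more than one parent."""
    parents_by_name = defaultdict(set)
    for name, parent in rows:
        parents_by_name[name].add(parent)
    return {
        name: sorted(parents)
        for name, parents in parents_by_name.items()
        if len(parents) > 1
    }


# Toy (name, parent) rows; the parent names are placeholders, not GBIF data.
rows = [
    ("Jungermanniales", "Parent A"),
    ("Jungermanniales", "Parent B"),
    ("Marchantiales", "Parent C"),
]

duplicates = find_duplicate_taxa(rows)
print(duplicates)
```

A tree widget that assumes each name has a single parent could run a check like this before drawing, and report the offending names instead of failing.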

Data, Data Grief, Data Quality, ODI, Computer and Information Sciences

Five Stages of Data Grief

https://doi.org/10.59350/8fwx6-95789

Published February 19, 2014

Author Roderic Page

There is a great post by Jeni Tennison on the Open Data Institute blog entitled Five Stages of Data Grief. It resonates so much with my experience working with biodiversity data (such as building BioNames, or exploring data errors in GBIF) that I've decided to reproduce it here.

Markup, Prezi, Pro-iBiosphere, Computer and Information Sciences

Mark-up of biodiversity literature

https://doi.org/10.59350/2vwca-1xx91

Published February 10, 2014

Author Roderic Page

I gave a remote presentation at a pro-iBiosphere workshop this morning. The slides are below (to try to make it a bit more engaging than a deck of PowerPoint slides, I played around with Prezi). There is a version on Vimeo that has audio as well. I sketched out the biodiversity "knowledge graph", then talked about how mark-up relates to this, finishing with a few questions.

Genbank, NCBI, Type Specimens, Computer and Information Sciences

NCBI taxonomy database now shows type material

https://doi.org/10.59350/217kh-1j345

Published January 24, 2014

Author Roderic Page

Scott Federhen told me about a nice new feature in GenBank that he's described in a piece for NCBI News. The NCBI taxonomy database now shows a list of type material (where known), and the GenBank sequence database "knows" about types. Here's the summary: You can query for sequences from type using the query "sequence from type"[filter]. This could lead to some nice automated tools.
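As a rough sketch of what such an automated tool might start from, the filter can be combined with an organism name and sent to the NCBI E-utilities `esearch` endpoint. The helper name `type_sequence_search_url`, the choice of the `nuccore` database, and the example organism are my assumptions for illustration; the code only builds the URL and does not contact NCBI.

```python
from urllib.parse import urlencode

# Public NCBI E-utilities search endpoint.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def type_sequence_search_url(organism, database="nuccore", retmax=20):
    """Build an esearch URL for sequences from type material of one organism."""
    # Combine an organism restriction with the filter quoted in the post.
    term = f'"{organism}"[Organism] AND "sequence from type"[filter]'
    params = {"db": database, "term": term, "retmax": retmax, "retmode": "json"}
    return f"{ESEARCH_URL}?{urlencode(params)}"


# The organism name here is an arbitrary example.
url = type_sequence_search_url("Escherichia")
print(url)
```

Fetching that URL (for example with `urllib.request.urlopen`) should return a JSON list of matching sequence identifiers, which a script could then retrieve with `efetch`.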

Annotation, Freebase, Github, VertNet, Computer and Information Sciences

VertNet starts issue tracking using GitHub

https://doi.org/10.59350/9h8bk-1tm71

Published January 24, 2014

Author Roderic Page

VertNet has announced that they have implemented issue tracking using GitHub. This is a really interesting development, as figuring out how to capture and make use of annotations in biodiversity databases is a problem that's attracting a lot of attention.

2014, BioNames, GBIF, Google, Knowledge Graph, Computer and Information Sciences

What I'll be working on in 2014: knowledge graphs and Google forests

https://doi.org/10.59350/br4aq-bzv55

Published January 15, 2014

Author Roderic Page

More for my own benefit than anything else I've decided to list some of the things I plan to work on this year. If nothing else, it may make sobering reading this time next year. A knowledge graph for biodiversity: Google's introduction of the "knowledge graph" gives us a happy phrase to use when talking about linking stuff together.

Annotation, Editing, Filtered-push, GBIF, Identifiers, Computer and Information Sciences

Annotating GBIF: some thoughts

https://doi.org/10.59350/m9m1a-19s87

Published January 9, 2014

Author Roderic Page

Given that it's the start of a new year, and I have a short window before teaching kicks off in earnest (and I have to revise my phyloinformatics course), I'm playing with a few GBIF-related ideas. One topic which comes up a lot is annotating and correcting errors. There has been some work in this area [1][2] but it strikes me as somewhat complicated. I'm wondering whether we couldn't try and keep things simple.

DNA Barcoding, Genbank, GPS, Guest Post, Computer and Information Sciences

Guest post: response to "Putting GenBank Data on the Map"

https://doi.org/10.59350/fdkqx-47m65

Published December 12, 2013

Author Roderic Page

The following is a guest blog post by David Schindel and colleagues and is a response to the paper by Antonio Marques et al. in Science doi:10.1126/science.341.6152.1341-a. Marques, Maronna and Collins (1) rightly call on the biodiversity research community to include latitude/longitude data in database and published records of natural history specimens.

BHL, Code, DjVu, HOCR, JATS, Computer and Information Sciences

Towards BioStor articles marked up using Journal Archiving Tag Set

https://doi.org/10.59350/18fc1-gxf54

Published December 4, 2013

Author Roderic Page

A while ago I posted "BHL to PDF workflow", a sketch of a workflow to generate clean, searchable PDFs from Biodiversity Heritage Library (BHL) content. I've made some progress on putting this together, and have expanded the goal somewhat. In fact, there are several goals: BioStor articles need to be archived somewhere.

iPhylo

GBIF data overlayed on Google Maps
