Rogue Scholar

BHLErrorsFictional TaxaGBIFGoogleInformática y Ciencias de la InformaciónInglés

Fictional taxa

Publicado 18 de junio de 2012

Anyone who works with taxonomic databases is aware of the fact that they have errors. Some taxonomic databases are restricted in scope to a particular taxon in which one or more people have expertise, these then get aggregated into larger databases, which may in turn be aggregated by databases whose scope is global.

BHLDjVuPDFInformática y Ciencias de la InformaciónInglés

BHL to PDF workflow

https://doi.org/10.59350/7zndf-5ds97

Publicado 15 de junio de 2012

Autor Roderic Page

Just some random thoughts on creating searchable PDFs for article extracted from BHL.

Arthur C ClarkeCanonical NameGodTaxonomic NameTDWGInformática y Ciencias de la InformaciónInglés

Taxonomy and the nine billion names of God

https://doi.org/10.59350/f71qb-n9573

Publicado 14 de junio de 2012

Autor Roderic Page

In Arthur C. Clarke's short story The Nine Billion Names of God Tibetan monks hire two programmers to help them generate all the the possible names of God. The monks believe that the purpose of the Universe is to generate those names, once that goal is achieved the Universe will end.

ClassificationCluster MapsDemansiaEOLVisualisationInformática y Ciencias de la InformaciónInglés

Visualising differences between classifications using cluster maps

https://doi.org/10.59350/snnd5-9pa62

Publicado 13 de junio de 2012

Autor Roderic Page

As part of a project to build a tool to navigate through taxonomic names and classifications I've become interested in quick ways to compare classifications.

Catalogue Of LifeEOLJavascriptQuantum TreemapRectangle PackingInformática y Ciencias de la InformaciónInglés

Using a zoomable treemap to visualise a taxonomic classification

https://doi.org/10.59350/s04p4-4ta18

Publicado 12 de junio de 2012

Autor Roderic Page

One visualisation method I keep coming back too is the treemap.

HoplocephalusOCRTroveInformática y Ciencias de la InformaciónInglés

Discovering species descriptions in digitised newspapers: Trove and The Brisbane Courier

https://doi.org/10.59350/j3c65-1jd58

Publicado 7 de junio de 2012

Autor Roderic Page

While exploring ways to visually compare classifications I came across the Australian snake name Demansia atra , and ended up reading a series of papers in the Bulletin of Zoological Nomenclature discussing the status of the name (more fun than it sounds, trust me). For example, Smith and Wallach Case 2920.

GBIFLinkingLinkoutNCBITreeBASEInformática y Ciencias de la InformaciónInglés

Linking NCBI taxonomy to GBIF

https://doi.org/10.59350/sg04y-k2b09

Publicado 2 de junio de 2012

Autor Roderic Page

In response to Rutger Vos's question I've started to add GBIF taxon ids to the iPhylo Linkout website. If you've not come across iPhylo Linkout, it's a Semantic Mediawiki-based site were I maintain links between the NCBI taxonomy and other resources, such as Wikipedia and the BBC Nature Wildlife finder. For more background seePage, R. D. M. (2011). Linking NCBI to Wikipedia: a wiki-based approach. PLoS Currents, 3, RRN1228.

EOLErrorLeptograpsusTrustInformática y Ciencias de la InformaciónInglés

Can you trust EOL?

https://doi.org/10.59350/9x2bk-t0x79

Publicado 1 de junio de 2012

Autor Roderic Page

There's a recent thread on the Encyclopedia of Life concerning erroneous images for the crab Leptograpsus . This is a crab I used to chase around rooks on stormy west-coast beaches near Auckland, so I was a little surprised to see the EOL page for Leptograpsus looks like this:The name and classification is the crab, but the image is of a fish ( Lethrinus variegatus ). Perhaps at some point in aggregating the images the two

BioStorClassificationData CleaningErrorGBIFInformática y Ciencias de la InformaciónInglés

The GBIF classification is broken — how do we fix it?

https://doi.org/10.59350/5a5re-kp839

Publicado 30 de mayo de 2012

Autor Roderic Page

This post arose from an ongoing email conversation with Tony Rees about extracting and annotating taxonomic names. In BioStor I use the GBIF classification to display the taxonomic names found in the OCR text in the form of a tree. The idea is to give the reader a sense of "what the paper is about". I also use the classification to help link to GBIF occurrence records.

ChallengeData IntegrationEOLTaxonomyInformática y Ciencias de la InformaciónInglés

EOL challenge draft proposal

https://doi.org/10.59350/wn07a-qfy18

Publicado 15 de mayo de 2012

Autor Roderic Page

In the spirit of the Would you give me a grant experiment? [1] here's the draft of a proposal I'm working on for the Computable Data Challenge. It's an attempt to merge taxonomic names, the primary literature, and phylogenetics into one all-singing, all-dancing website that makes it easy to browse names, see the publications relevant to those names, and see what, if anything, we know about the phylogeny of those taxa.

Dark TaxaDNA BarcodingNCBIInformática y Ciencias de la InformaciónInglés

Dark taxa even darker: NCBI pulls (some) DNA barcodes from GenBank (updated)

https://doi.org/10.59350/p7r0x-kb326

Publicado 24 de abril de 2012

Autor Roderic Page

Dark taxa have become even darker. NCBI has pulled the plug on large numbers of DNA barcode sequences that lack scientific names. For example, taxon Cyclopoida sp. BOLD:AAG9771 (tax_id 818059) now has a sparse page that has no associated sequences. From an earlier download of EMBL I know that this taxon is associated with at least 5 sequences, such as GU679674. But if you go to that sequence you get this:So the the sequence is hidden.