Rogue Scholar

BHL, Twitter, Visualisation, Computer and Information Sciences, English

BHL interface ideas

Published 11 December 2009

I've been buried in programming (and it's exam time at Glasgow) so I've not blogged for a month (gasp). I've been playing with ways to visualise Biodiversity Heritage Library content for a while (click here for a list of previous posts), and have occasionally surfaced to tweet a screenshot via twitpic.

BHL, Catalogue Of Life, Tag Tree, Tags, Computer and Information Sciences, English

Tag trees: displaying the taxonomy of names in BHL

https://doi.org/10.59350/ggwsa-2gf14

Published 11 November 2009

Author Roderic Page

I've added a feature to my Biodiversity Heritage Library viewer that should help make sense of the names found on a page. Until now I've displayed them as a list of "tags", which ignores the relations among the names.

Screencast, TDWG, Wiki, Computer and Information Sciences, English

iTaxon screencast

https://doi.org/10.59350/vctfg-fa944

Published 9 November 2009

Author Roderic Page

Sadly I won't be at TDWG 2009, at least not in person. However, there is a session on wikis, which may contain this brief screencast of my iTaxon experiments. The screencast was made in haste, but tries to convey some of the ideas behind these experiments, especially the idea that by linking data together we can generate more interesting and rich views of objects such as scientific publications.

BHL, Javascript, Lazy Load, Computer and Information Sciences, English

BHL Viewer now with go faster stripes

https://doi.org/10.59350/zmeek-d6615

Published 5 November 2009

Author Roderic Page

One of the more glaring limitations of my BHL viewer described in the previous post is that it can take a while to load all the page thumbnails (there can be hundreds). Given that one of the original motivations for this project was a faster viewer, this kinda sucks.

BHL, Metadata, Computer and Information Sciences, English

Biodiversity Heritage Library viewer experiments

https://doi.org/10.59350/pjc6v-ejz61

Published 3 November 2009

Author Roderic Page

In between the chaos that is term-time I've been playing with ways to view Biodiversity Heritage Library content. The viewer is crude, and likely to go off-line at any moment while I fuss with it, but you can view an example here.

BHL, Data Cleaning, Index, Matching, MySQL, Computer and Information Sciences, English

n-gram fulltext indexing in MySQL

https://doi.org/10.59350/26ame-4a164

Published 23 October 2009

Author Roderic Page

Continuing with my exploration of the Biodiversity Heritage Library, one obstacle to linking BHL content with nomenclature databases is the lack of a consistent way to refer to the same bibliographic item (e.g., a book or journal). For example, the Amphibia Species of the World (ASW) page for Gastrotheca aureomaculata gives the first reference for this name as: Gastrotheca aureomaculata Cochran and Goin, 1970, Bull. U.S. Natl.

Mac OS X, Memcached, PHP, Software, Tutorial, Computer and Information Sciences, English

Memcached, Mac OS X, and PHP

https://doi.org/10.59350/5b1va-ap837

Published 17 October 2009

Author Roderic Page

While thinking about ways to improve the performance of some of my web servers, I've begun to toy with Memcached.

BHL, Bioguid, OpenURL, Zotero, Computer and Information Sciences, English

Linking Bulletin of Zoological Nomenclature to BHL

https://doi.org/10.59350/h8ejh-6qj27

Published 8 October 2009

Author Roderic Page

After some fussing and hair pulling I've constructed a demo of linking a journal to the Biodiversity Heritage Library and displaying the results in Zotero (see my earlier post for rationale). After some searching I managed to retrieve metadata for several hundred articles from the Bulletin of Zoological Nomenclature. Using a local copy of the BHL metadata, I wrote a script that looked up each article in BHL and found the URL of the first

BHL, Bioguid, Metadata, OpenURL, Zotero, Computer and Information Sciences, English

Zotero group for Biodiversity Heritage Library content

https://doi.org/10.59350/mxzkp-ktx48

Published 7 October 2009

Author Roderic Page

One thing I find myself doing (probably more often than I should) is adding a reference to my Zotero library for an item in the Biodiversity Heritage Library (BHL). BHL doesn't have article-level metadata (see But where are the articles?), so when I discover a page of interest (e.g., one that contains the original description of a taxon) I store metadata for the article containing that page in my Zotero library.

Classification, Gregg's Paradox, Taxonomy, Wikipedia, Computer and Information Sciences, English

Wikipedia and Gregg's paradox

https://doi.org/10.59350/6bbcj-xf875

Published 6 October 2009

Author Roderic Page

Continuing the theme of taxonomic classification in Wikipedia, I'm perversely delighted that Wikipedia demonstrates Gregg's paradox so nicely. The late John R. Gregg wrote several papers and a book exploring the logical structure of taxonomy.

Classification, Wikipedia, Computer and Information Sciences, English

Wikipedia's taxonomic classification is badly broken

https://doi.org/10.59350/vxhjg-y5c77

Published 5 October 2009

Author Roderic Page

Wikipedia is wonderful, but parts of it are horribly broken. Take, for example, taxonomic classifications. A classification is a rooted tree, which means that each node in the tree has a single parent. We can store trees in databases in a variety of ways. For example, for each node we could store a list of its children, or we could store the single unique parent of each node. Ideally we'd choose to store one or other, but not both.
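To make the "one or other, but not both" point concrete, here is a minimal Python sketch. The taxon names, the deliberate error in the data, and the function names are all made up for illustration; none of it is taken from Wikipedia's actual data or from the code behind the post. It shows that child lists can be derived from parent pointers, and that storing both independently lets them drift apart.

```python
from collections import defaultdict

def children_from_parents(parent_of):
    """Derive the child lists implied by a node -> parent mapping."""
    children = defaultdict(list)
    for node, parent in parent_of.items():
        if parent is not None:  # the root has no parent
            children[parent].append(node)
    return {p: sorted(kids) for p, kids in children.items()}

def find_conflicts(parent_of, children_of):
    """Return (parent, child) pairs listed in children_of that
    disagree with the single parent recorded in parent_of."""
    conflicts = []
    for parent, kids in children_of.items():
        for kid in kids:
            if parent_of.get(kid) != parent:
                conflicts.append((parent, kid))
    return sorted(conflicts)

# Hypothetical toy classification stored both ways.
parent_of = {
    "Animalia": None,
    "Chordata": "Animalia",
    "Amphibia": "Chordata",
    "Anura": "Amphibia",
}
children_of = {
    "Animalia": ["Chordata"],
    "Chordata": ["Amphibia", "Anura"],  # deliberate error: Anura also listed here
    "Amphibia": ["Anura"],
}

derived = children_from_parents(parent_of)
conflicts = find_conflicts(parent_of, children_of)
print(conflicts)
```

If only `parent_of` were stored, `children_from_parents` would always give a consistent tree. Keeping a separately edited `children_of` is what allows a node to end up with two apparent parents.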

iPhylo
