iPhylo

Rants, raves (and occasionally considered opinions) on phyloinformatics, taxonomy, and biodiversity informatics. For more ranty and less considered opinions, see my Twitter feed. ISSN 2051-8188. Written content on this site is licensed under a Creative Commons Attribution 4.0 International license.
Data Cleaning, Google Refine, Taxonomic Name, Computer Science, English

Google Refine is an elegant tool for data cleaning. One of its most powerful features is the ability to call "Reconciliation Services" to help clean data, for example by matching names to external identifiers. Google Refine comes with the ability to use Freebase reconciliation services, but you can also add external services.
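To make the idea of a reconciliation service concrete, here is a minimal sketch of the matching step such a service performs: take a query string and return scored candidates, each with an identifier. The lookup table, the `example:` identifiers, the type entry, and the scoring rule are all made up for illustration. The response shape (`result` holding objects with `id`, `name`, `type`, `score`, and `match`) follows my understanding of what Refine expects from a reconciliation service, so treat it as an assumption rather than a specification.

```javascript
// Toy lookup table. The identifiers are hypothetical, not real external IDs.
const KNOWN_NAMES = [
  { id: "example:1", name: "Homo sapiens" },
  { id: "example:2", name: "Homo erectus" },
  { id: "example:3", name: "Pan troglodytes" },
];

// Build a reconciliation-style response for a single query string.
// Scoring is deliberately crude: 100 for an exact (case-insensitive) match,
// 50 when only the first word (the genus) agrees.
function reconcile(query, names = KNOWN_NAMES) {
  const q = query.trim().toLowerCase();
  const firstWord = q.split(" ")[0];
  const result = [];

  for (const entry of names) {
    const candidate = entry.name.toLowerCase();
    let score = 0;
    if (candidate === q) {
      score = 100;
    } else if (firstWord.length > 0 && candidate.startsWith(firstWord + " ")) {
      score = 50;
    }
    if (score > 0) {
      result.push({
        id: entry.id,
        name: entry.name,
        type: [{ id: "example:taxon", name: "Taxon" }], // hypothetical type
        score: score,
        match: score === 100, // only exact matches are flagged as confident
      });
    }
  }

  // Best candidates first.
  result.sort((a, b) => b.score - a.score);
  return { result: result };
}

console.log(JSON.stringify(reconcile("homo sapiens"), null, 2));
```

A real service would wrap this in an HTTP endpoint and use a proper name-matching algorithm. The point here is only the shape of the exchange: a name goes in, ranked identifiers come out.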

Darwin Core Riplet, Data Mining, Museum, Specimen Codes, Computer Science, English

Quick note about a tool I've cobbled together as part of the phyloinformatics course, which addresses a long-standing need I and others have to extract specimen codes from text. I've had this code kicking around for a while (as part of various never-finished data mining projects), but never got around to releasing it until now.
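As a rough illustration of what extracting specimen codes from text involves (this is not the tool's actual code), here is a minimal sketch using a regular expression. It assumes a specimen code looks like a collection acronym of two to six capital letters followed by a catalogue number of at least three digits, optionally separated by a space or hyphen. Real codes are far messier than that, and the catalogue numbers in the sample text are invented.

```javascript
// Assumed pattern: ACRONYM (2-6 capitals), optional space or hyphen,
// then a catalogue number of three or more digits.
const SPECIMEN_CODE = /\b([A-Z]{2,6})[ -]?(\d{3,})\b/g;

// Return every specimen-code-like string found in the text,
// split into its acronym and catalogue number.
function extractSpecimenCodes(text) {
  const codes = [];
  for (const m of text.matchAll(SPECIMEN_CODE)) {
    codes.push({ acronym: m[1], number: m[2], raw: m[0] });
  }
  return codes;
}

// Sample sentence with made-up catalogue numbers.
const sample = "Specimens examined: MVZ 12345 and AMNH-9876, collected in 1998.";
console.log(extractSpecimenCodes(sample));
```

Note that the year in the sample is ignored because it has no acronym in front of it. A pattern this simple will still produce false positives on things like grant numbers, which is part of why the problem is harder than it looks.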

GitHub, Phyloinformatics, Teaching, Computer Science, English

As part of a postgraduate course here at the University of Glasgow I'm teaching five sessions on "phyloinformatics", which I've decided to define broadly enough to encompass most of biodiversity informatics. Given that this module is being developed on the fly, and will make use of lots of little "toys" I've developed and discussed on this blog, I've decided to put the course notes online, along with the interactive demos and the source code.

API, EOL, iPad, jQuery Mobile, Computer Science, English

As part of a course on "phyloinformatics" that I'm about to teach, I've been making some visualisations of classifications. Here's one I've put together using jQuery Mobile and the Encyclopedia of Life API. It's pretty limited, but it is a simple way to explore EOL using three different classifications.

Citation, Collections, Identifiers, Specimens, TAXACOM, Computer Science, English

This message appeared on the TAXACOM mailing list: Given that most specimens lack resolvable digital identifiers (a theme I've harped on about before, most recently in the context of DNA barcoding), answering this kind of query ends up being a case of searching publications for text strings that contain the acronym of the collection.

BHL, CiteBank, Mendeley, Computer Science, English

Here are some quick notes on how BHL could use Mendeley as a "CiteBank". As a repository of bibliographic data: if the goal is to assemble a "bibliography of life" then there are various ways this could be done. Taxon-specific bibliographies: create groups that are taxon-specific (or find existing groups in Mendeley).

BHL, BHL-Europe, Taxonomic Name, Computer Science, English

I've recently updated my database of links between animal taxonomic names and literature identifiers, which now has over 280,000 names linked to some form of identifier (127,000 of these being DOIs). You can see the current version here: http://iphylo.org/~rpage/itaxon/ As an experiment I've added a feature to list the number of names for each journal.

iPad, NLM DTD, PLoS, XML, XSLT, Computer Science, English

One of the things I keep revisiting is the way we display scientific articles. Apart from Nature's excellent iPhone and iPad apps, most efforts to re-imagine how we display articles are little more than glorified PDF viewers (e.g., the PLoS iPad app). Part of the challenge is that if we make the article more interactive we immediately confront the problem of how to link to other content.

Australian Faunal Directory, Code, CouchDB, List, Computer Science, English

Quick note to self about exporting data from my Australian Faunal Directory on CouchDB project. To export data from a CouchDB view you can use a list function (see Formatting with Show and List). Following the example on the Kanapes IDE blog, I created the following list function:

{
  "_id": "_design/publication",
  "_rev": "14-467dee8248e97d874f1141411f536848",
  "language": "javascript",
  "lists": {
    "tsv": "function(head,req) { var row;
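
For readers unfamiliar with list functions, here is a minimal sketch of the general technique: loop over the view rows and send one tab-separated line per row. It is not the function from the design document above. In CouchDB, `getRow()` and `send()` are provided by the list function environment; in this sketch they are passed in as arguments so the code also runs outside CouchDB. The column names (`title`, `year`) and the sample rows are hypothetical.

```javascript
// Make a value safe for a TSV cell: tabs and line breaks would break the layout.
function tsvField(value) {
  if (value === null || value === undefined) return "";
  return String(value).replace(/[\t\r\n]+/g, " ");
}

// Core loop of a TSV-style list function: a header line, then one line per row.
// getRow is assumed to return a falsy value once the rows run out.
function listAsTsv(getRow, send, columns) {
  send(columns.join("\t") + "\n");
  let row;
  while ((row = getRow())) {
    const value = row.value || {};
    send(columns.map((c) => tsvField(value[c])).join("\t") + "\n");
  }
}

// Stand-ins for the CouchDB environment, with made-up rows.
const rows = [
  { key: 1, value: { title: "A new\tspecies", year: 1999 } },
  { key: 2, value: { title: "Another paper" } },
];
let output = "";
listAsTsv(() => rows.shift(), (chunk) => { output += chunk; }, ["title", "year"]);
console.log(output);
```

Inside a design document the same loop would sit in the body of `function(head, req) { ... }`, stored as a string under `lists`, as in the JSON above.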