Ciências da Computação e da InformaçãoInglêsBlogger

iPhylo

Rants, raves (and occasionally considered opinions) on phyloinformatics, taxonomy, and biodiversity informatics. For more ranty and less considered opinions, see my Twitter feed.ISSN 2051-8188. Written content on this site is licensed under a Creative Commons Attribution 4.0 International license.
Pagina inicialFeed AtomMastodonISSN 2051-8188
language
Data CleaningGoogle RefineTaxonomic NameCiências da Computação e da InformaçãoInglês
Publicados

Google Refine is an elegant tool for data cleaning. One of its most powerful features is the ability to call "Reconciliation Services" to help clean data, for example by matching names to external identifiers. Google Refine comes with the ability to use Freebase reconciliation services, but you can also add external services. Inspired by this I've started to implement services to reconcile taxonomic names.

BrowserPhyloinformaticsTreeBASEVisualisationCiências da Computação e da InformaçãoInglês
Publicados

One of the things I find frustrating about TreeBASE is that there's no easy way to get an overview of what it contains. What is it's taxonomic coverage like? Is it dominated by plants and fungi, or are there lots of animal trees as well? Are the obvious gaps in our phylogenetic knowledge, or do the phylogenies it contains pretty much span the tree of life?

Darwin Core RipletData MiningMuseumSpecimen CodesCiências da Computação e da InformaçãoInglês
Publicados

Quick note about a tool I've cobbled together as part of the phyloinformatics course, which addresses a long standing need I and others have to extract specimen codes from text. I've had this code kicking around for a while (as part of various never-finished data mining projects), but never got around to releasing it, until now.

GithubPhyloinformaticsTeachingCiências da Computação e da InformaçãoInglês
Publicados

As part of a postgraduate course here at the University of Glasgow I'm teaching five sessions on "phyloinformatics", which I've decided to define broadly enough to encompass most of biodiversity informatics. Given that this module is being developed on the fly, and will make use of lots of little "toys" I've developed and discussed on this blog, I've decided to put the course notes online, along with the interactive demos and the source code.

APIEOLIPadJQueryMobileCiências da Computação e da InformaçãoInglês
Publicados

As part of a course on "phyloinformatics" that I'm about to teach I've been making some visualisations of classifications. Here's one I've put together using jQuery Mobile and the Encyclopedia of Life API. It's pretty limited, but is a simple way to explore EOL using three different classifications.

CitationCollectionsIdentifiersSpecimensTAXACOMCiências da Computação e da InformaçãoInglês
Publicados

This message appeared on the TAXACOM mailing list: Given that most specimens lack resolvable digital identifiers (a theme I've harped on about before, most recently in the context of DNA barcoding), answering this kind of query ends up being a case of searching publications for text strings that contain the acronym of the collection.

BHLCiteBankMendeleyCiências da Computação e da InformaçãoInglês
Publicados

Here are some quick notes on how BHL could use Mendeley as a "CiteBank". As a repository of bibliographic data If the goal is to assemble a "bibliography of life" then there are various ways this could be done. Taxon-specific bibliographies Create groups that are taxon-specific (or find existing groups in Mendeley.

BHLBHL-EuropeTaxonomic NameCiências da Computação e da InformaçãoInglês
Publicados

I've recently updated my database of links between animal taxonomic names and literature identifiers, which now has over 280,000 names linked to some form of identifier (127,000 of these being DOIs). You can see the current version here: http://iphylo.org/~rpage/itaxon/ As an experiment I've added a feature to list the number of names for each journal.

IPadNLM DTDPLoSXMLXSLTCiências da Computação e da InformaçãoInglês
Publicados

One of the things I keep revisiting is the way we display scientific articles. Apart from Nature's excellent iPhone and iPad apps, most efforts to re-imagine how we display articles are little more than glorified PDF viewers (e.g., the PLoS iPad app). Part of the challenge is that if we make the article more interactive we immediately confront the problem of how to link to other content.

Australian Faunal DirectoryCodeCouchDBListCiências da Computação e da InformaçãoInglês
Publicados

Quick note to self about exporting data from my Australian Faunal Directory on CouchDB project. To export data from a CouchDB view you can use a list function (see Formatting with Show and List). Following the example on the Kanapes IDE blog, I created the following list function: { "_id": "_design/publication", "_rev": "14-467dee8248e97d874f1141411f536848", "language": "javascript", "lists": { "tsv": "function(head,req) { var row;