Ciências da Computação e da InformaçãoInglêsBlogger

iPhylo

Rants, raves (and occasionally considered opinions) on phyloinformatics, taxonomy, and biodiversity informatics. For more ranty and less considered opinions, see my Twitter feed.ISSN 2051-8188. Written content on this site is licensed under a Creative Commons Attribution 4.0 International license.
Pagina inicialFeed AtomMastodonISSN 2051-8188
language
MediawikiSemantic WebTreeBASEWikiWorkshopCiências da Computação e da InformaçãoInglês
Publicados

At the start of this week I took part in a biodiversity informatics workshop at the Naturhistoriska riksmuseets, organised by Kevin Holston. It was a fun experience, and Kevin was a great host, going out of his way to make sure myself and other contributors were looked after.

AntsHistory FlowPyramicaStrumigenysWikipediaCiências da Computação e da InformaçãoInglês
Publicados

Stumbled across Alex Wild's post Pyramica vs Strumigenys : why does it matter?, which takes as it's starting point a minor edit war on the Wikipedia page for Pyramica . Alex gives the background to the argument about whether Pyramica is a synonym of Strumigenys , and investigates the issue using the surprisingly small about of data available in GenBank.

Gene WikiGoogleWikipediaCiências da Computação e da InformaçãoInglês
Publicados

Andrew Su has posted an analysis of Gene Wiki, a project to provide Wikipedia pages on every human gene:This result is interesting in that an existing resource (Gene Cards) beats Wikipedia, but only just.

History FlowSVGVisualisationWikipediaCiências da Computação e da InformaçãoInglês
Publicados

Quick post (really should be doing something else). Reading Jeff Atwood's post Mixing Oil and Water: Authorship in a Wiki World lead me to IBM's wonderful history flow tool to visualise the edit history of a Wikipedia page. There's a nice paper describing history flow (doi:10.1145/985692.985765, free PDF here). Inspired by this I decided to try and implement history flow in PHP and SVG.

GoogleWikipediaCiências da Computação e da InformaçãoInglês
Publicados

Given that one response to my post on Fungi in Wikipedia was to say that fungi are also charismatic, so maybe I should try [insert unsexy taxon name here]. So, I've now looked at all the species I extracted from Wikipedia (nearly 72,000), ran the Google searches, and here are the results:SiteHow many times is it the top

FungiGoogleSearchWikipediaCiências da Computação e da InformaçãoInglês
Publicados

One response to the analysis I did of the Google rank of mammal pages in Wikipedia is to suggest that Wikipedia does well for mammals because these are charismatic. It's been suggested that for other groups of taxa Wikipedia might not be so prominent in the search results.As a quick test I extracted the 1552 fungal species I could find in Wikipedia and repeated the analysis.

Clay ShirkyEOLGooglePower LawSearchCiências da Computação e da InformaçãoInglês
Publicados

One assumption I've been making so far is that when people search for information on an organism using its scientific name, Wikipedia will dominate the search results (see my earlier post for an example of this assumption). I've decided to quantify this by doing a little experiment. I grabbed the Mammal Species of the World taxonomy and extracted the 5416 species names. I then used Google's AJAX search API to look up each name in Google.

ClassificationMammal Species Of The WorldMammalsMSWWikipediaCiências da Computação e da InformaçãoInglês
Publicados

Continuing the saga of making sense of the mammal classification in Wikipedia, I've done a quick comparison with the Mammal Species of the World (third edition) classification. MSW is the default taxonomic reference used by WikiProject Mammals.

ClassificationMammalsVisualisationWikipediaCiências da Computação e da InformaçãoInglês
Publicados

Following on from my previous post about visualising the mammalian classification in Wikipedia, I've extracted the largest component from the graph for all mammal taxa in Wikipedia, and it is a tree. This wasn't apparent in the previous diagram, where the component appeared as a big ball due to the layout algorithm used.