Rogue Scholar

Google, Wikipedia | Computer and Information Sciences | English

Google and Wikipedia revisited

Published 3 September 2009

One response to my post on Fungi in Wikipedia was to say that fungi are also charismatic, so maybe I should try [insert unsexy taxon name here]. So I've now looked at all the species I extracted from Wikipedia (nearly 72,000), run the Google searches, and here are the results:

Site | How many times is it the top

Fungi, Google, Search, Wikipedia | Computer and Information Sciences | English

Fungi in Wikipedia

https://doi.org/10.59350/ye5r0-ta821

Published 2 September 2009

Author Roderic Page

One response to the analysis I did of the Google rank of mammal pages in Wikipedia is to suggest that Wikipedia does well for mammals because these are charismatic. It's been suggested that for other groups of taxa Wikipedia might not be so prominent in the search results. As a quick test I extracted the 1552 fungal species I could find in Wikipedia and repeated the analysis.

Google, Mammals, Power Law, Wikipedia | Computer and Information Sciences | English

Wikipedia mammals and the power law

https://doi.org/10.59350/yfzna-72q11

Published 1 September 2009

Author Roderic Page

Playing a bit more with the Wikipedia mammal data, there are some interesting patterns to note.

Clay Shirky, EOL, Google, Power Law, Search | Computer and Information Sciences | English

Google, Wikipedia, and EOL

https://doi.org/10.59350/qvzh4-v1988

Published 1 September 2009

Author Roderic Page

One assumption I've been making so far is that when people search for information on an organism using its scientific name, Wikipedia will dominate the search results (see my earlier post for an example of this assumption). I've decided to quantify this by doing a little experiment. I grabbed the Mammal Species of the World taxonomy and extracted the 5416 species names. I then used Google's AJAX search API to look up each name in Google.
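The counting step of an experiment like this can be sketched in a few lines of Python. This is only an illustration, not the code used in the post: it assumes the search results have already been collected into a dictionary mapping each species name to its ranked list of result URLs, and the `top_site_counts` function and the sample data are hypothetical.

```python
from collections import Counter
from urllib.parse import urlparse


def top_site_counts(results):
    """Count how often each site is the first hit.

    `results` maps a species name to its ranked list of result URLs
    (a hypothetical structure, assumed for this sketch).
    """
    counts = Counter()
    for name, urls in results.items():
        if urls:  # skip names that returned no hits
            counts[urlparse(urls[0]).netloc] += 1
    return counts


# Made-up sample data, for illustration only.
sample = {
    "Panthera leo": [
        "https://en.wikipedia.org/wiki/Lion",
        "https://example.org/panthera-leo",
    ],
    "Felis catus": [
        "https://en.wikipedia.org/wiki/Cat",
    ],
    "Mus musculus": [
        "https://example.org/mus-musculus",
        "https://en.wikipedia.org/wiki/House_mouse",
    ],
}

counts = top_site_counts(sample)
for site, n in counts.most_common():
    print(site, n)
```

Grouping by host name is a simplification; a real analysis might want to merge subdomains or look at ranks below the top hit as well.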

Classification, Mammal Species Of The World, Mammals, MSW, Wikipedia | Computer and Information Sciences | English

Comparing Wikipedia and Mammal Species of the World classifications

https://doi.org/10.59350/b679a-wjz41

Published 31 August 2009

Author Roderic Page

Continuing the saga of making sense of the mammal classification in Wikipedia, I've done a quick comparison with the Mammal Species of the World (third edition) classification. MSW is the default taxonomic reference used by WikiProject Mammals.

Classification, Mammals, Visualisation, Wikipedia | Computer and Information Sciences | English

Mammal tree from Wikipedia

https://doi.org/10.59350/qj5rg-hmk44

Published 29 August 2009

Author Roderic Page

Following on from my previous post about visualising the mammalian classification in Wikipedia, I've extracted the largest component from the graph for all mammal taxa in Wikipedia, and it is a tree. This wasn't apparent in the previous diagram, where the component appeared as a big ball due to the layout algorithm used.
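Whether the largest component of such a graph is a tree can be checked directly, without relying on a layout. The sketch below is an assumption about how one might do it, not the method used in the post: it treats the classification as an undirected graph of (child, parent) pairs, finds the connected components by breadth-first search, and uses the fact that a connected graph with n nodes is a tree exactly when it has n - 1 edges. The function names and the sample edges are made up for illustration.

```python
from collections import defaultdict, deque


def components(edges):
    """Return the connected components of an undirected graph as sets of nodes."""
    adj = defaultdict(set)
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    seen = set()
    comps = []
    for start in adj:
        if start in seen:
            continue
        # Breadth-first search from an unvisited node collects one component.
        nodes = {start}
        seen.add(start)
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for neighbour in adj[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    nodes.add(neighbour)
                    queue.append(neighbour)
        comps.append(nodes)
    return comps


def is_tree(nodes, edges):
    """A connected component is a tree exactly when it has len(nodes) - 1 distinct edges."""
    inside = {frozenset(e) for e in edges if e[0] in nodes and e[1] in nodes}
    return len(inside) == len(nodes) - 1


# Illustrative (child, parent) pairs only.
edges = [
    ("Panthera leo", "Panthera"),
    ("Panthera tigris", "Panthera"),
    ("Panthera", "Felidae"),
    ("Felis catus", "Felis"),
]

largest = max(components(edges), key=len)
print(len(largest), is_tree(largest, edges))
```

An extra edge inside the component, such as a taxon linked to two parents, would make the edge count exceed n - 1 and the check would fail.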

Computer and Information Sciences | English

Visualising the Wikipedia classification of mammals

https://doi.org/10.59350/wnhjf-9c704

Published 28 August 2009

Author Roderic Page

As part of my on-going experiments with Wikipedia as a repository of taxonomic information, I've extracted mammal pages from Wikipedia.

Australian Systematic Botany, Citation, Citation Needed, Impact Factor, Nuytsia | Computer and Information Sciences | English

Scientific citations in Wikipedia

https://doi.org/10.59350/dsvg4-vy879

Published 21 August 2009

Author Roderic Page

While thinking about measuring the quality of Wikipedia articles by counting the number of times they cite external literature, and conversely measuring the impact of papers by how many times they're cited in Wikipedia, I discovered, as usual, that somebody has already done it. I came across this nice paper by Finn Årup Nielsen (arXiv:0705.2106v1) (originally published in First Monday as an HTML document; I've embedded the PDF from arXiv

Bioguid, Future, ISpecies, Mashup, Plans | Computer and Information Sciences | English

To wiki or not to wiki?

https://doi.org/10.59350/jeqt0-ykn86

Published 18 August 2009

Author Roderic Page

What follows are some random thoughts as I try to sort out what I want to focus on in the coming days and weeks. If you don't want to see some wallowing and general procrastination, look away now.

I see four main strands in what I've been up to in the last year or so: services, mashups, wikis, and phyloinformatics. Let's take these in turn.

Services

Not glamorous, but necessary.

NDE, Vista, Windows | Computer and Information Sciences | English

Nexus Data Editor and Windows Vista

https://doi.org/10.59350/5m2gk-kah85

Published 17 August 2009

Author Roderic Page

Sometimes it's just amazing/frightening how long a piece of software remains useful. I wrote Nexus Data Editor (NDE) in the late 1990s, mainly to keep my then PhD student Vince Smith happy.

GBIF, GUIDs, Linked Data | Computer and Information Sciences | English

GBIF and Linked Data

https://doi.org/10.59350/d5c5f-hby28

Published 12 August 2009

Author Roderic Page

At the end of day two of the GBIF LSID-GUID Task Group I put together this crude diagram to summarise some of the possible links between biodiversity data and the larger linked data cloud, which I, among others, have argued is where biodiversity informatics should be heading. Here's my hastily put together diagram (created using the wonderful OmniGraffle).

I've put GBIF at the centre since we're at GBIF, and it's them we are trying to convince.

iPhylo
