Rogue Scholar

GoogleWikipediaInformatikEnglisch

Google and Wikipedia revisited

Veröffentlicht 3. September 2009

Given that one response to my post on Fungi in Wikipedia was to say that fungi are also charismatic, so maybe I should try [insert unsexy taxon name here]. So, I've now looked at all the species I extracted from Wikipedia (nearly 72,000), ran the Google searches, and here are the results:SiteHow many times is it the top

FungiGoogleSearchWikipediaInformatikEnglisch

Fungi in Wikipedia

https://doi.org/10.59350/ye5r0-ta821

Veröffentlicht 2. September 2009

Autor Roderic Page

One response to the analysis I did of the Google rank of mammal pages in Wikipedia is to suggest that Wikipedia does well for mammals because these are charismatic. It's been suggested that for other groups of taxa Wikipedia might not be so prominent in the search results.As a quick test I extracted the 1552 fungal species I could find in Wikipedia and repeated the analysis.

GoogleMammalsPower LawWikipediaInformatikEnglisch

Wikipedia mammals and the power law

https://doi.org/10.59350/yfzna-72q11

Veröffentlicht 1. September 2009

Autor Roderic Page

Playing a bit more with the Wikipedia mammal data, there are some interesting patterns to note.

Clay ShirkyEOLGooglePower LawSearchInformatikEnglisch

Google, Wikipedia, and EOL

https://doi.org/10.59350/qvzh4-v1988

Veröffentlicht 1. September 2009

Autor Roderic Page

One assumption I've been making so far is that when people search for information on an organism using its scientific name, Wikipedia will dominate the search results (see my earlier post for an example of this assumption). I've decided to quantify this by doing a little experiment. I grabbed the Mammal Species of the World taxonomy and extracted the 5416 species names. I then used Google's AJAX search API to look up each name in Google.

ClassificationMammal Species Of The WorldMammalsMSWWikipediaInformatikEnglisch

Comparing Wikipedia and Mammal Species of the World classifications

https://doi.org/10.59350/b679a-wjz41

Veröffentlicht 31. August 2009

Autor Roderic Page

Continuing the saga of making sense of the mammal classification in Wikipedia, I've done a quick comparison with the Mammal Species of the World (third edition) classification. MSW is the default taxonomic reference used by WikiProject Mammals.

ClassificationMammalsVisualisationWikipediaInformatikEnglisch

Mammal tree from Wikipedia

https://doi.org/10.59350/qj5rg-hmk44

Veröffentlicht 29. August 2009

Autor Roderic Page

Following on from my previous post about visualising the mammalian classification in Wikipedia, I've extracted the largest component from the graph for all mammal taxa in Wikipedia, and it is a tree. This wasn't apparent in the previous diagram, where the component appeared as a big ball due to the layout algorithm used.

InformatikEnglisch

Visualising the Wikipedia classification of mammals

https://doi.org/10.59350/wnhjf-9c704

Veröffentlicht 28. August 2009

Autor Roderic Page

As part of my on-going experiments with Wikipedia as a repository of taxonomic information, I've extracted mammal pages from Wikipedia.

Australian Systematic BotanyCitationCitation NeededImpact FactorNuytsiaInformatikEnglisch

Scientific citations in Wikipedia

https://doi.org/10.59350/dsvg4-vy879

Veröffentlicht 21. August 2009

Autor Roderic Page

While thinking about measuring the quality of Wikipedia articles by counting the number of times they cite external literature, and conversely measuring the impact of papers by how many times they're cited in Wikipedia, I discovered, as usual, that somebody has already done it. I came across this nice paper by Finn Årup Nielsen (arXiv:0705.2106v1) (originally published in First Monday as a HTML document, I've embedded the PDF from arXiv

BioguidFutureISpeciesMashupPlansInformatikEnglisch

To wiki or not to wiki?

https://doi.org/10.59350/jeqt0-ykn86

Veröffentlicht 18. August 2009

Autor Roderic Page

What follows are some random thoughts as I try and sort out what things I want to focus on in the coming days/weeks. If you don't want to see some wallowing and general procrastination, look away now.I see four main strands in what I've been up to in the last year or so:servicesmashupswikisphyloinformaticsLet's take these in turns. Services Not glamourous, but necessary.

NDEVistaWindowsInformatikEnglisch

Nexus Data Editor and Windows Vista

https://doi.org/10.59350/5m2gk-kah85

Veröffentlicht 17. August 2009

Autor Roderic Page

Sometimes it's just amazing/frightening how long a piece of software remains useful. I wrote Nexus Data Editor (NDE) in the late 1990's, mainly to keep my then PhD student Vince Smith happy.

GBIFGUIDsLinked DataInformatikEnglisch

GBIF and Linked Data

https://doi.org/10.59350/d5c5f-hby28

Veröffentlicht 12. August 2009

Autor Roderic Page

At the end of day two of the GBIF LSID-GUID Task Group I put together this crude diagram to summarise some of the possible links between biodiversity data and the larger linked data cloud, which I, among others, have argued is where biodiversity informatics should be heading. Here's my hastily put together diagram (created using the wonderful OmniGraffle):I've put GBIF at the centre since we're at GBIF, and it's them we are trying to convince.

iPhylo

Google and Wikipedia revisited

Fungi in Wikipedia

Wikipedia mammals and the power law

Google, Wikipedia, and EOL

Comparing Wikipedia and Mammal Species of the World classifications

Mammal tree from Wikipedia

Visualising the Wikipedia classification of mammals

Scientific citations in Wikipedia

To wiki or not to wiki?

Nexus Data Editor and Windows Vista

GBIF and Linked Data