Rogue Scholar

ClusterfuckYahooComputer and Information Sciences

Word for the day - "clusterfuck"

Published February 12, 2007

Wired's article How Yahoo Blew It contains this wonderful extract: For some reason, the word really appeals. Not that I've got anything against Yahoo -- far from it. The provide some very cool tools, such as Pipes, which is an interactive feed aggregator and manipulator. David Shorthouse brought this to my attention. As he points out, its highly relevant to our conversation on iSpecies.

Computer and Information Sciences

Many Eyes

https://doi.org/10.59350/bcgr8-8vr65

Published February 10, 2007

Author Roderic Page

Spotted on information aesthetics, IBM's Many Eyes looks very cool. It's a way to share visualisations of data.

Computer and Information Sciences

TreeBASE name mapping

https://doi.org/10.59350/8hqs3-p9w31

Published February 9, 2007

Author Roderic Page

In the spirit of "release early, release often", a preliminary version of the TreeBASE name mapping project is now online at TBMap. It's a bit crude, the graphs look awful because they're generated on the fly on a Linux box using GraphViz, but you'll get the idea. I'll try and tidy it up and add a few more visuals to it next week after the EOL Informatics meeting at Woods Hole. There are also some missing mappings to add to the database.

Computer and Information Sciences

When phylogenetic names would be useful

https://doi.org/10.59350/j2t0b-9pt21

Published January 31, 2007

Author Roderic Page

To avoid being charged with being consistent (unlikely, I know), despite being underwhelmed by phylogenetic names in the context of TreeBASE (see conversation with David Marjanović in the previous post), I think they could be very useful in annotating phylogenetic trees.

Computer and Information Sciences

Quixotic(?) tree of Life visualisation

https://doi.org/10.59350/2t2xb-g7h82

Published January 30, 2007

Author Roderic Page

The Ant Room has a nice post on Visualizing the tree of life, with some cool links. And just to balance that, Donat Agosti drew my attention to Ford Doolittle and Eric Bapteste's PNAS article "Pattern pluralism and the Tree of Life hypothesis" doi:10.1073/pnas.0610699104.

Computer and Information Sciences

Encyclopedia of Life

https://doi.org/10.59350/6bwdq-xhj13

Published January 30, 2007

Author Roderic Page

E. O. Wilson's much used quote features prominently on the EoL Informatics web site. This project involves the Smithsonian Institution, Field Museum, Harvard University, Biodiversity Heritage Library, and the MBL. I will be at the Informatics Workshop next month. For my own toy efforts in this direction, see iSpecies.

Computer and Information Sciences

Phylocode

https://doi.org/10.59350/e2nzd-x1m46

Published January 30, 2007

Author Roderic Page

Comments by David Marjanović elsewhere on this blog (here and here) about TreeBASE, classification and Phylocode have prompted me to write a little bit about why I'm underwhelmed by the Phylocode. Suppose I have the question: How do I answer this? Well, my approach is to do the following. Firstly, I attempt to map every name in TreeBASE onto a name in an external database, such as NCBI Taxonomy, uBio, etc.

Computer and Information Sciences

The joys of mapping names in TreeBASE

https://doi.org/10.59350/wp4b1-vzy14

Published January 18, 2007

Author Roderic Page

Here's a fun example of how databases get out of sync, making them harder to link up. TreeBASE taxon T4628 is labelled Bolitoglossa sombra , which doesn't exist in NCBI's taxonomy database, which is odd as the study by Mueller et al. (S1139) is a molecular phylogeny (doi:10.1073/pnas.0405785101), and the taxon concerned has had its whole mitochondrial genome sequenced.

Computer and Information Sciences

A manifesto

https://doi.org/10.59350/zr1a0-syg32

Published January 16, 2007

Author Roderic Page

The funding of pPOD mentioned earlier today motivates me to write some notes on what I think "core database technologies for enabling the integration of AToL data" could, or indeed, should be about. Much of what follows I've mentioned elsewhere on the iPhylo blog (for example here and related blogs SemAnt and iSpecies) but it seems useful to bring this together here.

Computer and Information Sciences

processing PhylOData (pPOD)

https://doi.org/10.59350/kym89-0rb51

Published January 16, 2007

Author Roderic Page

Some good news! pPOD, a NSF-funded project on integrating data from AToL (A Tree of Life) projects has been funded. Val Tannen (right) is the co-ordinating PI. I'm a consultant, which means more opportunities to mouth-off about phylogenetic data and databases (for earlier examples see TreeBASE rocks, TreeBASE talk at CIPRES, and Towards the ToL database - some visions). The project is called pPOD, and has a wiki.

Computer and Information Sciences

chem-bla-ics: Including SMILES, CML and InChI in blogs

https://doi.org/10.59350/36qcs-2dp13

Published January 8, 2007

Author Roderic Page

Browsing Postgenomic.com eventually lead to Egon Willighagen's post about Including SMILES, CML and InChI in blogs, which talks about the sort of things I'd like to do in biodiversity informatics. I'm particularly keen on using blogs as annotation tools. One more for the reading list... (see also RDFa.