
Getting Genetics Done

Getting Things Done in Genetics & Bioinformatics Research
PLINK
Author: Unknown

A while back, Stephen wrote a very nice post about converting PLINK output to a CSV file. If you are like me, you have used this a thousand times, enough to get tired of typing lots of sed commands. I just crafted a little Bash script that does the same thing with a single, easy-to-type command. Insert the following text into your .bashrc file.
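The script itself is not reproduced in this excerpt. As a rough sketch of the idea (not the author's original), a shell function along these lines would turn PLINK's space-padded output into CSV. The function name `plink2csv` is a hypothetical choice, and the sketch assumes no field contains a comma or embedded whitespace.

```bash
# Hypothetical helper; NOT the script from the original post.
# Converts whitespace-delimited PLINK output to comma-separated values.
# Reads the files given as arguments, or stdin when called with none.
plink2csv() {
    # 1) strip leading whitespace (PLINK right-aligns its columns)
    # 2) strip trailing whitespace
    # 3) replace each remaining run of spaces/tabs with a single comma
    sed -e 's/^[[:space:]]*//' \
        -e 's/[[:space:]]*$//' \
        -e 's/[[:space:]][[:space:]]*/,/g' "$@"
}
```

After reloading your shell, something like `plink2csv plink.assoc > plink.assoc.csv` should write a CSV copy of the results, where `plink.assoc` stands in for whatever output file you have.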

GWAS, Recommended Reading
Author: Stephen Turner

Researchers in the ENGAGE consortium used a clever technique to leverage genome-wide expression data to prioritize genes for GWAS analysis. The investigators published the novel candidate genes for obesity in this month's PLoS Genetics, but I think the method they used is more interesting than the genes themselves.

Noteworthy Blogs, Recommended Reading
Author: Stephen Turner

NYU PhD student Drew Conway has compiled a very nice list of 10 reasons why grad students should blog. I've been writing GGD for a little over a year now, and it's been a great way to extend my own network past the Vanderbilt walls, participate in lively discussions with other scientists oceans away, and write stuff that people actually read and find useful.

GWAS, Recommended Reading
Author: Unknown

As a graduate student a few years ago, I learned about (and in some cases witnessed) the various phases, fads, and revolutions in the field of human genetics. The mid-to-late 1990s saw a shift from family-based linkage analysis to a plethora of small candidate gene studies. The early 2000s saw the completion of the Human Genome Project, the development of the HapMap project, and the birth of genome-wide association studies.

Announcements, Statistics
Author: Stephen Turner

I've had friends in biochem "wet" labs who've asked me to do some simple statistics on some of their results. This looks like an interesting seminar to attend if you've ever thought about doing a t-test on fold changes in some outcome measure between treatment and control groups, a pretty common situation in biochemical assays. If the speaker provides slides electronically, I'll happily post them here after the seminar.

Announcements
Author: Stephen Turner

Goncalo Abecasis, a candidate for a faculty position at the Vanderbilt Center for Human Genetics Research, will be interviewing here this week. Come check out his seminar - "Sequencing Thousands of Human Genomes" - Friday June 11th, 2-3pm in 512 Light Hall.

Getting Genetics Done by Stephen Turner is licensed under a Creative Commons Attribution (CC BY) License.

R, SQL
Author: Stephen Turner

I've covered a few topics in the past, including the plyr package, which is kind of like "GROUP BY" for R, and the merge function for merging datasets. I only recently found the sqldf package for R, and it's already one of the most useful packages I've ever installed. The main function in the package is sqldf(), which takes a SQL statement as a quoted string, so you can query data frames as if they were tables in a relational database.

R, Statistics, Tutorials
Author: Stephen Turner

Found this tutorial by Emily Mankin on how to do principal components analysis (PCA) using R. It has a nice example with R code and several good references. The example starts by doing the PCA manually, then uses R's built-in prcomp() function to do the same PCA.

Principal Components Analysis: A How-To Manual for R

GWAS, Visualization, Web Apps
Author: Stephen Turner

LocusZoom, which I mentioned previously, has undergone some major updates over the last few months. Many of the bugs mentioned in my previous post are now fixed, and there's now a good bit of documentation available. There are also a few new features, including the ability to add an extra column to your results file to change the plotting symbol to reflect your own custom annotation (e.g., whether the SNP was imputed or genotyped, or the SNP's function).

Productivity, R, Statistics, Writing
Author: Stephen Turner

Several readers emailed me or commented on my previous announcement of Frank Harrell's workshop on using Sweave for reproducible research, asking if we could record the seminar. Unfortunately, we couldn't record audio or video, but take a look at the Sweave/LaTeX page on the Biostatistics Dept Wiki.