Biyolojik BilimlerİngilizceBlogger

Getting Genetics Done

Getting Things Done in Genetics & Bioinformatics Research
Ana Sayfa
language
Recommended ReadingSequencingBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

Following up on last week's coverage of the Genotyping Portal, check out this new review article on next-generation sequencing in Nature Reviews Genetics.  One major focus of this paper is that the next generation of sequencing platforms each use fundamentally different technologies.

SequencingBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

Diego Forero has compiled a comprehensive list of primary publications on commonly used SNP genotyping and DNA sequencing technologies (including SNP arrays, Sequenom, TaqMan, Pyrosequencing, Molecular Beacons, FP-TDI, Invader, xMAP, SNaPshot, SNPlex, Sanger, 454, Illumina, Helicos, SOLiD, Complete Genomics, Bisulfite sequencing, and others).

LinuxVisualizationBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

Do you use SSH to connect to a remote Linux machine from your local Windows computer?  Ever needed to run a program on that Linux machine that displays graphical output, or uses a GUI? I was in this position last week trying to make figures using ggplot2 in R of results from an analysis of GWAS data which required using a 64-bit Linux machine with more RAM than my 32-bit windows machine can see.

Machine LearningRBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

A Beautiful WWW put together a great set of resources for getting started with machine learning in R.  First, they recommend the previously mentioned free book, The Elements of Statistical Learning.  Then there's a link to a list of dozens of machine learning and statistical learning packages for R.  Next, you'll need data.  Hundreds of free real datasets are available at the UCI machine learning repository.

NewsRRecommended ReadingBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

The New York Times had an interesting piece yesterday about how SAS is facing several business threats from companies like the recently IBM-acquired SPSS, and from burgeoning interest in open-source software like R.

AnnouncementsBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

The 2009 Cancer Epidemiology, Biostatistics, and Bioinformatics Retreat will be held on Friday, December 4th, 2009, from 1:30 pm to 5:00 pm, on the eighth floor of the VICC building (898B PRB). The purpose of the retreat is to promote interactions among biostatisticians, bioinformaticians, epidemiologists, clinical investigators, and other translational researchers.

AnnouncementsRBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

Theresa Scott, instructor of the previously mentioned R workshop and weekly R clinic, is giving a lecture entitled "Reproducible Research with R, LaTeX, & Sweave" in MRB III, room 1220, this Wednesday 11/18 at 1:30.  You can see more details about the lecture here. Looks like her slides as well as much more introductory material on R, Latex, and Sweave are on her website. Reproducible Research with R, LaTeX, &

Ggplot2GWASRTutorialsVisualizationBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

Way back will wrote on this topic.  See his previous post for Stata code for doing this.  Unfortunately the R package that was used to create QQ-plots here has been removed from CRAN, so I wrote my own using ggplot2 and some code I received from Daniel Shriner at NHGRI. Of course you can use R's built-in qqplot() function, but I could never figure out a way to add the diagonal using base graphics.

RTutorialsBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

While flirting around with previously mentioned ggplot2 I came across an incredibly useful set of functions in the plyr package, made by Hadley Wickham, the same guy behind ggplot2.  If you've ever used MySQL before, think of "GROUP BY", but here you can arbitrarily apply any R function to splits of the data, or write one yourself.

GWASStatisticsVisualizationBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

Often when presenting statistics from a candidate gene study, or a region of interest from a genome-wide association study, it is useful to see various SNP-wise values in the context of linkage disequilibrium patterns.