Biyolojik BilimlerİngilizceBlogger

Getting Genetics Done

Getting Things Done in Genetics & Bioinformatics Research
Ana Sayfa
language
BioinformaticsBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

I mentioned BioStar in a previous post about getting all your questions answered. I can't emphasize enough how helpful the BioStar and other StackExchange communities are. Whenever I ask a statistics question on CrossValidated or a programming question on StackOverflow I often multiple answers within 10 minutes.

GWASRecommended ReadingSoftwareBiyolojik Bilimlerİngilizce
Yayınlandı

Peter Visscher and colleagues have recently published a flurry of papers employing a new software package called GCTA to estimate the heritability of traits using GWAS data (GCTA stands for Genome-wide Complex Trait Analysis -- clever acronymity!). The tool, supported (and presumably coded) by Jian Yang is remarkably easy to use, based in part on the familiar PLINK commandline interface.

BioinformaticsRRecommended ReadingStatisticsTutorialsBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

I haven't posted much here recently, but here is a roundup of a few of the links I've shared on Twitter (@genetics_blog) over the last two weeks. Here is a nice tutorial on accessing high-throughput public data (from NCBI) using R and Bioconductor. Cloudnumbers.com , a startup that allows you to run high-performance computing (HPC) applications in the cloud, now supports the previously mentioned R IDE, RStudio.

Ggplot2RVisualizationBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

I just discovered a handy function in R to produce a scatterplot matrix of selected variables in a dataset. The base graphics function is pairs(). Producing these plots can be helpful in exploring your data, especially using the second method below. Try it out on the built in iris dataset.

1000 GenomesSequencingBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

Sequencing company Complete Genomics recently made available 69 ethnically diverse complete human genome sequences: a Yoruba trio; a Puerto Rican trio; a 17-member, 3-generation pedigree; and a diversity panel representing 9 different populations. Some of the samples partially overlap with HapMap and the 1000 Genomes Project. The data can be downloaded directly from the FTP site.

BioinformaticsPathwaysRBiyolojik Bilimlerİngilizce
Yayınlandı

There are several tools available for conducting a post-hoc analysis of GWAS data looking for enrichment of significant SNPs using literature or pathway based resources. Examples include GRAIL, ALLIGATOR, and WebGestalt among others (see SNPath R Package). Since gene enrichment and pathway analysis essentially evolved from methods for analyzing gene expression data, many of these tools require specific gene identifiers as input.

BioinformaticsRecommended ReadingBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

Nucleic Acids Research just published its Web Server Issue, featuring new and updates to existing web servers and applications for genomics and proteomics research. In case you missed it, be sure to check out the Database Issue that came out earlier this year. This web server issue has lots of papers on tools for microRNA analysis, and protein/RNA secondary structure analysis and annotation.

AnnouncementsRBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

I wanted to contribute any content and code I post here to the R Programming Wikibook so I made a slight change to the Creative Commons license on this blog. All the written content is now cc-by-sa and all the code here is still open source BSD.

Biyolojik Bilimlerİngilizce
Yayınlandı

Genome-wide association studies have produced a wealth of new genetic associations to numerous traits over the last few years. As such, new studies of these phenotypes often attempt to replicate previous associations in their samples, or examine how the effects of these SNPs are altered by environmental factors or clinical subtypes.

BioinformaticsGWASRecommended ReadingBiyolojik Bilimlerİngilizce
Yayınlandı
Yazar Stephen Turner

DNA genotyping and sequencing are getting cheaper every day. As Oxford Nanopore CTO Clive Brown recently discussed at Genomes Unzipped, when the cost of a full DNA sequence begins to fall below $1000, the value of having that information far outweighs the cost of data generation.