Rogue Scholar

AnnouncementsBiyolojik Bilimlerİngilizce

Personal Genomics and Data Sharing Survey

Yayınlandı 31 Ağustos 2011

Yazar Stephen Turner

I was recently contacted by a couple of German biologists working on a project evaluating opinions on sharing raw data from DTC genetic testing companies like 23andme. A handful of people like the gang at Genomes Unzipped, the PGP-10, and others at SNPedia have released their own genotype or sequencing data into the public domain. As of now, data like this is scattered around the web and most of it is not attached to any phenotype data.

BioinformaticsBiyolojik Bilimlerİngilizce

Bioinformatics Posters Collection

https://doi.org/10.59350/m2fc2-zrn57

Yayınlandı 29 Ağustos 2011

Yazar Stephen Turner

I mentioned BioStar in a previous post about getting all your questions answered. I can't emphasize enough how helpful the BioStar and other StackExchange communities are. Whenever I ask a statistics question on CrossValidated or a programming question on StackOverflow I often multiple answers within 10 minutes.

GWASRecommended ReadingSoftwareBiyolojik Bilimlerİngilizce

Estimating Trait Heritability from GWAS Data

https://doi.org/10.59350/y5ycj-xj779

Yayınlandı 22 Ağustos 2011

Peter Visscher and colleagues have recently published a flurry of papers employing a new software package called GCTA to estimate the heritability of traits using GWAS data (GCTA stands for Genome-wide Complex Trait Analysis -- clever acronymity!). The tool, supported (and presumably coded) by Jian Yang is remarkably easy to use, based in part on the familiar PLINK commandline interface.

ProductivityRBiyolojik Bilimlerİngilizce

Sync Your Rprofile Across Multiple R Installations

https://doi.org/10.59350/82vpr-8fa10

Yayınlandı 15 Ağustos 2011

Yazar Stephen Turner

Your Rprofile is a script that R executes every time you launch an R session.

BioinformaticsRRecommended ReadingStatisticsTutorialsBiyolojik Bilimlerİngilizce

Friday Links: R, OpenHelix Bioinformatics Tips, 23andMe, Perl, Python, Next-Gen Sequencing

https://doi.org/10.59350/yv2q1-qp048

Yayınlandı 5 Ağustos 2011

Yazar Stephen Turner

I haven't posted much here recently, but here is a roundup of a few of the links I've shared on Twitter (@genetics_blog) over the last two weeks. Here is a nice tutorial on accessing high-throughput public data (from NCBI) using R and Bioconductor. Cloudnumbers.com , a startup that allows you to run high-performance computing (HPC) applications in the cloud, now supports the previously mentioned R IDE, RStudio.

Ggplot2RVisualizationBiyolojik Bilimlerİngilizce

Scatterplot matrices in R

https://doi.org/10.59350/gc38j-syp94

Yayınlandı 25 Temmuz 2011

Yazar Stephen Turner

I just discovered a handy function in R to produce a scatterplot matrix of selected variables in a dataset. The base graphics function is pairs(). Producing these plots can be helpful in exploring your data, especially using the second method below. Try it out on the built in iris dataset.

1000 GenomesSequencingBiyolojik Bilimlerİngilizce

Download 69 Complete Human Genomes

https://doi.org/10.59350/57mnp-sfa92

Yayınlandı 12 Temmuz 2011

Yazar Stephen Turner

Sequencing company Complete Genomics recently made available 69 ethnically diverse complete human genome sequences: a Yoruba trio; a Puerto Rican trio; a 17-member, 3-generation pedigree; and a diversity panel representing 9 different populations. Some of the samples partially overlap with HapMap and the 1000 Genomes Project. The data can be downloaded directly from the FTP site.

BioinformaticsPathwaysRBiyolojik Bilimlerİngilizce

Mapping SNPs to Genes for GWAS Enrichment Analysis

https://doi.org/10.59350/3vmp0-v1t07

Yayınlandı 30 Haziran 2011

There are several tools available for conducting a post-hoc analysis of GWAS data looking for enrichment of significant SNPs using literature or pathway based resources. Examples include GRAIL, ALLIGATOR, and WebGestalt among others (see SNPath R Package). Since gene enrichment and pathway analysis essentially evolved from methods for analyzing gene expression data, many of these tools require specific gene identifiers as input.

BioinformaticsRecommended ReadingBiyolojik Bilimlerİngilizce

Nucleic Acids Research Web Server Issue

https://doi.org/10.59350/6ard5-ann05

Yayınlandı 29 Haziran 2011

Yazar Stephen Turner

Nucleic Acids Research just published its Web Server Issue, featuring new and updates to existing web servers and applications for genomics and proteomics research. In case you missed it, be sure to check out the Database Issue that came out earlier this year. This web server issue has lots of papers on tools for microRNA analysis, and protein/RNA secondary structure analysis and annotation.

AnnouncementsRBiyolojik Bilimlerİngilizce

Steal This Blog!

https://doi.org/10.59350/nmvp1-pp625

Yayınlandı 23 Haziran 2011

Yazar Stephen Turner

I wanted to contribute any content and code I post here to the R Programming Wikibook so I made a slight change to the Creative Commons license on this blog. All the written content is now cc-by-sa and all the code here is still open source BSD.

Biyolojik Bilimlerİngilizce

Displaying Regression Coefficients from Complex Analyses

https://doi.org/10.59350/1wyq2-8f261

Yayınlandı 14 Haziran 2011

Genome-wide association studies have produced a wealth of new genetic associations to numerous traits over the last few years. As such, new studies of these phenotypes often attempt to replicate previous associations in their samples, or examine how the effects of these SNPs are altered by environmental factors or clinical subtypes.

Getting Genetics Done

Personal Genomics and Data Sharing Survey

Bioinformatics Posters Collection

Estimating Trait Heritability from GWAS Data

Sync Your Rprofile Across Multiple R Installations

Friday Links: R, OpenHelix Bioinformatics Tips, 23andMe, Perl, Python, Next-Gen Sequencing

Scatterplot matrices in R

Download 69 Complete Human Genomes

Mapping SNPs to Genes for GWAS Enrichment Analysis

Nucleic Acids Research Web Server Issue

Steal This Blog!

Displaying Regression Coefficients from Complex Analyses