Getting Genetics Done

Getting Things Done in Genetics & Bioinformatics Research
Bioinformatics, GWAS, Recommended Reading
Author: Stephen Turner

DNA genotyping and sequencing are getting cheaper every day. As Oxford Nanopore CTO Clive Brown recently discussed at Genomes Unzipped, when the cost of a full DNA sequence begins to fall below $1000, the value of having that information far outweighs the cost of data generation.

Announcements
Author: Stephen Turner

Agilent Technologies is fostering integrated, whole-systems approaches to biological research through two $75,000 grants. The application deadline is August 12, 2011. Funds will support academic or nonprofit research projects covering the development of open source Agilent-compatible software tools for integrating data from different omics platforms—genomics, transcriptomics, proteomics, and metabolomics.

Bioinformatics, GWAS, Pathways, Recommended Reading
Author: Stephen Turner

I just read a helpful paper on pathway analysis and interactome reconstruction: Tieri, P., Fuente, A. D., Termanini, A., & Franceschi, C. (2011). Integrating Omics Data for Signaling Pathways, Interactome Reconstruction, and Functional Analysis. In Bioinformatics for Omics Data, Methods in Molecular Biology, vol.

Perl, PubMed

NCBI has put a lot of effort into unifying their data access and retrieval system -- whether you are searching for a gene, protein, or publication, the results are returned in a similar fashion.
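To make the "similar fashion" point concrete, here is a minimal sketch assuming NCBI's E-utilities web service and its ESearch endpoint. The helper name `esearch_url` and the search terms are made up for illustration, and the code only builds the request URLs; it does not contact NCBI.

```python
from urllib.parse import urlencode

# Base URL of NCBI's E-utilities web service.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def esearch_url(db, term, retmax=20):
    """Build an ESearch URL for an Entrez database (hypothetical helper)."""
    params = {"db": db, "term": term, "retmax": retmax}
    return f"{EUTILS_BASE}/esearch.fcgi?{urlencode(params)}"


# The same call shape works whether you want publications, genes, or proteins;
# only the database name changes.
pubmed_url = esearch_url("pubmed", "genome-wide association study")
gene_url = esearch_url("gene", "BRCA1")
protein_url = esearch_url("protein", "BRCA1")

print(pubmed_url)
print(gene_url)
print(protein_url)
```

Fetching any of these URLs (for example with `urllib.request.urlopen`) should return an XML list of matching record IDs, which is what makes one small script reusable across databases.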

Bioinformatics, GWAS, R, Software
Author: Stephen Turner

In a previous post I linked to gcol as a quick and intuitive alternative to awk. I just stumbled across yet another set of handy text file manipulation utilities from the creators of the BEAGLE software for GWAS data imputation and analysis.

Bioinformatics, Recommended Reading
Author: Stephen Turner

After evaluating an unnamed bioinformatics core facility, a group of bioinformaticians in Europe wrote up a short list of basic guidelines for organizing a bioinformatics core facility in large research institutes.

Software
Author: Stephen Turner

A while back Will showed you how to ditch Excel for awk, a handy Unix command line tool for extracting certain rows and columns from a text file. While I was browsing the documentation on the previously mentioned PLINK/SEQ library, I came across gcol, another utility for extracting columns from a tab-delimited text file. It can't do anything that awk can't, but it's easier and more intuitive to use for simple text munging tasks.
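For a sense of the kind of task involved, here is a rough Python sketch of pulling columns out of a tab-delimited file by header name. This is not gcol or awk, just an illustration of the same idea; the function name `select_columns` and the toy SNP table are invented for the example.

```python
import csv
import io


def select_columns(lines, names, delimiter="\t"):
    """Yield rows holding only the named columns, in the order requested."""
    reader = csv.reader(lines, delimiter=delimiter)
    header = next(reader)
    # Look up each requested column's position in the header row.
    idx = [header.index(name) for name in names]
    yield list(names)
    for row in reader:
        yield [row[i] for i in idx]


# Made-up example data standing in for a tab-delimited results file.
data = io.StringIO(
    "SNP\tCHR\tBP\tP\n"
    "rs1\t1\t1000\t0.05\n"
    "rs2\t2\t2000\t0.001\n"
)

rows = list(select_columns(data, ["SNP", "P"]))
for row in rows:
    print("\t".join(row))
```

With a real file you would pass an open file handle instead of the `io.StringIO` object. Selecting by name rather than by position means the code keeps working if the column order changes.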

R, SQL
Author: Stephen Turner

Jeffrey Breen put together a useful slideshow on accessing databases from R. I use RODBC every single day to access my own local MySQL server from R. I've had trouble with RMySQL, so I've always used RODBC instead after setting up my localhost MySQL server as a Windows data source. Once you get accustomed to accessing your data directly with SQL queries rather than dumping files, you'll wonder why you waited so long.
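The general pattern of querying instead of dumping files looks something like the sketch below. Note the substitutions: it uses Python's built-in `sqlite3` module and an in-memory database as a stand-in for RODBC and a MySQL server, and the `results` table and its values are invented for the example.

```python
import sqlite3

# In-memory SQLite database, standing in for a real database server.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE results (snp TEXT, chr INTEGER, p REAL)")
conn.executemany(
    "INSERT INTO results VALUES (?, ?, ?)",
    [("rs1", 1, 0.05), ("rs2", 2, 0.001), ("rs3", 2, 0.2)],
)

# Ask the database for just the rows you need instead of exporting everything
# to a file and filtering afterwards.
hits = conn.execute(
    "SELECT snp, p FROM results WHERE p < ? ORDER BY p", (0.01,)
).fetchall()
conn.close()

print(hits)
```

The filtering and sorting happen inside the database, so only the rows of interest ever reach your analysis session.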

1000 Genomes, Bioinformatics, GWAS, R, Sequencing
Author: Stephen Turner

PLINK/SEQ is an open source C/C++ library for analyzing large-scale genome sequencing data. The library can be accessed via the pseq command line tool, or through an R interface. The project is developed independently of PLINK, but its syntax will be familiar to PLINK users. PLINK/SEQ boasts an impressive feature set for a project still in the beta testing phase.

Bioinformatics, Noteworthy Blogs, Sequencing, Software
Author: Stephen Turner

This is a few months old, but I just got around to reading this series of blog posts on next-generation sequencing (NGS) by Gabe Rudy, Golden Helix's VP of product development. The series gives a seriously useful overview of NGS technology, then delves into the analysis of NGS data at each step, right down to a description of the most commonly used file formats and tools for the job. Check it out now if you haven't already.

Ggplot2, GWAS, R, Visualization
Author: Stephen Turner

**** UPDATE, May 15 2014 ***** The functions described here have now been wrapped into an R package. View the updated blog post or see the online package vignette for how to install and use. If you'd still like to use the old code described here, you can access this at version 0.0.0 on GitHub.