BiologieAnglaisBlogger

Getting Genetics Done

Getting Things Done in Genetics & Bioinformatics Research
Page d'accueil
language
BiologieAnglais
Publié
Auteur Stephen Turner

The winter Joint Techs meeting is next week in Baton Rouge. I'm not going, but I plan on participating via a netcast to see what's going on. Jim Bottum, Clemson's CIO, is moderating an entire day devoted to the topic Enhancing Infrastructure Support for Data Intensive Science. Of particular interest to me are the talks from 9:30-11am Tuesday January 24 from researchers and those supporting climatology, genomics, and the XSEDE projects.

BioinformaticsRSoftwareBiologieAnglais
Publié
Auteur Stephen Turner

Lately I've been using the limma package often for analyzing microarray data. When I read in Affy CEL files using ReadAffy(), the resulting ExpressionSet won't contain any featureData annotation. Consequentially, when I run topTable to get a list of differentially expressed genes, there's no annotation information other than the Affymetrix probeset IDs or transcript cluster IDs.

ProductivityRTutorialsBiologieAnglais
Publié
Auteur Stephen Turner

Farhad Manjoo at Slate has a good article on why you need to learn how to program. Chances are, if you're reading this post here you're already fairly adept at some form of programming. But if you're not, you should give it some serious thought.

RBiologieAnglais
Publié
Auteur Stephen Turner

I use this all the time, and the setup is dead simple. Follow the code below to load the RMySQL package, connect to a database (here the UCSC genome browser's public MySQL instance), set up a function to make querying easier, and query the database to return results as a data frame. Getting Genetics Done by Stephen Turner is licensed under a Creative Commons Attribution (CC BY) License.

AnnouncementsBioinformaticsWritingBiologieAnglais
Publié
Auteur Stephen Turner

The Galaxy Project started using CiteULike to organize papers that are about, use, or reference Galaxy. The Galaxy CiteULike group is open to any CUL user, and once you join, you can add papers to the group, assign tags, and rate papers.

BioinformaticsRRNA-SeqSequencingBiologieAnglais
Publié
Auteur Stephen Turner

I found the slides below on the education page from Bioinformatics & Research Computing at the Whitehead Institute. The first set (PDF) gives an overview of the methods and software available for quality assessment of microarray and RNA-seq experiments using the FastX toolkit and FastQC. The second set (PDF)  gives an example RNA-seq workflow using TopHat, SAMtools, Python/HTseq, and R/DEseq.

BiologieAnglais
Publié
Auteur Stephen Turner

I just got an email from Illumina about a webinar that looks interesting this Wednesday at 9am PST (noon EST) on clinical applications of next-gen sequencing. Date: Wednesday, December 7, 2011Time: 9:00 AM (PST)Speaker: Rick Dewey, MD, Stanford Center for Inherited Cardiovascular Disease Next-generation sequencing (NGS) presents both challenges and opportunities for clinical care.

BioinformaticsBiologieAnglais
Publié
Auteur Stephen Turner

BioMart recently got a facelift. I'm not sure if this was always available in the old BioMart, but there's now a link to a gene ID converter that worked pretty well for me for converting S. cerevisiae gene IDs to standard gene names. It looks like the tool will convert nearly any ID you could imagine. Looks like it will also map Affy probe IDs to gene, transcript, or protein IDs and names.

BioinformaticsRBiologieAnglais
Publié
Auteur Stephen Turner

Gene Expression Omnibus is NCBI's repository for publicly available gene expression data with thousands of datasets having over 600,000 samples with array or sequencing data. You can download data from GEO using FTP, or download and load the data directly into R using the GEOquery bioconductor package written (and well documented) by Sean Davis, and analyze the data using the limma package.

Recommended ReadingRNA-SeqSequencingTutorialsBiologieAnglais
Publié
Auteur Stephen Turner

James Taylor came to UVA last week and gave an excellent talk on how Galaxy enables transparent and reproducible research in genomics. I'm gearing up to take on several projects that involve next-generation sequencing, and I'm considering installing my own Galaxy framework on a local cluster or on the cloud. If you've used Galaxy in the past you're probably aware that it allows you to share data, workflows, and histories with other users.