Rogue Scholar

ggplot2, R, Visualization, Biology, English

Hadley Wickham: ggplot2 Webinar (Today!)

Published February 8, 2012

Author Stephen Turner

Title: A Backstage Tour of ggplot2 with Hadley Wickham
Date: Wednesday, February 8, 2012
Time: 11:00 AM - 12:00 PM Pacific
Presenter: Hadley Wickham, Professor of Statistics, Rice University
Register here.

Biology, English

Joint Techs Netcast: Enhancing Infrastructure Support for Data Intensive Science

https://doi.org/10.59350/ratdh-ej621

Published January 20, 2012

Author Stephen Turner

The winter Joint Techs meeting is next week in Baton Rouge. I'm not going, but I plan on participating via a netcast to see what's going on. Jim Bottum, Clemson's CIO, is moderating an entire day devoted to the topic "Enhancing Infrastructure Support for Data Intensive Science." Of particular interest to me are the talks from 9:30-11am on Tuesday, January 24, from researchers and those supporting climatology, genomics, and the XSEDE projects.

Bioinformatics, R, Software, Biology, English

Annotating limma Results with Gene Names for Affy Microarrays

https://doi.org/10.59350/vw55p-gr892

Published January 17, 2012

Author Stephen Turner

Lately I've been using the limma package often for analyzing microarray data. When I read in Affy CEL files using ReadAffy(), the resulting ExpressionSet won't contain any featureData annotation. Consequently, when I run topTable() to get a list of differentially expressed genes, there's no annotation information other than the Affymetrix probeset IDs or transcript cluster IDs.

Productivity, R, Tutorials, Biology, English

New Year's Resolution: Learn How to Code

https://doi.org/10.59350/mtxn3-1c431

Published January 5, 2012

Author Stephen Turner

Farhad Manjoo at Slate has a good article on why you need to learn how to program. Chances are, if you're reading this post, you're already fairly adept at some form of programming. But if you're not, you should give it some serious thought.

R, Biology, English

Query a MySQL Database from R using RMySQL

https://doi.org/10.59350/ksgk2-vqc08

Published December 15, 2011

Author Stephen Turner

I use this all the time, and the setup is dead simple. Follow the code below to load the RMySQL package, connect to a database (here the UCSC genome browser's public MySQL instance), set up a function to make querying easier, and query the database to return results as a data frame. Getting Genetics Done by Stephen Turner is licensed under a Creative Commons Attribution (CC BY) License.
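The steps named in this teaser (load RMySQL, connect, wrap the query call, get a data frame back) might look roughly like the sketch below. It is not the post's original code: the host name, the `genome` user, the `hg19` database, and the `refGene` table are assumptions taken from UCSC's public MySQL documentation, and `connect_ucsc()` and `make_query()` are hypothetical helper names introduced here for illustration.

```r
# Minimal sketch, assuming the RMySQL package is installed and that UCSC's
# public MySQL server is reachable with the connection details below.
# connect_ucsc() and make_query() are illustrative helpers, not RMySQL API.

# Open a connection to the (assumed) UCSC public MySQL instance.
connect_ucsc <- function(dbname = "hg19") {
  DBI::dbConnect(
    RMySQL::MySQL(),
    user   = "genome",                      # assumed public read-only user
    host   = "genome-mysql.soe.ucsc.edu",   # assumed public host
    dbname = dbname
  )
}

# Return a small helper bound to one connection, so that each query is a
# single call that comes back as a data frame.
make_query <- function(con) {
  function(sql) DBI::dbGetQuery(con, sql)
}

# Example usage (needs network access, so it is left commented out):
# con   <- connect_ucsc()
# query <- make_query(con)
# genes <- query("SELECT name2, chrom, txStart, txEnd FROM refGene LIMIT 10")
# head(genes)
# DBI::dbDisconnect(con)
```

Wrapping `dbGetQuery()` in a closure keeps the connection object out of every call, which is one way to get the "function to make querying easier" the teaser mentions.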

Announcements, Bioinformatics, Writing, Biology, English

Galaxy Project Group on CiteULike and Mendeley

https://doi.org/10.59350/ahgx4-y5v08

Published December 15, 2011

Author Stephen Turner

The Galaxy Project started using CiteULike to organize papers that are about, use, or reference Galaxy. The Galaxy CiteULike group is open to any CUL user, and once you join, you can add papers to the group, assign tags, and rate papers.

Bioinformatics, R, RNA-Seq, Sequencing, Biology, English

RNA-Seq & ChIP-Seq Data Analysis Course at EBI

https://doi.org/10.59350/hmpzy-aje85

Published December 8, 2011

Author Stephen Turner

I just got this announcement from EMBL-EBI about an RNA-seq/ChIP-seq analysis hands-on course.

Bioinformatics, R, RNA-Seq, Sequencing, Biology, English

An example RNA-Seq Quality Control and Analysis Workflow

https://doi.org/10.59350/4hd84-jj453

Published December 6, 2011

Author Stephen Turner

I found the slides below on the education page from Bioinformatics & Research Computing at the Whitehead Institute. The first set (PDF) gives an overview of the methods and software available for quality assessment of microarray and RNA-seq experiments using the FASTX-Toolkit and FastQC. The second set (PDF) gives an example RNA-seq workflow using TopHat, SAMtools, Python/HTSeq, and R/DESeq.

Biology, English

Webinar: Applications of Next-Generation Sequencing in Clinical Care

https://doi.org/10.59350/bfj9t-c1361

Published December 5, 2011

Author Stephen Turner

I just got an email from Illumina about a webinar that looks interesting this Wednesday at 9am PST (noon EST) on clinical applications of next-gen sequencing.
Date: Wednesday, December 7, 2011
Time: 9:00 AM (PST)
Speaker: Rick Dewey, MD, Stanford Center for Inherited Cardiovascular Disease
Next-generation sequencing (NGS) presents both challenges and opportunities for clinical care.

Bioinformatics, Biology, English

BioMart Gene ID Converter

https://doi.org/10.59350/s4erg-f3292

Published November 18, 2011

Author Stephen Turner

BioMart recently got a facelift. I'm not sure if this was always available in the old BioMart, but there's now a link to a gene ID converter that worked pretty well for me for converting S. cerevisiae gene IDs to standard gene names. It looks like the tool will convert nearly any ID you could imagine. Looks like it will also map Affy probe IDs to gene, transcript, or protein IDs and names.

Bioinformatics, R, Biology, English

GEO2R: Web App to Analyze Gene Expression in GEO Datasets Using R

https://doi.org/10.59350/zjmhr-d1z21

Published November 17, 2011

Author Stephen Turner

Gene Expression Omnibus (GEO) is NCBI's repository for publicly available gene expression data, with thousands of datasets covering over 600,000 samples with array or sequencing data. You can download data from GEO using FTP, or download and load the data directly into R using the GEOquery Bioconductor package written (and well documented) by Sean Davis, and analyze the data using the limma package.