Rogue Scholar

Bioinformatics, R, Software, Tutorials, Visualization, Biological Sciences

qqman: an R package for creating Q-Q and manhattan plots from GWAS results

Published May 15, 2014

Author Stephen Turner

Three years ago I wrote a blog post on how to create manhattan plots in R. After hundreds of comments pointing out bugs and other issues, I've finally cleaned up this code and turned it into an R package.

1000 Genomes, Bioinformatics, Databases, Ethics, News, Biological Sciences

Mycoplasma Contamination in Cell-Line Based Experiments

https://doi.org/10.59350/pxsvc-fpj89

Published May 6, 2014

For a few years now, my EvoSTAR colleague, Bill Langdon, has been exploring the degree to which Mycoplasma bacteria have contaminated experimental systems and even "infected" online databases with the contents of their genomes.

Bioinformatics, Conferences, GWAS, Pathways, Recommended Reading, Biological Sciences

Russ Altman's Translational Bioinformatics Year in Review

https://doi.org/10.59350/q4jbp-3nj46

Published April 22, 2014

A few weeks ago the 2014 AMIA Translational Bioinformatics Meeting (TBI) was held in beautiful San Francisco. This meeting is full of great science that spans the divide between molecular and clinical research, but a true highlight of this meeting is the closing keynote, traditionally given by Russ Altman.

Web Apps, Writing, Biological Sciences

Unsuck your writing

https://doi.org/10.59350/f3s73-yw993

Published April 8, 2014

Author Stephen Turner

I recently found this little gem of a web app that analyzes the clarity of your writing. Hemingway highlights long, complex, and hard-to-read sentences. It also highlights complex words where a simple one would do, and flags adverbs, suggesting you use a stronger verb instead. It highlights passive voice (bad!) and tells you the minimum reading grade level needed to understand your writing.

Bioinformatics, R, Visualization, Biological Sciences

Visualize coverage for targeted NGS (exome) experiments

https://doi.org/10.59350/ttga5-v0s14

Published March 20, 2014

Author Stephen Turner

I'm calling variants from exome sequencing data, and I need to evaluate the efficiency of the capture and the coverage along the target regions. This sounds like a great use case for bedtools, your Swiss Army knife for genomic arithmetic and interval manipulation.
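To make the idea of "coverage along the target regions" concrete, here is a minimal sketch in Python. It is not the workflow from the post itself: it assumes per-base depths over the target regions have already been computed (for example with a tool such as bedtools), and the function name `fraction_covered`, the thresholds, and the depth values are all hypothetical, chosen only for illustration.

```python
from typing import Dict, Iterable, List


def fraction_covered(depths: Iterable[int], thresholds: List[int]) -> Dict[int, float]:
    """For each threshold t, return the fraction of target bases with depth >= t."""
    depth_list = list(depths)
    if not depth_list:
        raise ValueError("no depths supplied")
    total = len(depth_list)
    return {t: sum(d >= t for d in depth_list) / total for t in thresholds}


# Hypothetical per-base depths across a tiny target region (illustrative only).
example_depths = [0, 5, 12, 30, 30, 8, 0, 22, 15, 10]

coverage_summary = fraction_covered(example_depths, thresholds=[1, 10, 20])
for threshold, fraction in sorted(coverage_summary.items()):
    print(f"fraction of target bases at >= {threshold}x: {fraction:.2f}")
```

Plotting these fractions against the depth threshold, one curve per sample, is one common way to compare capture efficiency across samples.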

Github, Linux, Python, R, Tutorials, Biological Sciences

Software Carpentry at UVA, Redux

https://doi.org/10.59350/n8xed-a5z89

Published March 12, 2014

Author Stephen Turner

Software Carpentry is an international collaboration, backed by Mozilla and the Sloan Foundation, whose team of volunteers teaches computational competence and basic programming skills to scientists.

Bioinformatics, R, Biological Sciences

Data Analysis for Genomics MOOC

https://doi.org/10.59350/9a456-xmd75

Published February 20, 2014

Author Stephen Turner

Last month I told you about Coursera's specializations in data science, systems biology, and computing. Today I was reading Jeff Leek's blog post defending p-values and found a link to HarvardX's Data Analysis for Genomics course, taught by Rafael Irizarry and Mike Love. Here's the course description: If you've ever wanted to get started with data analysis in genomics and you'd like to learn R along the way, this looks like a great place to start.

Bioinformatics, Data Science, R, Biological Sciences

There is no Such Thing as Biomedical "Big Data"

https://doi.org/10.59350/h2yx0-7dw32

Published February 11, 2014

At the moment, the world is obsessed with “Big Data”, yet it sometimes seems that people who use this phrase don’t have a good grasp of its meaning. Like most good buzzwords, “Big Data” sparks the idea of something grand and complicated, while sounding ordinary enough that listeners feel they have an intuitive understanding of the concept.

Linux, Biological Sciences

GNU Screen

https://doi.org/10.59350/dv5k0-g7k03

Published January 30, 2014

Author Stephen Turner

This is one of those things I picked up years ago in graduate school and just assumed everyone else already knew about. GNU Screen is a great utility for remote session management that is built into most Linux installations. Typing 'screen' at the command line starts a new screen session.

Bioinformatics, R, Statistics, Tutorials, Biological Sciences

Coursera Specializations: Data Science, Systems Biology, Python Programming

https://doi.org/10.59350/aaq76-apf20

Published January 22, 2014

Author Stephen Turner

I first mentioned Coursera about a year ago, when I hired a new analyst in my core. This new hire came in as a very competent Python programmer with a molecular biology and microbial ecology background, but with very little experience in statistics.

Linux, Perl, Tutorials, Biological Sciences

How To Install BioPerl Without Root Privileges

https://doi.org/10.59350/qg5em-aag35

Published January 13, 2014

Author Stephen Turner

I've seen this question asked and partially answered all around the web. As with anything related to Perl, I'm sure there is more than one way to do it. Here's how I do it with Perl 5.10.1 on CentOS 6.4. First, install local::lib with the bootstrapping method as described here.

Getting Genetics Done
