Rogue Scholar

GithubLinuxPythonRTutorialsBiologíaInglés

Software Carpentry at UVA, Redux

Publicado 12 de marzo de 2014

Software Carpentry is an international collaboration backed by Mozilla and the Sloan Foundation comprising a team of volunteers that teach computational competence and basic programming skills to scientists.

BioinformaticsRBiologíaInglés

Data Analysis for Genomics MOOC

https://doi.org/10.59350/9a456-xmd75

Publicado 20 de febrero de 2014

Autor Stephen Turner

Last month I told you about Coursera's specializations in data science, systems biology, and computing. Today I was reading Jeff Leek's blog post defending p-values and found a link to HarvardX's Data Analysis for Genomics course, taught by Rafael Irizarry and Mike Love.

BioinformaticsData ScienceRBiologíaInglés

There is no Such Thing as Biomedical "Big Data"

https://doi.org/10.59350/h2yx0-7dw32

Publicado 11 de febrero de 2014

Autor Stephen Turner

At the moment, the world is obsessed with “Big Data” yet it sometimes seems that people who use this phrase don’t have a good grasp of its meaning. Like most good buzz-words, “Big Data” sparks the idea of something grand and complicated, while sounding ordinary enough that listeners feel like they have an intuitive understanding of the concept.

LinuxBiologíaInglés

GNU Screen

https://doi.org/10.59350/dv5k0-g7k03

Publicado 30 de enero de 2014

Autor Stephen Turner

This is one of those things I picked up years ago while in graduate school that I just assumed everyone else already knew about. GNU screen is a great utility built-in to most Linux installations for remote session management. Typing 'screen' at the command line enters a new screen session.

BioinformaticsRStatisticsTutorialsBiologíaInglés

Coursera Specializations: Data Science, Systems Biology, Python Programming

https://doi.org/10.59350/aaq76-apf20

Publicado 22 de enero de 2014

Autor Stephen Turner

I first mentioned Coursera about a year ago, when I hired a new analyst in my core. This new hire came in as a very competent Python programmer with a molecular biology and microbial ecology background, but with very little experience in statistics.

LinuxPerlTutorialsBiologíaInglés

How To Install BioPerl Without Root Privileges

https://doi.org/10.59350/qg5em-aag35

Publicado 13 de enero de 2014

Autor Stephen Turner

I've seen this question asked and partially answered all around the web. As with anything related to Perl, I'm sure there is more than one way to do it. Here's how I do it with Perl 5.10.1 on CentOS 6.4. First, install local::lib with bootstrapping method as described here.

BioinformaticsMetagenomicsPythonRRecommended ReadingBiologíaInglés

Jeff Leek's non-comprehensive list of awesome things other people did in 2013

https://doi.org/10.59350/j87eq-84k69

Publicado 31 de diciembre de 2013

Autor Stephen Turner

Jeff Leek, biostats professor at Johns Hopkins and instructor of the Coursera Data Analysis course, recently posted on Simly Statistics this list of awesome things other people accomplished in 2013 in genomics, statistics, and data science. At risk of sounding too meta , I'll say that this list itself is one of the awesome things that was put together in 2013.

BioinformaticsSoftwareBiologíaInglés

Curoverse raises $1.5M to develop & support an open-source bioinformatics data analysis platform

https://doi.org/10.59350/e8f1k-2ev20

Publicado 18 de diciembre de 2013

Autor Stephen Turner

Boston-based startup Curoverse has announced $1.5 million in funding to develop and support the open-source Arvados platform for cloud-based bioinformatics & genomics data analysis. The Arvados platform was developed in George Church's lab by scientists and engineers led by Alexander Wait Zaranek, now scientific director at Curoverse.

BioinformaticsDatabasesTutorialsBiologíaInglés

Biostar Tutorial: Cheat sheet for one-based vs zero-based coordinate systems

https://doi.org/10.59350/9y10w-1y991

Publicado 9 de diciembre de 2013

Autor Stephen Turner

Obi Griffith over at Biostar put together this excellent cheat sheet for dealing with one-based and zero-based genomic coordinate systems. The cheat sheet visually explains the difference between zero and one-based coordinate systems, as well as how to indicate a position, SNP, range, or indel using both coordinate systems.

AnnotationBioinformaticsGWASPLINKSQLBiologíaInglés

Using Database Joins to Compare Results Sets

https://doi.org/10.59350/xwgvw-4xg85

Publicado 20 de noviembre de 2013

Autor Stephen Turner

One of the most powerful tools you can learn to use in genomics research is a relational database system, such as MySQL. These systems are fairly easy to setup and use, and provide users the ability to organize and manipulate data and statistical results with simple commands. As a graduate student (during the height of GWAS), this single skill quickly turned me into an “expert”.

GWASRVisualizationBiologíaInglés

A Mitochondrial Manhattan Plot

https://doi.org/10.59350/dvd1d-ywx41

Publicado 6 de noviembre de 2013

Autor Stephen Turner

Manhattan plots have become the standard way to visualize results for genetic association studies, allowing the viewer to instantly see significant results in the rough context of their genomic position. Manhattan plots are typically shown on a linear X-axis (although the circos package can be used for radial plots), and this is consistent with the linear representation of the genome in online genome browsers.

Getting Genetics Done

Software Carpentry at UVA, Redux

Data Analysis for Genomics MOOC

There is no Such Thing as Biomedical "Big Data"

GNU Screen

Coursera Specializations: Data Science, Systems Biology, Python Programming

How To Install BioPerl Without Root Privileges

Jeff Leek's non-comprehensive list of awesome things other people did in 2013

Curoverse raises $1.5M to develop & support an open-source bioinformatics data analysis platform

Biostar Tutorial: Cheat sheet for one-based vs zero-based coordinate systems

Using Database Joins to Compare Results Sets

A Mitochondrial Manhattan Plot