Rogue Scholar

RTutorialsBiologiaInglese

Compiling RMarkdown from a Helper R Script

Pubblicato 6 agosto 2015

Autore Stephen Turner

The problem I was looking for a way to compile an RMarkdown document and have the filename of the resulting PDF or HTML document contain the name of the input data that it processed. That is, if I compiled the analysis.Rmd file, where in that file it did some analysis and reporting on data001.txt, I’d want the resulting filename to look something like data001.txt.analysis.html.

RTutorialsVisualizationBiologiaInglese

R: single plot with two different y-axes

https://doi.org/10.59350/1n1hp-hs716

Pubblicato 21 aprile 2015

Autore Stephen Turner

I forgot where I originally found the code to do this, but I recently had to dig it out again to remind myself how to draw two different y axes on the same plot to show the values of two different features of the data. This is somewhat distinct from the typical use case of aesthetic mappings in ggplot2 where I want to have different lines/points/colors/etc. for the same feature across multiple subsets of data.

BioinformaticsConferencesData ScienceDatabasesWeb AppsBiologiaInglese

Translational Bioinformatics Year In Review

https://doi.org/10.59350/ewmkk-ef451

Pubblicato 10 aprile 2015

Per tradition, Russ Altman gave his "Translational Bioinformatics: The Year in Review" presentation at the close of the AMIA Joint Summit on Translational Bioinformatics in San Francisco on March 26th. This year, papers came from six key areas (and a final Odds and Ends category). His full slide deck is available here.

ClusteringMachine LearningRVisualizationBiologiaInglese

R User Group Recap: Heatmaps and Using the caret Package

https://doi.org/10.59350/p7ad0-6hd50

Pubblicato 10 aprile 2015

Autore Stephen Turner

At our most recent R user group meeting we were delighted to have presentations from Mark Lawson and Steve Hoang, both bioinformaticians at Hemoshear. All of the code used in both demos is in our Meetup’s GitHub repo.

RBiologiaInglese

Using and Abusing Data Visualization: Anscombe's Quartet and Cheating Bonferroni

https://doi.org/10.59350/4qyw5-99102

Pubblicato 26 febbraio 2015

Autore Stephen Turner

Anscombe’s quartet comprises four datasets that have nearly identical simple statistical properties, yet appear very different when graphed. Each dataset consists of eleven ( x , y ) points. They were constructed in 1973 by the statistician Francis Anscombe to demonstrate both the importance of graphing data before analyzing it and the effect of outliers on statistical properties. Let’s load and view the data.

Microbial Genomics: the State of the Art in 2015

https://doi.org/10.59350/7c8jx-crj32

Pubblicato 4 febbraio 2015

Autore Stephen Turner

Current Opinion in Microbiology recently published a special issue in genomics. In an excellent editorial overview, “Genomics: The era of genomically-enabled microbiology”, Neil Hall and Jay Hinton give an overview of the state of the field in microbial genomics, summarize recent contributions, and give a great synopsis of each of the reviews in this issue.

Ggplot2RBiologiaInglese

R + ggplot2 Graph Catalog

https://doi.org/10.59350/msaaz-2ej63

Pubblicato 3 febbraio 2015

Autore Stephen Turner

Joanna Zhao’s and Jenny Bryan’s R graph catalog is meant to be a complement to the physical book, Creating More Effective Graphs, but it’s a really nice gallery in its own right. The catalog shows a series of different data visualizations, all made with R and ggplot2. Click on any of the plots and you get the R code necessary to generate the data and produce the plot.

Noteworthy BlogsBiologiaInglese

Microbiome Digest Blog

https://doi.org/10.59350/ezw0k-gt934

Pubblicato 20 gennaio 2015

Autore Stephen Turner

I have a noteworthy blogs tag on this blog that I sort of forgot about, and haven't used in years. But I started reading one recently that's definitely qualified for the distinction. The Microbiome Digest is written by Elisabeth Bik, a scientist studying the microbiome at Stanford.

RBiologiaInglese

Using the microbenchmark package to compare the execution time of R expressions

https://doi.org/10.59350/8ryxb-4sq10

Pubblicato 14 gennaio 2015

Autore Stephen Turner

I recently learned about the microbenchmark package while browsing through Hadley’s advanced R programming book. I’ve done some quick benchmarking using system.time() in a for loop and taking the average, but the microbenchmark function in the microbenchmark package makes this much easier.

RBiologiaInglese

Importing Illumina BeadArray data into R

https://doi.org/10.59350/9f7y5-d7754

Pubblicato 8 dicembre 2014

Autore Stephen Turner

A colleague needed some help getting Illumina BeadArray gene expression data loaded into R for data analysis with limma. Hopefully whoever ran your arrays can export the data as text files formatted as described in the code below. If so, you can import those text files directly using the beadarray package.

RRNA-SeqTutorialsBiologiaInglese

RNA-seq Data Analysis Course Materials

https://doi.org/10.59350/avz01-j3p68

Pubblicato 20 novembre 2014

Autore Stephen Turner

Last week I ran a one-day workshop on RNA-seq data analysis in the UVA Health Sciences Library. I set up an AWS public EC2 image with all the necessary software installed.

Getting Genetics Done

Compiling RMarkdown from a Helper R Script

R: single plot with two different y-axes

Translational Bioinformatics Year In Review

R User Group Recap: Heatmaps and Using the caret Package

Using and Abusing Data Visualization: Anscombe's Quartet and Cheating Bonferroni

Microbial Genomics: the State of the Art in 2015

R + ggplot2 Graph Catalog

Microbiome Digest Blog

Using the microbenchmark package to compare the execution time of R expressions

Importing Illumina BeadArray data into R

RNA-seq Data Analysis Course Materials