Rogue Scholar

BioinformaticsConferencesMetagenomicsRRNA-SeqBiological Sciences

Automated Archival and Visual Analysis of Tweets Mentioning #bog13, Bioinformatics, #rstats, and Others

Published May 15, 2013

Author Stephen Turner

Automatically Archiving Twitter Results Ever since Twitter gamed its own API and killed off great services like IFTTT triggers, I've been looking for a way to automatically archive tweets containing certain search terms of interest to me. Twitter's built-in search is limited, and I wanted to archive interesting tweets for future reference and to start playing around with some basic text / trend analysis.

BioinformaticsMetagenomicsRecommended ReadingBiological Sciences

Three Metagenomics Papers for You

https://doi.org/10.59350/3e6pa-jxv67

Published May 6, 2013

Author Stephen Turner

A handful of good metagenomics papers have come out over the last few months. Below I've linked to and copied my evaluation of each of these articles from F1000. ... 1. Willner, Dana, and Philip Hugenholtz. "From deep sequencing to viral tagging: Recent advances in viral metagenomics." BioEssays (2013). My evaluation: This review lays out some of the challenges and recent advances in viral metagenomic sequencing.

AnnouncementsBioinformaticsRTutorialsBiological Sciences

List of Bioinformatics Workshops and Training Resources

https://doi.org/10.59350/gywd8-09p35

Published April 4, 2013

Author Stephen Turner

I frequently get asked to recommend workshops or online learning resources for bioinformatics, genomics, statistics, and programming. I compiled a list of both online learning resources and in-person workshops (preferentially highlighting those where workshop materials are freely available online): List of Bioinformatics Workshops and Training Resources I hope to keep the page above as up-to-date as possible.

BioinformaticsClusteringConferencesMachine LearningBiological Sciences

Evolutionary Computation and Data Mining in Biology

https://doi.org/10.59350/f6r8r-thn86

Published March 27, 2013

For over 15 years, members of the computer science, machine learning, and data mining communities have gathered in a beautiful European location each spring to share ideas about biologically-inspired computation. Stemming from the work of John Holland who pioneered the field of genetic algorithms, multiple approaches have been developed that exploit the dynamics of natural systems to solve computational problems.

AnnouncementsConferencesDatabasesRecommended ReadingSoftwareBiological Sciences

Software Carpentry Bootcamp at University of Virginia

https://doi.org/10.59350/vfsme-6z247

Published March 19, 2013

Author Stephen Turner

A couple of weeks ago I, with the help of others here at UVA, organized a Software Carpentry bootcamp, instructed by Steve Crouch, Carlos Anderson, and Ben Morris. The day before the course started, Charlottesville was racked by nearly a foot of snow, widespread power outages, and many cancelled incoming flights. Luckily our instructors arrived just in time, and power was (mostly) restored shortly before the boot camp started.

BioinformaticsMetagenomicsRecommended ReadingSoftwareBiological Sciences

Comparing Sequence Classification Algorithms for Metagenomics

https://doi.org/10.59350/zpr5r-85f13

Published March 4, 2013

Author Stephen Turner

Metagenomics is the study of DNA collected from environmental samples (e.g., seawater, soil, acid mine drainage, the human gut, sputum, pus, etc.). While traditional microbial genomics typically means sequencing a pure cultured isolate, metagenomics involves taking a culture-free environmental sample and sequencing a single gene (e.g. the 16S rRNA gene), multiple marker genes, or shotgun sequencing everything in the sample in order to

PathwaysTutorialsVisualizationWeb AppsBiological Sciences

NetGestalt for Data Visualization in the Context of Pathways

https://doi.org/10.59350/kqnp4-ey394

Published February 20, 2013

Many of you may be familiar with WebGestalt, a wonderful web utility developed by Bing Zhang at Vanderbilt for doing basic gene-set enrichment analyses. Last year, we invited Bing to speak at our annual retreat for the Vanderbilt Graduate Program in Human Genetics, and he did not disappoint! Bing walked us through his new tool called NetGestalt.

BioinformaticsRRecommended ReadingBiological Sciences

"Document Design and Purpose, Not Mechanics"

https://doi.org/10.59350/4822m-cj548

Published February 12, 2013

Author Stephen Turner

If you ever write code for scientific computing (chances are you do if you're here), stop what you're doing and spend 8 minutes reading this open-access paper: Wilson et al. Best Practices for Scientific Computing.

BioinformaticsRNA-SeqSequencingStatisticsWeb AppsBiological Sciences

Scotty, We Need More Power! Power, Sample Size, and Coverage Estimation for RNA-Seq

https://doi.org/10.59350/92fbk-t3s93

Published January 28, 2013

Author Stephen Turner

Two of the most common questions at the beginning of an RNA-seq experiments are "how many reads do I need?" and "how many replicates do I need?". This paper describes a web application for designing RNA-seq applications that calculates an appropriate sample size and read depth to satisfy user-defined criteria such as cost, maximum number of reads or replicates attainable, etc.

BioinformaticsConferencesRecommended ReadingBiological Sciences

The Pacific Symposium on Biocomputing 2013

https://doi.org/10.59350/trkr6-bgm87

Published January 15, 2013

For 18 years now, computational biologists have convened on the beautiful islands of Hawaii to present and discuss research emerging from new areas of biomedicine. PSB Conference Chairs Teri Klein (@teriklein), Keith Dunker, Russ Altman (@Rbaltman) and Larry Hunter (@ProfLHunter) organize innovative sessions and tutorials that are always interactive and thought-provoking.

DatabasesSoftwareWeb AppsWritingBiological Sciences

Stop Hosting Data and Code on your Lab Website

https://doi.org/10.59350/9zcfd-83n38

Published January 8, 2013

Author Stephen Turner

It's happened to all of us. You read about a new tool, database, webservice, software, or some interesting and useful data, but when you browse to http://instititution.edu/~home/professorX/lab/data, there's no trace of what you were looking for. THE PROBLEM This isn't an uncommon problem. See the following two articles: The first gives us some alarming statistics.

Getting Genetics Done

Automated Archival and Visual Analysis of Tweets Mentioning #bog13, Bioinformatics, #rstats, and Others

Three Metagenomics Papers for You

List of Bioinformatics Workshops and Training Resources

Evolutionary Computation and Data Mining in Biology

Software Carpentry Bootcamp at University of Virginia

Comparing Sequence Classification Algorithms for Metagenomics

NetGestalt for Data Visualization in the Context of Pathways

"Document Design and Purpose, Not Mechanics"

Scotty, We Need More Power! Power, Sample Size, and Coverage Estimation for RNA-Seq

The Pacific Symposium on Biocomputing 2013

Stop Hosting Data and Code on your Lab Website