Rogue Scholar

R Biological Sciences

Plot Data Along a Genome with karyoploteR

Published July 7, 2025

Author Stephen Turner

karyoploteR is an R package that’s been in Bioconductor for nearly a decade. It lets you create linear chromosomal representations of any genome with genomic annotations and experimental data plotted along them. Bioconductor : https://bioconductor.org/packages/karyoploteR/ Tutorial : https://bernatgel.github.io/karyoploter_tutorial/ Paper : Bernat Gel & Eduard Serra. (2017).

PapersBiological Sciences

Weekly Recap (July 2025, part 1)

https://doi.org/10.59350/acqjv-a0153

Published July 3, 2025

Author Stephen Turner

This week’s recap highlights the Rust-based wgatools for manipulating alignments and visualizing in the terminal, the nf-core scnanoseq Nextflow pipeline for ONT scRNA-seq, sawfish for better SV discovery and genotyping with long reads, the BINSEQ high-performance binary formats for nucleotide sequence data, and a unified analysis of atlas single-cell data.

Biological Sciences

Moving from academia to biotech

https://doi.org/10.59350/krctr-49t84

Published June 30, 2025

Author Stephen Turner

I originally wrote and published this essay at The Connected Ideas Project, an excellent newsletter by my good friend and colleague Alexander Titus. If you’re not reading TCIP you’re missing out. After I finished my postdoc I was faculty in academia for eight years before moving to a consulting firm for five years, then joined a biotech startup two years ago.

PapersBiological Sciences

Weekly Recap (June 2025, part 2)

https://doi.org/10.59350/wrgv4-2xb84

Published June 27, 2025

Author Stephen Turner

This week’s recap highlights the new Datavzrd tool for interactive visualization and communication of tabular data (I’m genuinely really looking forward to trying this one), tracing the shared foundations of gene expression and chromatin structure, PISA for visualizing cis-regulatory rules in genomic data, fast protein structure searching using structure graph embeddings, and a review/perspective on intrinsically disordered regions as

Biological Sciences

uv, part 4: uv with Jupyter

https://doi.org/10.59350/y0y68-32f93

Published June 23, 2025

Author Stephen Turner

This is part 4 of a series on uv. Other posts in this series: uv, part 1: running scripts and tools uv, part 2: building and publishing packages uv, part 3: Python in R with reticulate I’ve never been a big fan of notebooks, and I’m not the only one. Out of order code execution, hidden state, difficulty diffing in version control, output bloat, etc.

Biological Sciences

Free AI courses from Google

https://doi.org/10.59350/9zymn-3ed97

Published June 16, 2025

Author Stephen Turner

One of my previous employers was a Google Cloud partner, which gave me full and free access to all of Google Cloud’s certification programs, where I took the Professional Cloud Architect and Professional Data Engineer programs. It shouldn’t surprise anyone that with Google leaning hard into GenAI that they have new certification programs and learning paths, like this Generative AI Leader certification.

PapersBiological Sciences

Weekly Recap (June 2025, part 1)

https://doi.org/10.59350/92abc-x0277

Published June 11, 2025

Author Stephen Turner

This week’s recap highlights PhyloSketch for interactively drawing and manipulating phylogenies, Uncalled4 for nanopore DNA and RNA modification detection, Severus for SV calling from long reads, CREsted for modeling synthetic cell type-specific enhancers, and a review on transformers and genome language models.

Biological Sciences

The Modern R Stack for Production AI

https://doi.org/10.59350/z9frb-fr404

Published June 2, 2025

Author Stephen Turner

There was a time in late 2023 to early 2024 when I and probably many others in the R community felt like R was falling woefully behind Python in tooling for development using AI and LLMs. This is no longer the case. The R community, and Posit in particular, have been on an absolute tear bringing new packages online to take advantage of all the capabilities that LLMs provide.

PapersBiological Sciences

Weekly Recap (May 2025, part 3)

https://doi.org/10.59350/ggv04-ksz57

Published May 27, 2025

Author Stephen Turner

This week’s recap highlights polars-bio for fast and scalable and out-of-core operations on large genomic interval datasets, combining DNA and protein alignments to improve genome annotation with LiftOn, feature selection methods for scRNA-seq, STRkit for read-level genotyping of short tandem repeats using long reads and single-nucleotide variation, and nf-core/detaxizer for decontamination of human sequences in metagenomics data.

R TILBiological Sciences

Writing a book with Quarto

https://doi.org/10.59350/9qj3c-7t040

Published May 19, 2025

Author Stephen Turner

In the spirit of learning in public, I wanted an excuse to dive into Quarto to learn more about publishing formats beyond simple PDF and HTML documents. If you’re not familiar, Quarto (quarto.org) is the successor to RMarkdown, the next-generation scientific publishing system that works natively with Python, R, and OJS. If you already have RMarkdown you probably don’t have to do anything to it to get it to render with Quarto.

PapersBiological Sciences

Weekly Recap (May 2025, part 2)

https://doi.org/10.59350/h1rfy-r6306

Published May 15, 2025

Author Stephen Turner

This week’s recap highlights compendium of human gene functions derived from evolutionary modelling from the Gene Ontology Consortium, an AI reasoning model applied to rare disease diagnosis, an agentic AI for scRNA-seq data exploration, and applying FAIR principles to scientific workflows.

Paired Ends

Plot Data Along a Genome with karyoploteR

Weekly Recap (July 2025, part 1)

Moving from academia to biotech

Weekly Recap (June 2025, part 2)

uv, part 4: uv with Jupyter

Free AI courses from Google

Weekly Recap (June 2025, part 1)

The Modern R Stack for Production AI

Weekly Recap (May 2025, part 3)

Writing a book with Quarto

Weekly Recap (May 2025, part 2)