Rogue Scholar

PapersBiologieEnglisch

Weekly Recap (May 2025, part 3)

Veröffentlicht 27. Mai 2025

This week’s recap highlights polars-bio for fast and scalable and out-of-core operations on large genomic interval datasets, combining DNA and protein alignments to improve genome annotation with LiftOn, feature selection methods for scRNA-seq, STRkit for read-level genotyping of short tandem repeats using long reads and single-nucleotide variation, and nf-core/detaxizer for decontamination of human sequences in metagenomics data.

R TILBiologieEnglisch

Writing a book with Quarto

https://doi.org/10.59350/9qj3c-7t040

Veröffentlicht 19. Mai 2025

Autor Stephen Turner

In the spirit of learning in public, I wanted an excuse to dive into Quarto to learn more about publishing formats beyond simple PDF and HTML documents. If you’re not familiar, Quarto (quarto.org) is the successor to RMarkdown, the next-generation scientific publishing system that works natively with Python, R, and OJS. If you already have RMarkdown you probably don’t have to do anything to it to get it to render with Quarto.

PapersBiologieEnglisch

Weekly Recap (May 2025, part 2)

https://doi.org/10.59350/h1rfy-r6306

Veröffentlicht 15. Mai 2025

Autor Stephen Turner

This week’s recap highlights compendium of human gene functions derived from evolutionary modelling from the Gene Ontology Consortium, an AI reasoning model applied to rare disease diagnosis, an agentic AI for scRNA-seq data exploration, and applying FAIR principles to scientific workflows.

R PythonBiologieEnglisch

uv, part 3: Python in R with reticulate

https://doi.org/10.59350/3k5zg-md663

Veröffentlicht 6. Mai 2025

Autor Stephen Turner

This is part 3 of a series on uv. Other posts in this series: uv, part 1: running scripts and tools uv, part 2: building and publishing packages This post uv, part 4: uv with Jupyter Python and R I get the same question all the time from up and coming data scientists in training: “should I use Python or R?” My answer is always the same: it’s not

PapersBiologieEnglisch

Weekly Recap (May 2025, part 1)

https://doi.org/10.59350/aj7v8-nv788

Veröffentlicht 2. Mai 2025

Autor Stephen Turner

This week’s recap highlights new methods in genetic epidemiology, mostly centered around genomic data sharing and privacy-preserving methods: a short commentary on genomic data sharing highlighting how new challenges complicate large-scale data sharing practices, a privacy-preserving method for QTL mapping, privacy-preserving methods for federated biobank-scale GWAS analysis, a Nextflow pipeline for polygenic score QC and construction, and new

BiologieEnglisch

Listen to papers, PDFs, and articles with ElevenReader

https://doi.org/10.59350/870q9-5tg73

Veröffentlicht 25. April 2025

Autor Stephen Turner

I get asked a lot how I have the time to read all these papers and articles I post about here. I don’t. Not all of them at least. I listen to many of them. Lately I’ve been using an app called ElevenReader , made by the popular text-to-speech service ElevenLabs. It’s free for iOS and Android.

PapersBiologieEnglisch

Weekly Recap (April 2025, part 2)

https://doi.org/10.59350/gg2ns-xje57

Veröffentlicht 24. April 2025

Autor Stephen Turner

This week’s recap highlights FLAMES for prioritizing genes at trait-associated GWAS hits, integrating protein language models and an automatic biofoundry for enhanced protein evolution, benchmarking DNA sequence models for causal regulatory variant prediction, and the doubletrouble R/Bioconductor package for identifying and classifying gene and genome duplications.

R BiologieEnglisch

R 4.5.0 and Bioconductor 3.21

https://doi.org/10.59350/a3j7j-xr970

Veröffentlicht 17. April 2025

Autor Stephen Turner

R 4.5.0 was released last week, and Bioconductor 3.21 came a few days later. You can read the R release notes here and the Bioconductor 3.21 announcement here.

PapersBiologieEnglisch

Weekly Recap (April 2025, part 1)

https://doi.org/10.59350/z80ds-7c905

Veröffentlicht 11. April 2025

Autor Stephen Turner

This week’s recap highlights Evo2 for variant effect analysis and genome design, a preprint showing that pretraining doesn’t necessarily increase performance on genomic foundation models, a new R package ggalign for making complex biological data visualizations with ggplot2, and an ancestral reconstruction method for ancient DNA. I also highlight a few reviews in biodiversity genomics.

AIBiologieEnglisch

Build a local RAG application with Open WebUI to chat with your Zotero library

https://doi.org/10.59350/gxvhr-pg694

Veröffentlicht 5. April 2025

Autor Stephen Turner

In a previous post I demonstrated how to set up a local LLM that you can run through either a command line interface (Ollama) or a graphical user interface (Open WebUI and others), and quickly demonstrated how to “chat with your documents” with a local model using LMStudio. In that previous post I simply attached a few documents to a one-off chat.

PapersBiologieEnglisch

Weekly Recap (March 2025, part 2)

https://doi.org/10.59350/4meve-et54

Veröffentlicht 28. März 2025

Autor Stephen Turner

This week’s recap highlights ESCARGOT, an AI agent for biomedical knowledge graphs and reasoning, CASTER for direct species tree inference from whole-genome alignments, the scGPT-spatial foundation model for spatial transcriptomics, the BioChatter platform for biomedical research applications with LLMs, moscot for mapping cells through time and space, and two reviews: one on epigenetic clocks and another on structural variation in the human

Paired Ends

Weekly Recap (May 2025, part 3)

Writing a book with Quarto

Weekly Recap (May 2025, part 2)

uv, part 3: Python in R with reticulate

Weekly Recap (May 2025, part 1)

Listen to papers, PDFs, and articles with ElevenReader

Weekly Recap (April 2025, part 2)

R 4.5.0 and Bioconductor 3.21

Weekly Recap (April 2025, part 1)

Build a local RAG application with Open WebUI to chat with your Zotero library

Weekly Recap (March 2025, part 2)