Rogue Scholar

RBiological Sciences

Repost: Tidy RAG in R with ragnar

Published July 15, 2025

Author Stephen Turner

Reposted from the original at https://blog.stephenturner.us/p/tidy-rag-in-r-with-ragnar. Retrieval-augmented generation in R using the ragnar package. Demonstration: scraping text from relevant links on a website and using RAG to ask about a university's grant funding.

RBiological Sciences

Repost: The Modern R Stack for Production AI

https://doi.org/10.59350/jnc43-b3g52

Published June 3, 2025

Author Stephen Turner

Reposted from the original at https://blog.stephenturner.us/p/r-production-ai. ... Python isn't the only game in town anymore: R can interact with local and cloud LLM APIs, inspect and modify your local R environment and files, implement RAG, computer vision, NLP, evals, &

RBiological Sciences

Repost: Writing a book with Quarto

https://doi.org/10.59350/fp38z-9gb40

Published May 23, 2025

Author Stephen Turner

Reposted from the original at https://blog.stephenturner.us/p/quarto-books. ... In the spirit of learning in public, I wanted an excuse to dive into Quarto to learn more about publishing formats beyond simple PDF and HTML documents. If you’re not familiar, Quarto (quarto.org) is the successor to RMarkdown, the next-generation scientific publishing system that works natively with Python, R, and OJS.

RBiological Sciences

Repost: uv, part 3: Python in R with reticulate

https://doi.org/10.59350/5f1c0-p5d14

Published May 6, 2025

Author Stephen Turner

Reposted from the original at https://blog.stephenturner.us/p/uv-part-3-python-in-r-with-reticulate. Two demos using Python in R via reticulate+uv: (1) Hugging Face transformers for sentiment analysis, (2) pyBigWig to query a BigWig file and visualize with ggplot2.

RBiological Sciences

Repost: R 4.5.0 and Bioconductor 3.21

https://doi.org/10.59350/p41t7-1x405

Published April 17, 2025

Author Stephen Turner

Reposted from the original at https://blog.stephenturner.us/p/r-450-bioconductor-321. Faster package installation, import only the functions you want with use(), built-in Palmer penguins data, grep values shortcut, and lots of new bioinformatics packages in Bioconductor ... R 4.5.0 was released last week, and Bioconductor 3.21 came a few days later.

RBiological Sciences

Repost: Bluesky conversation analysis with local and frontier LLMs with R/Tidyverse

https://doi.org/10.59350/2c6mk-cce32

Published December 30, 2024

Author Stephen Turner

Reposted from the original at https://blog.stephenturner.us/p/bluesky-analysis-claude-llama-tidyverse.

RBiological Sciences

Use an LLM to translate help documentation on-the-fly

https://doi.org/10.59350/q7kqf-s1990

Published December 16, 2024

Author Stephen Turner

Reposted from Paired Ends at https://blog.stephenturner.us/p/llm-translate-documentation. The lang package overrides the ? and help() functions in your R session. The translated help page will appear in the help pane in RStudio or Positron. It can also translate your Roxygen documentation.

RBiological Sciences

Turn a GitHub repo into a single text file for LLM-friendly input (repost)

https://doi.org/10.59350/3t3wg-g6750

Published December 9, 2024

Author Stephen Turner

Reposted from the original at https://blog.stephenturner.us/p/github-repo-to-text-for-llm-input. If you use ChatGPT, Claude, or even some local model through Ollama or HuggingFace Assistants, you’ll know that the chat interface makes it challenging to feed in an entire repo like a Python or R package, because functions, tests, etc. can be scattered across many files throughout a repo.

RBiological Sciences

Tech I'm thankful for (repost)

https://doi.org/10.59350/x3yfq-n1j07

Published November 25, 2024

Author Stephen Turner

Reposted from https://blog.stephenturner.us/p/tech-im-thankful-for-2024. Data science and bioinformatics tech I'm thankful for in 2024: tidyverse, RStudio, Positron, Bluesky, blogs, Quarto, bioRxiv, LLMs for code, Ollama, Seqera Containers, StackOverflow, ... It’s a short week here in the US. As I reflect on the tools that shape modern bioinformatics and data science, it’s striking to see how far we’ve come in the 20 years I’ve

RBiological Sciences

Expand your Bluesky network with R (repost)

https://doi.org/10.59350/rz9fd-3rw89

Published November 20, 2024

Author Stephen Turner

This is reposted from the original at https://blog.stephenturner.us/p/expand-your-bluesky-network-with-r. I’m encouraging everyone I know online to join the scientific community on Bluesky (see the post "Bluesky for Science", Nov 16). In that post I link to several starter packs: lists of accounts posting about a topic that you can follow individually or all at once to start filling out your network. I started

RBiological Sciences

Build a Python CLI with Click+Cookiecutter (repost)

https://doi.org/10.59350/pgqr6-td115

Published November 10, 2024

Author Stephen Turner

Reposted from the original at https://blog.stephenturner.us/p/python-cli-click-cookiecutter. In the spirit of Learning in Public, I wanted an excuse to explore (1) click for creating command line interfaces, (2) Cookiecutter project templates, and (3) modern tools in the Python packaging ecosystem.

Getting Genetics Done
