Rogue Scholar

R, Nextflow, Python, Biology, English

Tech I'm thankful for (2024)

Published November 25, 2024

It’s a short week here in the US. As I reflect on the tools that shape modern bioinformatics and data science, it’s striking to see how far we’ve come in the 20 years I’ve been in this field. Today’s ecosystem is rich with tools that make our work faster, better, more enjoyable, and increasingly accessible.

Papers, Biology, English

Weekly Recap (Nov 2024, part 3)

https://doi.org/10.59350/jvvh2-8sh15

Published November 22, 2024

Author Stephen Turner

This week’s recap highlights pangenome graph construction with nf-core/pangenome, building pangenome graphs with PGGB, benchmarking algorithms for single-cell multi-omics prediction and integration, RNA foundation models, and a Nextflow pipeline for characterizing B cell receptor repertoires from non-targeted bulk RNA-seq data.

R, Biology, English

Expand your Bluesky network with R

https://doi.org/10.59350/pmcr6-asp17

Published November 20, 2024

Author Stephen Turner

This post is inspired by the Bluesky Network Analyzer made by @theo.io. I’m encouraging everyone I know online to join the scientific community on Bluesky. In that post I link to several starter packs — lists of accounts posting about a topic that you can follow individually or all at once to start filling out your network. I started following accounts of people I knew from X and from a few starter packs I came across.

R, Nextflow, Biology, English

Bluesky for Science

https://doi.org/10.59350/ggpbb-g4471

Published November 16, 2024

Author Stephen Turner

I joined Twitter way back in 2009. For nearly 10 years “scitwitter” was an amazing place for discussion, discovery, and engagement with the scientific community. The #Rstats and #pydata hashtags were great places to learn about something new in programming, #icanhazpdf was great for getting papers you didn’t have access to, and conference live-tweeting was common and useful for those of us with FOMO not able to make it in person.

Papers, Biology, English

Weekly Recap (Nov 2024, part 2)

https://doi.org/10.59350/hp1b1-wnb98

Published November 15, 2024

Author Stephen Turner

This week’s recap highlights an AI agent for automated multi-omic analysis (AutoBA), rapid species-level metagenome profiling and containment (sylph), a review on genome-wide association analysis beyond SNPs, private information leakage from scRNA-seq count matrices, and a method to “unlearn” viral knowledge in protein language models as a means to develop safe PLM-based variant effect analysis (PROEDIT). Others that caught my attention include

TIL, Python, Biology, English

Build a Python CLI with Click+Cookiecutter

https://doi.org/10.59350/zsq9k-6xs17

Published November 10, 2024

Author Stephen Turner

In the spirit of Learning in Public, I wanted an excuse to explore (1) click for creating command line interfaces, (2) Cookiecutter project templates, and (3) modern tools in the Python packaging ecosystem. If you’re primarily an R developer like me, I recently wrote about resources for getting better at Python for R users.

Papers, Biology, English

Weekly Recap (Nov 2024, part 1)

https://doi.org/10.59350/4a13p-nky04

Published November 8, 2024

Author Stephen Turner

This week's recap highlights a new pipeline for metagenome quality assessment and taxonomic annotation (MAGFlow &

Nextflow, Biology, English

Nextflow Summit Barcelona 2024

https://doi.org/10.59350/xkn7n-zep24

Published November 4, 2024

Author Stephen Turner

I just returned from a week in Barcelona where I attended the Nextflow Summit and nf-core hackathon, and I can hardly contain my excitement for the near-term future of bioinformatics, computational biology, and open science in general.

Papers, Biology, English

Weekly Recap (Oct 2024, part 4)

https://doi.org/10.59350/xhh6j-z5b67

Published October 25, 2024

Author Stephen Turner

This week’s recap highlights protein design with RoseTTAFold, surveillance with wastewater sequencing, T2T human genomes, Vitessce for visualization of multimodal spatial single-cell data, and Taxometer for taxonomic classification of metagenomics contigs.

R, Python, Biology, English

Python for R users

https://doi.org/10.59350/nw1ga-79906

Published October 21, 2024

Author Stephen Turner

A Google search for “R vs Python” returns thousands of hits across sites like Reddit, IBM, Datacamp, Coursera, Kaggle, and many others. A quick Google Trends analysis shows that this search query has grown steadily over the last decade. Any real data scientist would agree that this argument is silly, that the right answer is to use the best tool for the job. What’s “best” isn’t always easy to answer.

Papers, Biology, English

Weekly Recap (Oct 2024, part 3)

https://doi.org/10.59350/dy8w5-g3p74

Published October 18, 2024

Author Stephen Turner

This week’s recap highlights a new Nextflow workflow for calculating polygenic scores with adjustments for genetic ancestry, a paper demonstrating that whole-exome sequencing plus imputation on more samples is more powerful than whole-genome sequencing for finding trait-associated variants, a new deep-learning-based splice site predictor that improves spliced alignments, a new method for accurate community profiling of large metagenomic datasets, and

Paired Ends
