Publicaciones de Rogue Scholar

language
BiologíaInglés
Publicado in Paired Ends

I recently wrote a piece about leaving academia for biotech. I left academia for industry in 2019. I spent four years at a consulting firm before joining Colossal Biosciences. This week I’m returning to the University of Virginia School of Data Science as a tenured associate professor and dean of research. The transition from academia to industry can be tricky, but it’s also increasingly common.

IupacBeilsteinChemblCiencias QuímicasInglés
Publicado in chem-bla-ics

A lot is happening. If you have been following this project more closesly, you may have already seen some interesting updates, but I will post it here too. First, a quick recap. In March I started a new Blue Obelisk project to collect CCZero IUPAC names from primary literature (paper still pending). It turned out we can automate that, while legally not violating any laws or licenses.

Large Language ModelAi SearchOtras Ciencias SocialesInglés
Publicado in Aaron Tay's Musings about librarianship
Autor Aaron Tay

Back in 2022, I was hyped about Retrieval-Augmented Generation (RAG).The novelty of seeing a search engine spit out a direct answer — with citations! — in tools like Elicit and Perplexity felt like the future. I even predicted that this “answers-with-citations” model could become the prominent paradigm for academic search. Three years later, that prediction has partly come true.

PapersBiologíaInglés
Publicado in Paired Ends

This week’s recap highlights nanoMDBG for metagenome assembly from nanopore reads, the SCassist AI-based workflow for single-cell analysis, discovery and characterization of GxE and GxG effects in a vertebrate model, the PIGEON framework for estimating gene-environment interaction for polygenic traits, and long-read alignment with multi-level parallelism.

BiologyAIBiocurationBirthdayBOSCBiologíaInglés
Publicado in GigaBlog

Birthdays, BOSC, Beatles and Bioinformatics with a Merseybeat Conference season is upon us, and the GigaScience team have just returned from a magical mystery tour to Liverpool. Regular readers will know GigaScience launched at the ISMB (International Conference on Intelligent Systems for Molecular Biology) in 2012, and every year we attend and celebrate our birthday at the meeting.

NewsNews For DevelopersNews For Hosted ClientsKevin StranackPKP PeopleCiencias SocialesInglés
Publicado in Public Knowledge Project
Autor Alejandra Casas Niño de Rivera

After much reflection, the time has come for me to retire, effective at the end of this year. This marks the end of a deeply fulfilling chapter in my life — one filled with purpose, collaboration, and a shared commitment to open knowledge and community-driven progress in open source, open access publishing. I’ve been fortunate to work with so many brilliant people over the years, and I’m extremely proud of what we’ve achieved together.

Lab LifeResearchInformática y Ciencias de la InformaciónAlemán
Autores Heinz Pampel, Ursula Arning, Brigitte Grote, Gesche Wahlen, Gerald Jagusch, Martin Spenger, Jürgen Rohrwild, Robert Strötgen, Christopher O. Khamis

Wie wird aus einem geförderten Pilotprojekt eine dauerhaft tragfähige Infrastruktur? Diese zentrale Frage stellten sich viele der Teilnehmenden des Hands-on-Labs „Vom Drittmittelprojekt über die Community zur etablierten Struktur – Erfahrungsaustausch und Erarbeitung von Empfehlungen“ (Arning et al. 2025) auf dem 9. Bibliothekskongress / der 113. Bibliocon in Bremen am 26.06.2025.

ArtConferencesDinoCon 2025Natalia JagielskaTimelyCiencias de la Tierra y Ciencias Ambientales relacionadasInglés
Publicado in Sauropod Vertebra Picture of the Week

The DinoCon brochure — really a conference guidebook, with schedule, speaker list, vendor list, maps, etc. — is a free download here. Art by Natalia Jagielska. DinoCon is right around the corner, the weekend of August 16-17. The speaker lineup looks fantastic, and the vendor lineup looks like it will execute a Chicxulub on my wallet. On the speaker side, I’m happy to see sauropods getting so much representation.

Informática y Ciencias de la InformaciónInglés
Publicado in iPhylo

I’ve written several times here about the Make Data Count project and its major output to date, the Data Citation Corpus, currently at version 4 (see The fourth release of the Data Citation Corpus incorporates data citations from Europe PMC and additions to affiliation metadata). In June Make Data Count launched a Kaggle Competition with the goal of developing a tool that will process articles (in either PDF or XML format), extract data