Rogue Scholar Gönderileri

language
MembersBilgisayar ve Bilişim Bilimleriİngilizce
Yayınlandı in DataCite Blog - DataCite
Yazar Maria GouldandMohamad Mostafa

Members are the heart of the DataCite community. Comprising 1600+ research organizations from 60+ countries, members join DataCite to use our infrastructure and services, participate in our governance, and contribute to shared communities of practice. In this post, we wanted to welcome the new organizations that have joined DataCite in the first half of 2025, and highlight a few of their stories.

Diğer Sosyal Bilimlerİngilizce
Yayınlandı in Aaron Tay's Musings about librarianship
Yazar Aaron Tay

Introduction In my last post, I argued that Deep Search—iterative retrieval that blends keyword, semantic, and citation chasing with LLM-based relevance judgments—is the real breakthrough behind today’s “Deep Research” tools. It consistently beats one-shot embedding search in recall/precision, and in hindsight, it’s what I loved all along (the “generation” step just came bundled). The price?

OpenstreetmapCommonsPeer ProductionMappingOsmDiğer Sosyal Bilimlerİngilizce
Yayınlandı in Bastian Greshake Tzovaras

Last weekend was the 21st birthday of OpenStreetMap (OSM), and with some friends we celebrated the occasion with a little mapping party. Our plan was to combine flying drones to collect aerial imagery and collecting street-level imagery with more traditional field mapping. Due to high winds, we mostly ended up with street-level imagery and doing field mapping though, using a variety of tools.

PublishingWikidataScholiaKimya Bilimleriİngilizce
Yayınlandı in chem-bla-ics

The Internet Journal of Chemistry (IJC, issn:1099-8292) was one of the first scientific journals to get published on the world wide web (part of the Internet), see doi:10.1080/00987913.2000.10764578. Issues were published from 1998 to 2004. But because it predates systematic archiving of webpages by libraries, a lot is lost.

AIGPT-5LLMsDoğa Bilimleriİngilizce
Yayınlandı in Chris von Csefalvay
Yazar Chris von Csefalvay

I’ve spent the better part of this weekend putting OpenAI’s latest offerings through their paces - both the newly released open-weight models and GPT-5 itself. Armed with a selection of coding challenges, mathematical problems, and the sort of esoteric research queries that usually separate the wheat from the chaff, I’ve been conducting what amounts to a weekend-long torture test of these systems.

ConferencesDinoCon 2025Fictional PeoplePeople We LikeStinkin' MammalsYeryüzü ve ilgili Çevre Bilimleriİngilizce
Yayınlandı in Sauropod Vertebra Picture of the Week

Mike and I are working on our respective talks for DinoCon 2025 — a timely concern, since Mike presents next Saturday and I’m on next Sunday. My talk will be an adapted and upgraded version of the keynote talk I gave at the Tate Geological Museum’s Annual Summer Conference last summer.

Biyolojik Bilimlerİngilizce
Yayınlandı in Paired Ends

I recently wrote a piece about leaving academia for biotech. I left academia for industry in 2019. I spent four years at a consulting firm before joining Colossal Biosciences. This week I’m returning to the University of Virginia School of Data Science as a tenured associate professor and dean of research. The transition from academia to industry can be tricky, but it’s also increasingly common.

IupacBeilsteinChemblKimya Bilimleriİngilizce
Yayınlandı in chem-bla-ics

A lot is happening. If you have been following this project more closesly, you may have already seen some interesting updates, but I will post it here too. First, a quick recap. In March I started a new Blue Obelisk project to collect CCZero IUPAC names from primary literature (paper still pending). It turned out we can automate that, while legally not violating any laws or licenses.

Large Language ModelAi SearchDiğer Sosyal Bilimlerİngilizce
Yayınlandı in Aaron Tay's Musings about librarianship
Yazar Aaron Tay

Back in 2022, I was hyped about Retrieval-Augmented Generation (RAG).The novelty of seeing a search engine spit out a direct answer — with citations! — in tools like Elicit and Perplexity felt like the future. I even predicted that this “answers-with-citations” model could become the prominent paradigm for academic search. Three years later, that prediction has partly come true.