InformatikEnglischHugo

rOpenSci - open tools for open science

rOpenSci - open tools for open science
Open Tools and R Packages for Open Science
StartseiteJSON-Feed
language
CommunityMeetingsROpenSci TeamCefpInformatikEnglisch
Veröffentlicht
Autor Stefanie Butland

Next week I’ll be in Washington DC to meet my peers in research community management as part of the inaugural class of the AAAS Community Engagement Fellowship Program! The program, funded by the Alfred P. Sloan Foundation, has a mission to improve community building and collaboration in scientific organizations and research collaborations by providing a year of training and support to a cohort of scientific community managers.

BiodiversityGBIFTech NotesInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

finch has just been released to CRAN (binaries should be up soon). finch is a package to parse Darwin Core files. Darwin Core (DwC) is: GBIF (Global Biodiversity Information Facility) is the biggest holder of biodiversity data.

CommunityCommunity CallEventsGovernanceInformatikEnglisch
Veröffentlicht
Autor Stefanie Butland

Our Community Call on December 15th covered a big topic in tech communities: “How do I create a code of conduct for my event/lab/codebase?”. Here, we cover some of the key themes and considerations that arose from the discussion and point to curated resources and examples to follow when developing a code of conduct (CoC) for your community. Three guest speakers shared different perspectives.

CommunityFellowshipsInterviewsInformatikEnglisch
Veröffentlicht
Autoren Stefanie Butland, Karthik Ram

rOpenSci’s overarching mission is to promote a culture of transparent, open, and reproducible research across various scientific communities. All of our activities are geared towards lowering barriers to participation, and building a community of practitioners around the world.

PackagesPdftoolsTech NotesInformatikEnglisch
Veröffentlicht
Autor Jeroen Ooms

This week we released version 1.0 of the ropensci pdftools package to CRAN. Pdftools provides utilities for extracting text, fonts, attachments and other data from PDF files. It also supports rendering of PDF files into bitmap images. This release has a few internal enhancements and fixes an annoying bug for landscape PDF pages. The version bump to 1.0 signifies that the package has undergone sufficient testing and the API is stable.

PackagesTesseractTech NotesInformatikEnglisch
Veröffentlicht
Autor Jeroen Ooms

A few weeks ago we announced the first release of the tesseract package: a high quality OCR engine in R. We have now released an update with extra features. Installing Training Data As explained in the first post, the tesseract system is powered by language specific training data. By default only English training data is installed. Version 1.3 adds utilities to make it easier to install additional training data.

GeospatialInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

Geospatial data - data embedded in a spatial context - is used across disciplines, whether it be history, biology, business, tech, public health, etc. Along with community contributors, we’re working on a suite of tools to make working with spatial data in R as easy as possible. If you’re not familiar with geospatial tools, it’s helpful to see what people do with them in the real world.

HttpTech NotesInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

HTTP, or Hypertext Transfer Protocol is a protocol by which mostof us interact with the web. When we do requests to a website in a browseron desktop or mobile, or get some data from a server in R, all of that isusing HTTP.

PackagesTesseractInformatikEnglisch
Veröffentlicht
Autor Jeroen Ooms

Optical character recognition (OCR) is the process of extracting written or typed text from images such as photos and scanned documents into machine-encoded text. The new rOpenSci package tesseract brings one of the best open-source OCR engines to R. This enables researchers or journalists, for example, to search and analyze vast numbers of documents that are only available in printed form.

APITech NotesInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

A new package crul ison CRAN. crul is another HTTP client for R, but is relatively simplifiedcompared to httr, and is being builtto link closely with webmockr and vcr. webmockr andvcr are packages ported from Ruby’s webmockand vcr, respectively.They both make mocking HTTP requests really easy. A major use case for mocking HTTP requests is for unit tests.