InformatikEnglischHugo

rOpenSci - open tools for open science

rOpenSci - open tools for open science
Open Tools and R Packages for Open Science
StartseiteJSON-Feed
language
APIBiodiversitySpeciesOccurrenceInformatikEnglisch
Veröffentlicht
Autor Karthik Ram

UPDATE: mapping functions are in a separate package now (mapr). Examples that do mapping below have been updated. The rOpenSci projects aims to provide programmatic access to scientific data repositories on the web. A vast majority of the packages in our current suite retrieve some form of biodiversity or taxonomic data.

APIClimateInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

We recently pushed the first version of rnoaa to CRAN - version 0.1. NOAA has a lot of data, some of which is provided via the National Climatic Data Center, or NCDC. NOAA has provided access to NCDC climate data via a RESTful API - which is great because people like us can create clients for different programming languages to access their data programatically.

APIReproducible ResearchDataArchivingInformatikEnglisch
Veröffentlicht
Autor Thomas J. Leeper

Reproducible research involves the careful, annotated preservation of data, analysis code, and associated files, such that statistical procedures, output, and published results can be directly and fully replicated. As the push for reproducible research has grown, the R community has responded with an increasingly large set of tools for engaging in reproducible research practices (see, for example, the ReproducibleResearch Task View on CRAN).

APITaxizeTaxonomyInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

We just released a new version of taxize - version 0.2.0. This release contains a number of new features, and bug fixes. Here is a run down of some of the changes: First, install and load taxize install.packages("rgbif") library(taxize) New things New functions: class2tree Sometimes you just want to have a visual of the taxonomic relationships among taxa.

TaxonomyBiodiversityInformaticsInformatikEnglisch
Veröffentlicht
Autor Karthik Ram

Data on more than 10,000 species of ants recorded worldwide are available through from California Academy of Sciences’ AntWeb, a repository that boasts a wealth of natural history data, digital images, and specimen records on ant species from a large community of museum curators.

APIGBIFInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

rgbif is an R package to search and retrieve data from the Global Biodiverity Information Facilty (GBIF). rgbif wraps R code around the [GBIF API][gbifapi] to allow you to talk to GBIF from R. We just pushed a new verion of rgbif to cran - v0.5.0. Source and binary files are now available on CRAN. There are a few new functions: count_facet, elevation, and installations. These are described, with examples, below.

APIEOLInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

In a recent blog post we discussed caching calls to the web offline, on your own computer. Just like you can cache data on your own computer, a data provider can do the same thing. Most of the data providers we work with do not provide caching. However, at least one does: EOL, or Encyclopedia of Life.

HackathonMeetingsUnconfUnconf14InformatikEnglisch
Veröffentlicht
Autor Karthik Ram

Our team has been cranking out a large number of tools over the past several months. As regular readers are aware, our software packages provide programmatic access to a diverse and extensive trove of scientific data. More recently we’ve expanded our efforts to build more general purpose and cross-domain tools.

APIInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

I’ve recently heard the idea of “offline first” via especially Hood.ie. We of course don’t do web development, but primarily build R interfaces to data on the web. Internet availablility is increasinghly ubiqutous, but there still are times and places where you don’t have internet, but need to get work done.

TaxonomyMuseumsBiodiversityInformaticsInformatikEnglisch
Veröffentlicht
Autor Karthik Ram

Natural history museums have long been valuable repositories of data on species diversity. These data have been critical for fostering and shaping the development of fields such as biogeography and systematics.

TextminingAPIInformatikEnglisch
Veröffentlicht
Autor Scott Chamberlain

A number of the APIs we interact with (e.g., PLOS full text API, and USGS’s BISON API in rplos and rbison, respectively) expose Solr endpoints. Solr is an Apache hosted project - it is a powerful search server.