InformatikEnglischHugo

rOpenSci - open tools for open science

rOpenSci - open tools for open science
Open Tools and R Packages for Open Science
StartseiteJSON-Feed
language
CommunitySoftwareSoftware Peer ReviewHardwareRopenaqInformatikEnglisch
Veröffentlicht
Autor

As you might remember from my blog post about ropenaq, I work as a data manager and statistician for an epidemiology project called CHAI for Cardio-vascular health effects of air pollution in Telangana, India. One of our interests in CHAI is determining exposure, and sources of exposure, to PM2.5 which are very small particles in the air that have diverse adverse health effects.

Data AccessPackagesSpatialGeospatialSoftware Peer ReviewInformatikEnglisch
Veröffentlicht
Autor

The package FedData has gone through software review and is now part of rOpenSci. FedData includes functions to automate downloading geospatial data available from several federated data sources (mainly sources maintained by the US Federal government). Currently, the package enables extraction from six datasets:The National Elevation Dataset (NED) digital elevation models (1 and 1/3 arc-second;

SoftwarePackagesSoftware Peer ReviewCommunityVisdatInformatikEnglisch
Veröffentlicht
Autor

This is a phrase that comes up when you first get a dataset. It is also ambiguous. Does it mean to do some exploratory modelling? Or make some histograms, scatterplots, and boxplots? Is it both? Starting down either path, you often encounter the non-trivial growing pains of working with a new dataset.

Software Peer ReviewReviewerVisdatCommunityPackagesInformatikEnglisch
Veröffentlicht
Autor

Contributing to an open-source community without contributing code is an oft-vaunted idea that can seem nebulous. Luckily, putting vague ideas into action is one of the strengths of the rOpenSci Community, and their package onboarding system offers a chance to do just that.

PackagesTesseractOCRTech NotesInformatikEnglisch
Veröffentlicht
Autor

Last week we released an update of the tesseract package to CRAN. This package provides R bindings to Google’s OCR library Tesseract.install.packages("tesseract") The new version ships with the latest libtesseract 3.05.01 on Windows and MacOS. Furthermore it includes enhancements for managing language data and using tesseract together with the magick package.

PackagesSoftwareImagesMagickInformatikEnglisch
Veröffentlicht
Autor

Last week, version 1.0 of the magick package appeared on CRAN: an ambitious effort to modernize and simplify high quality image processing in R. This R package builds upon the Magick++ STL which exposes a powerful C++ API to the famous ImageMagick library. The best place to start learning about magick is the vignette which gives a brief overview of the overwhelming amount of functionality in this package.

CommunityMeetingsInformatikEnglisch
Veröffentlicht
Autor

You can find members of the rOpenSci team at various meetings and workshops around the world. Come say ‘hi’, learn about how our packages can enable your research, or about our onboarding process for contributing new packages, discuss software sustainability or tell us how we can help you do open and reproducible research. 🔗Where’s rOpenSci?

CommunityMeetingsSoftwareUnconfUnconf17InformatikEnglisch
Veröffentlicht
Autor

Since June, we have been highlighting the many projects that emerged from this year’s rOpenSci Unconf. These projects start many weeks before unconf participants gather in-person. Each year, we ask participants to propose and discuss project ideas ahead of time in a GitHub repo. This serves to get creative juices flowing as well as help people get to know each other a bit through discussion.

DataElasticElasticsearchDatabasesTech NotesInformatikEnglisch
Veröffentlicht
Autor

elastic is an R client for Elasticsearch elastic has been around since 2013, with the first commit in November, 2013. 🔗What is Elasticsearch? If you aren’t familiar with Elasticsearch, it is a distributed, RESTful search and analytics engine.It’s similar to Solr. It falls in the NoSQL bin of databases, holding data in JSON documents, insteadof rows and columns.

EmldownUnconfUnconf17InformatikEnglisch
Veröffentlicht
Autoren , , , ,

How do you get the maximum value out of a dataset? Data is most valuable when it can easily be shared, understood, and used by others. This requires some form of metadata that describes the data. While metadata can take many forms, the most useful metadata is that which follows a standardized specification. The Ecological Metadata Language (EML) is an example of such a specification originally developed for ecological datasets.

TaxonomySoftwarePackagesTaxizeTaxaInformatikEnglisch
Veröffentlicht
Autor

🔗What is Taxonomy? Taxonomy in its most general sense is the practice and science of classification. It can refer to many things. You may have heard or used the word taxonomy used to indicate any sort of classification of things, whether it be companies or widgets. Here, we’re talking about biological taxonomy, the science of defining and naming groups of biological organisms.