rOpenSci - open tools for open science
Open Tools and R Packages for Open Science
Community, Meetings, Computer and Information Sciences, English
Published

rOpenSci is holding our annual staff and leadership meeting in Vancouver, so we’re taking the opportunity to share what we do and, if you’re interested, how you can get involved. Join us for a series of 7 short talks and demos followed by informal networking over snacks & refreshments. rOpenSci is a non-profit initiative that promotes open and reproducible research using shared data and reusable software.

Ruby, Http, Mocking, Request, Crul, Computer and Information Sciences, English
Published
Author: Scott Chamberlain

webmockr
webmockr is an R library for stubbing and setting expectations on HTTP requests. It is a port of the Ruby gem webmock. webmockr works by plugging in to another R package that does HTTP requests. It currently works only with crul, but we plan to add support for curl and httr later.

Fellowships, Funding, Computer and Information Sciences, English
Published
Author: Karthik Ram

rOpenSci’s mission is to enable and support a thriving community of researchers who embrace open and reproducible research practices as part of their work. Since our inception, one of the mechanisms through which we have supported the community is by developing high-quality open source tools that lower barriers to working with scientific data.

Packages, Tesseract, Images, OCR, Tech Notes, Computer and Information Sciences, English
Published

Earlier this month we released a new version of the tesseract package to CRAN. This package provides R bindings to Google’s open source optical character recognition (OCR) engine Tesseract. Two major new features are support for hOCR and support for the upcoming Tesseract 4.
hOCR output
Support for hOCR output was requested by one of our users on GitHub.

Community, Interviews, Rprofile, Computer and Information Sciences, English
Published
Authors: Sean Kross, Kelly O'Briant

[This interview occurred at the 2017 rOpenSci unconference] SK: I’m Sean Kross, I’m the CTO of the Johns Hopkins Data Science Lab. Today I’m interviewing Julia Stewart Lowndes. Julia, what is your current preferred job title? JSL: I’m calling myself a marine data scientist - I’m the Science Program Lead for the Ocean Health Index.

Community, Meetings, Unconf, Unconf18, Computer and Information Sciences, English
Published

For a fifth year running, we are excited to announce the rOpenSci unconference, our annual event loosely modeled on Foo Camp. rOpenSci unconferences have a rich history. You can get a feel for them by reading collected stories about people and projects from unconf17.

Community, Software, Software Peer Review, Packages, Drake, Computer and Information Sciences, English
Published

The drake R package is a pipeline toolkit. It manages data science workflows, saves time, and adds more confidence in reproducibility. I hope it will impact the landscapes of reproducible research and high-performance computing, but I originally created it for different reasons. This post is the prequel to drake’s inception. There was struggle, and drake was the answer.
Dissertation frustration
My dissertation project was intense.

Databases, Couchdb, Mongo, Elasticsearch, Redis, Computer and Information Sciences, English
Published
Author: Scott Chamberlain

DBI
What is DBI? DBI is an R package. It defines an interface to relational database management systems (R/DBMS) that other R packages build upon to interact with a specific relational database, such as SQLite or PostgreSQL.
NoSQL
NoSQL databases are a very broad class of databases that can include document databases such as CouchDB and MongoDB, key-value stores such as Redis, and more.

Text Mining, Fulltext, Data, Journals, Open Access, Computer and Information Sciences, English
Published
Author: Scott Chamberlain

The problem
Text-mining - the art of answering questions by extracting patterns, data, etc. out of the published literature - is not easy. Publishers make it incredibly difficult. The vast majority of publicly funded research across the globe is published in paywalled journals.