Published in chem-bla-ics

Open Science doesn’t make publishing easier. That that’s all for the better: our research efforts are complex, so why should the publishing be. Sure, I am not talking about references formatting or moving the Methods section to the right location, or some silly statement that all authors agree with the manuscript when you are the only author. No, let’s talk about data. What should you publish? How, and when?

References

PSnpBind: a database of mutated binding site protein–ligand complexes constructed using a multithreaded virtual screening workflow

Published in Journal of Cheminformatics

AbstractA key concept in drug design is how natural variants, especially the ones occurring in the binding site of drug targets, affect the inter-individual drug response and efficacy by altering binding affinity. These effects have been studied on very limited and small datasets while, ideally, a large dataset of binding affinity changes due to binding site single-nucleotide polymorphisms (SNPs) is needed for evaluation. However, to the best of our knowledge, such a dataset does not exist. Thus, a reference dataset of ligands binding affinities to proteins with all their reported binding sites’ variants was constructed using a molecular docking approach. Having a large database of protein–ligand complexes covering a wide range of binding pocket mutations and a large small molecules’ landscape is of great importance for several types of studies. For example, developing machine learning algorithms to predict protein–ligand affinity or a SNP effect on it requires an extensive amount of data. In this work, we present PSnpBind: A large database of 0.6 million mutated binding site protein–ligand complexes constructed using a multithreaded virtual screening workflow. It provides a web interface to explore and visualize the protein–ligand complexes and a REST API to programmatically access the different aspects of the database contents. PSnpBind is open source and freely available at https://psnpbind.org.

Organic ChemistryFOS: Chemical sciencesCheminformatics

ChemPedia as RDF

Published

ChemPedia was a project by Rich Apodaca for crowd sourcing names of chemical compounds. Users were able to contribute names and up- or downvote names. When the service was discontinued, Rich released the data under the CCZero license.

100708 NanomaterialsFOS: Nanotechnology100713 Nanotoxicology, Health and Safety30304 Physical Chemistry of MaterialsFOS: Chemical sciences

NanoWiki 5

Published

New release with more JRC nanomaterials annotated with ENM ontology terms, and data from a 2013 NanoQSAR study from Small by Lin et al. on 24 metal oxides.

BiochemistryPharmacology39999 Chemical Sciences not elsewhere classifiedFOS: Chemical sciencesSociology

MOESM1 of PubChemRDF: towards the semantic annotation of PubChem compound and substance databases

Published
Authors Gang Fu, Colin Batchelor, Michel Dumontier, Janna Hastings, Egon Willighagen, Evan Bolton

Additional file 1. The supporting information for the paper entitled: PubChemRDF: towards the semantic annotation of PubChem compound and substance databases.