
chem-bla-ics

Chemblaics (pronounced chem-bla-ics) is the science that uses open science and computers to solve problems in chemistry, biochemistry and related fields.
Tags: Semweb, Kasabi, Chemistry, English

Kasabi is a new RDF hosting service by Talis. It is still in beta, and I have been testing it with the RDF version I created of ChemPedia Substances (the cool, now defunct web service from MetaMolecular for drawing and naming organic molecules). Kasabi makes the RDF data available via a few APIs, depending on which ones the uploader selects. I picked all five of them, just to see how things work.

Tags: ICCS, Chemistry, English

Julio and Gert placed their ICCS 2011 work online, and today I was going through old CDs (see "From the archives: Chemical Web, and the CDK in 2004" and "Chiral Molecules: how cool is the SEM picture?"). I also ran into my ICCS 2005 poster, and because that too dates from before I started blogging, I never posted it online.

Tags: ChEMBL, RDF, Chemistry, English

Update 2021-02: this post is still the second-most read post on my blog. Welcome! Some updates: Ammar Ammar in our BiGCaT group has set up a new SPARQL endpoint. Please use it, and tweet, blog, or otherwise let others know how you use the ChEMBL RDF. Since this post I have blogged a lot more about ChEMBL. Update: this work is now written down in this paper. I’m having a really bad month, as you can see from the number of posts.

Tags: GitHub, Chemistry, English

Some time ago, the brilliant GitHub people gave me the following tip. Rajarshi is lazy, and might find it interesting. By appending .patch to the commit URL, a commit can be downloaded as a patch. That way, developers can fetch it with wget or curl and apply it locally with git am, without having to fetch the full repository.
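The tip boils down to a one-line URL rewrite, so here is a minimal Python sketch of it. The function names and the example repository URL are made up for illustration; only the ".patch" suffix comes from the tip itself.

```python
import urllib.request


def patch_url(commit_url):
    """Return the URL that serves a GitHub commit as a patch.

    A trailing slash is dropped first so that ".patch" attaches
    directly to the commit hash.
    """
    return commit_url.rstrip("/") + ".patch"


def download_patch(commit_url, dest_path):
    """Download the patch for commit_url and save it to dest_path.

    This needs network access; it plays the role of wget or curl.
    """
    with urllib.request.urlopen(patch_url(commit_url)) as response:
        data = response.read()
    with open(dest_path, "wb") as out:
        out.write(data)
    return dest_path


if __name__ == "__main__":
    # Hypothetical commit URL, used only to show the rewrite.
    example = "https://github.com/example-user/example-repo/commit/0123abc"
    print(patch_url(example))
```

The saved file can then be applied with git am, as described above.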

Tags: Groovy, Chemistry, RDF, JSON, English

Mark’s new CCO/RDF hosting functionality (see also my post two days ago) requires RDF/XML format, so I updated my code to convert the Chempedia Substances data into RDF/XML instead of N3 (I have asked Rich to put a new download link online). This is the Groovy code I used:

    // JsonSlurper is used below and is not a default Groovy import
    import groovy.json.JsonSlurper
    import groovy.xml.MarkupBuilder
    import groovy.util.IndentPrinter

    input = new File("substances.json")
    json = new JsonSlurper().parse(input)
    def writer = new StringWriter()
    def
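Since the excerpt above is cut off, here is a rough, self-contained sketch of the same idea (JSON records in, RDF/XML out), written in Python rather than the post's Groovy. It is not the original script: the record fields ("id", "name"), the example.org namespace, and the function name are all assumptions made for illustration, because the actual layout of substances.json is not shown here.

```python
import json
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
# Hypothetical vocabulary namespace; the real data would use its own.
EX_NS = "http://example.org/chempedia#"


def substances_to_rdfxml(json_text):
    """Convert a JSON list of records into an RDF/XML string.

    Each record is assumed (hypothetically) to look like
    {"id": ..., "name": ...}.
    """
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("ex", EX_NS)
    root = ET.Element("{%s}RDF" % RDF_NS)
    for record in json.loads(json_text):
        # One rdf:Description per substance, identified by rdf:about.
        desc = ET.SubElement(root, "{%s}Description" % RDF_NS)
        desc.set("{%s}about" % RDF_NS, EX_NS + str(record["id"]))
        name = ET.SubElement(desc, "{%s}name" % EX_NS)
        name.text = record["name"]
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    sample = '[{"id": 1, "name": "acetic acid"}]'
    print(substances_to_rdfxml(sample))
```

Building the tree with an XML library instead of concatenating strings means that escaping and namespace declarations are handled for you, which is also the reason for using MarkupBuilder on the Groovy side.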

Tags: Oscar, Textmining, Chemistry, English

Oscar uses a Maximum Entropy Markov Model (MEMM) based on n-grams. Peter Corbett has written this up (doi:10.1186/1471-2105-9-S11-S4). So, it basically is statistics once more. If you really want a proper bioinformatics education, do your PhD at a (proteo)chemometrics department. N-grams are word parts of n characters. For example, the trigrams of acetic acid include ace, cid, tic, eti, and aci.
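To make the trigram example concrete, here is a small Python sketch that slides a window of n characters over each word. This only illustrates the definition given above. It is not Oscar's feature extraction, which may pad or weight n-grams differently, and the function name is made up.

```python
def word_ngrams(text, n=3):
    """Return the character n-grams of each whitespace-separated word.

    Words shorter than n characters yield no n-grams.
    """
    ngrams = []
    for word in text.split():
        # Slide a window of width n across the word.
        for i in range(len(word) - n + 1):
            ngrams.append(word[i:i + n])
    return ngrams


if __name__ == "__main__":
    print(word_ngrams("acetic acid"))
    # prints ['ace', 'cet', 'eti', 'tic', 'aci', 'cid']
```

The five trigrams listed above all show up in the output. The sixth, cet, completes the set for these two words.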

Tags: Oscar, ChemicalTagger, Beilstein, Chemistry, English

The two earlier posts in this series showed screenshots of results of Oscar, but the title also promised results by Lezan’s ChemicalTagger. Sam helped with getting the HTML pages online via the Cambridge Hudson installation. Where Oscar finds named entities (chemical compounds, processes, etc.), ChemicalTagger finds roles, like solvent, acid, base, catalyst. Roles are properties of chemical compounds in certain situations.