ChimieAnglaisJekyll

Jeremy Monat, PhD

Jeremy Monat, PhD
Scientific software developer
Page d'accueilFlux Atom
language
ChimieAnglais
Publié

In cheminformatics, the typical way of representing a molecule is with a SMILES string such as CCO for ethanol. A SMILES string can be converted into a molecular graph, which can be used to determine molecular structure and related properties. However, there are still cases where the molecular formula such as C 2 H 6 O is useful.

ChimieAnglais
Publié

In a previous post, I revisited Wiener’s paper predicting alkanes’ boiling points using modern cheminformatics tools. This follow-up post refits the data with modern mathematical tools to check how well the literature parameters, and the current parameters optimized here, fit the data. Wiener and Egloff’s works are impressive for using cheminformatics parameters that model physical data with simple relationships.

ChimieAnglais
Publié

Harry Wiener was “a pioneer in cheminformatics and chemical graph theory”. In his 1947 Journal of the American Chemical Society article “Structural Determination of Paraffin Boiling Points”, he introduced the path number $\omega$ “as the sum of the distances between any two carbon atoms in the molecule, in terms of carbon-carbon bonds”, which is now known as the Wiener index.

ChimieAnglais
Publié

This utility reports whether the starting materials are commercially available for a set of synthesis targets given reactions. You give it your synthesis targets and the reaction to create each, it determines the starting materials, checks whether they are commercially available, and tells you whether each target is accessible–whether all its starting materials are commercially available.

ChimieAnglais
Publié

In drug discovery, the lead optimization step often involves creating analogues of a hit (a promising compound which produces a desired result in an assay) to optimize selectivity and minimize toxicity. Because it is typically easier to chemically modify the periphery of the molecule (for example the functional groups) than the scaffold, it is helpful to compare the groups off of the common scaffold.

ChimieAnglais
Publié

This example uses machine learning to predict the lipophilicity of compounds. Lipophilicity measures how well a compound dissolves in non-polar media such as fats and lipids. So it’s important for drugs that are delivered orally (for example, via a pill) because the active ingredient needs to be absorbed into the lipids of biological membranes.

ChimieAnglais
Publié

When analyzing a set of molecules, you might want to find the maximum common substructure (MCS) match between them. This utility function SmilesMCStoGridImage does that for a set of molecules specified by SMILES, displays the SMARTS substructure as a molecule, and displays all the molecules in a grid with that substructure highlighted and aligned.