KNIME® — the Konstanz Information Miner — is a modular, extendable data exploration platform to visually create data pipelines. KNIME® is based on the Java and the Eclipse platform and, through its modular API, easily extensible. When desired, custom nodes and types can be implemented in KNIME® within hours thus extending KNIME® to comprehend and provide first-tier support for highly domain-specific data. This modularity and extensibility permits KNIME® to be employed in commercial production environments as well as teaching and research prototyping settings.
BioSolveIT is a Partner of KNIME® and in the process of making its software-tools accessible to all users via KNIME®. Note that these interfaces are distributed separately of KNIME® and available free of charge. KNIME® itself is distributed under open source license and can be used freely as long as the software or extensions built into it are not distributed for commercial gain (see details on the KNIME® website).
With the use of BioSolveIT KNIME® nodes retrieving molecules from chemical spaces is now very easy. However, this is usually not the end of story but the beginning of a virtual hit triage – leading you to those compounds of biggest interest. The REALizer-workflow gives you several post-processing options to quickly get an overview of the best compounds. After searching either the REAL Space or the KnowledgeSpace you can see:
The workflow includes example data, where Ibuprofen is searched inside the REAL Space. After the search in the chemical space four alternative ways of post-processing can be executed. Once you try you will see how different results of your search will be top-ranked at the end of the workflow and provide you with new ideas.
Chemical space searching combines the goals of de novo design and synthetic accessibility. The BioSphere workflow provides you with all tools to build and search chemical spaces based on user-defined reactions and reagents. It consists of two parts:
This workflow exemplifies building incredibly huge spaces and searching them within minutes! It gives you access to reaction definitions, which you may use to supplement the space. Filter differently or use your own reagents to customize the space according to your needs. And benefit from the best in class tools to explore it in your research projects.
The workflow includes example data (where a space is built and searched). The reactions used to construct the space can be found in the file 'reactionoverview.pdf'. The compound CIU was used as a query. This is a ligand of the soluble Epoxide Hydrolase (sEH, pdb code 1vj5). sEH is part of the arachidonic acid cascade and plays a role in several diseases, including hypertension, cardiac hypertrophy, arteriosclerosis, brain and heart ischemia, cancer and pain. Some compounds targeting sEH already reached clinical studies but no drug was approved yet. Known inhibitors of sEH are ureas like the query molecule CIU. Interestingly, under the results are amides, which are known inhibitors of sEH, too.
BioSolveIT-STORM is a virtual library builder and docking-workflow designed to explore binding sites MedChem-style. It supports:
Starting from a template small molecule or fragment in a PDB or proprietary x-ray structure, the user first defines the common element of the library (the unchanging core element), then selects from a pre-installed list the chemical reaction that will create the library (e.g. a particular amide formation). The workflow then docks, scores and ranks all the members of the virtual library, based on the position of the unchanging core. The molecules predicted to be the most potent are then presented for visual inspection in SeeSAR.
The workflow includes example data (where all docking and scoring has been pre-performed). The example shown here is based on a fragment of a known active against Indoleamine 2,3-dioxygenase 1 (IDO1). STORM outputs a number of interesting new ideas for fragment elaboration.
IDO1 is considered as a promising target for the treatment of several diseases, including neurological disorders and cancer. The idea shown on the left uses the Chan-Lam coupling of a Pyrrolidine and might serve as an interesting start point for further optimization.
The MedChemWizard is a KNIME® workflow designed to assist medicinal chemists with idea generation, ligand design and lead optimization using a number of common functional group transformations and medchem rules-of-thumb. The user simply draws in a hit or lead structure, and the Wizard will enumerate new ideas based on bioisosteric replacements, methylene shuffling, diamine morphing, aromatic ring analogs, aromatic ring substitutions, reversed amides/sulfonamides, new virtual rings, protecting metabolic soft-spots and the replacement of hydrogens with solubilising and other groups.
It integrates with BioSolveIT's SeeSAR KNIME® nodes to provide an estimated binding affinity and a predicted binding mode for the new ligands allowing the user to interactively inspect the quality of the new ideas and to select those of greatest interest.
It is intended that this workflow will help medicinal chemistry teams avoid spending time synthesizing and testing poor (low potency) compounds, and facilitate scaffold hopping/morphing into new chemical and IP space — together, these factors should help improve overall odds of project success!
The workflow includes example data (where all docking and scoring has been pre-performed). The example takes a fragment hit active against Protein Kinase B and outputs a large number of interesting new ideas for fragment elaboration.
The idea with ID number 28 is a very interesting spiro fused diamine ring — it is possible that a spiro connected ring system such as this will lie outside of the scope of existing patent claims, and might therefore serve as an interesting start point for further optimization.
Very impressive!, Prof. Michael Berthold, KNIME®-creator and founder of KNIME GmbH
SeeSAR is BioSolveIT's next-generation modeling tool for all drug designers. With an intuitive interface, users can quickly and easily load and visualize protein-ligand complexes alongside an estimated binding affinity as calculated by our unique, proprietary HYDE scoring technology. Additional molecular properties are calculated directly within SeeSAR, whilst custom user properties and descriptors can be included via the SDF Inserter node, allowing the user to interactively sort and filter new ideas. In SeeSAR, ligand atoms are individually depicted not only by atom type, but also by their contribution to binding affinity, allowing the user to rapidly identify potential sites of modification to enhance potency.
Our KNIME® node for predicting small molecule binding affinities (Assess Affinity with HYDE in SeeSAR) allows users to easily access our unique, proprietary HYDE scoring technology non-interactively. Using a pre-prepared binding site, input ligands (SDF or mol2 format) are scored — the outputs are an active-site optimized ligand conformation (SDF) and the predicted binding affinity. These results can easily be sorted by predicted affinity as the node outputs both higher and lower numerical limits of the range.
Performing docking calculations is a routine, yet quite delicate task in every structure-based drug discovery process. We automated the entire process and thereby simplified the UI for you tremendously. All you need to do is provide a PDB and a reference ligand to define a binding site and you are ready to go.
Similarity searching is a task that needs to be performed routinely and oftentimes in conjunction with other tools in a workflow. The FTrees Similarity node compares the molecules in the pipeline to separately provided query molecules. Additionally you may
Feature Trees Fragment Spaces (FTrees-FS) is a unique technology, which enables the user to perform similarity searches in innumerably large compound spaces. The KNIME®-module comprises the search engine. You may use it for example to perform similarity searches in over 12 billion molecules of the so-called KnowledgeSpace™, which based on chemistry protocols to ensure it comprises synthesizable molecules. The KnowledgeSpace™ can be downloaded free of charge here.
This is the package of choice for generating chemical spaces such as the ones to be searched by FTrees-FS. The package consists of three tools:
The Reaction Library Synthesizer takes RXN or SMIRKS as input and builds so-called single-library-spaces (cf. below).
The Fragment Space Merger takes single-library-spaces (cf. above) and puts them together in a unified fashion which enables searching with FTrees-FS.
The Reaction Library Enumerator works similar to the Synthesizer but enumerates the products of your virtual reaction.
Alignments of small molecules can give much insight into structure activity relationships. This very intuitive interface to our FlexS program allows piping in query and a database to screen.
The generation of trustworthy and realistic 3D coordinates for small organic molecules is of great importance in structure-based drug design and cheminformatics. Our Coord3D tool quickly generates high quality 3D coordinates for small molecules based on gold-standard torsion angle data extracted directly from the Cambridge Crystallographic Data Centre's CSD database. Based on 2D inputs (smiles, SD or mol2 format), the tool will convert and output in 3D in the format of your choice.
The Naomi converter contains a super fast compute-engine that reads and writes compounds in a number of standard file formats. It is also capable of calculating basic molecule properties and to use them for filtering. The engine is able to run in parallel on multi-core machines.
Naomi can convert molecules between different representations (SDF, Mol2, and SMILES) and perform substructure-mappings. In addition it is able to generate high quality 2D images in different graphic file formats (SVG, PNG). These can also be used in KNIME® reports as Naomi can be registered as molecule renderer within KNIME®.
The 2D depictor is based on the latest 2D drawing technology from the Center for Bioinformatics in Hamburg (ZBH).
Our suite of KNIME® nodes also comprises two license-free auxiliary tools:
Both tools, the BioSolveIT Interactive Table and the BioSolveIT Viewer support the KNIME® HiLite mechanism. Therefore they seamlessly integrate into the KNIME® workspace.
This is a step-by-step introduction that explains for beginners how to use the BioSolveIT software in KNIME® all the way from the download to a virtual screening workflow executed in parallel:
The download mechanism follows the install/update philosophy of KNIME®.
Simply include this URL to Install New Software:
Under Help → Install New Software, you [Add...] the above URL. Then you select the same URL to Work with, from the drop-down menu. You will then see a listing of the BioSolveIT KNIME® Nodes and Tools. Either checkmark all (recommended!) or individual BioSolveIT interfaces. Then click "Next" and follow the instructions.
|—||KNIME® version 2.11 or later|
|—||only the most recent version of the respective software packages is guaranteed to work with the KNIME® modules|
|—||The following KNIME extensions are required to install the BioSolveIT KNIME® nodes
Please also refer to our documentation how to install the BioSolveIT KNIME® components.
NOTE: In principle any update should seamlessly replace previously installed BioSolveIT KNIME® nodes if these where officially released on our website. However, in some cases this mechanism may fail. Thus, we recommend to first uninstall all prior versions of the BioSolveIT KNIME® nodes before installing the new ones. This can be done very easily if you click on "What is already installed" and follow the instructions.
Alternatively you may use the following links and install the respective package in KNIME® from your local drive: