trans
Distributor
trans
contact
impressum
legal notice
print this page
BioSolveIT
The Premier Scientific Solution Provider
The Distributor facilitates distributed computing on large input files. It serves as a pre- and post-processing tool and interfaces with most standard batch-queuing systems.

In high throughput virtual screening, one usually deals with hundred thousands of compounds. On today's compute farms, an efficient, yet simple parallelisation can be achieved decomposing the input file(s) and letting the software process the much smaller chunks on multiple machines in parallel.

The basic concept behind the Distributor is to take the burden of bookkeeping off of the user and let the software automatically make sure that the job gets done in its entirety. Now the Distributor does not re-invent the wheel. Years of development have been spent on sophisticated job-management and batch-queuing systems assuring optimal use of the compute resources by load balancing. Therefore the Distributor does not replace but rather interfaces with most of those standard technologies and additionally provides automated error reporting (for example, through email) and much more.

In short, given:
trans
a large set of input data blocks (items) in one or several files
trans
some arbitrary tool that processes this kind of input
trans
a number of compute nodes
trans
a batch queuing system

Distributor
trans
splits all input files into segments, each with a subset of items
trans
submits small jobs into the batch queuing system
trans
checks the jobs' status
trans
merges the jobs' output
trans
informs the user via email about the current status

features
trans
Virtually no restriction on the type of input data, except that each item is separable in the input file. Even binary data may serve as input!
trans
Predefined file splitter for most standard file formats: mol2, sdf, sln, smiles, and FTrees data types. Plain text files can be split simply line by line or by a regular expression pattern.
trans
Additional file splitters can be easily written and added by user.
trans
The system is highly configurable.
requirements
trans
Python 2.3 or higher
trans
Installed and configured batch queuing system
status & availability
The current version, 1.5.4, was released on 16.12.2004. The release CHANGES-file contains details of the current release features.

Distributor is available for LinuX x86 platform.

If you would like to evaluate Distributor, or lease the software from us, please contact us to obtain a license using the license request form.

Last modified Friday, 03. Feb 2012 15:15 CET by WebMaster