The Systems Biology Kbase pilot hardware, named kandinsky, is a new machine designed for optimal support of the Hadoop architecture and runtime. The original recommendation made by the science advisory board focused on examining the MapReduce programming paradigm and its applicability to bioinformatics applications. As a result, the hardware configuration includes more than 0.5 petabytes of storage on local nodes, managed by the Hadoop Distributed File System (HDFS).
In addition to supporting Hadoop-based applications, the system will add support for virtualization. As experience accumulates, it will become clearer how well support for virtualization and for the execution of remotely created images coexists with the Hadoop runtime on the same system.
The system was purchased under the Knowledgebase R&D project, which is sponsored by the Office of Biological and Environmental Research in the DOE Office of Science with American Recovery and Reinvestment Act of 2009 funding and is performed at Oak Ridge National Laboratory (ORNL). ORNL is managed by UT-Battelle, LLC, for the U.S. Department of Energy under contract DE-AC05-00OR22725.