June 19, 2014
Douglas Santry & Kaladhar Voruganti.
In this paper we describe a storage system called Violet that efficiently marries fine-grained host side data management with capacity optimized backend disk systems.
Currently, for efficiency reasons, real-time analytics applications are forced to map their in-memory graph like data structures on to columnar databases or other intermediate disk friendly data structures when they are persisting these data structures to protect them from node failures. Violet provides efficient fine-grained end-to-end data management functionality that obviates the need to perform this intermediate mapping. Violet presents the following two key innovations that allow us to efficiently do this mapping between the fine-grained host side data structures and capacity optimized backend disk system: 1) efficient identification of updates on the host that leverages hardware in-memory transaction mechanisms and 2) efficient streaming of fine-grained updates on to a disk using a new data structure called Fibonacci Array.
In Proceedings of the USENIX Annual Technical Conference (ATC 2014)
A link to the paper can be found at: https://www.usenix.org/system/files/conference/atc14/atc14-paper-santry.pdf