
Hard disk drives (HDDs) or solid state drives (SSDs) are today's most popular storage solutions. Storage devices are essential to all computing systems that store user data from desktops, to notebooks and Ultrabooks™ to data centers. Our evaluation demonstrates an impressive, up to 800 Mbps transfer throughput for the new GridFTP service. Moreover, to demonstrate the high-throughput properties of the proposed architecture, we prototype a GridFTP server based on ThriftStore. To this end, we develop analytical and simulation-based tools to evaluate the impact of system characteristics (e.g., bandwidth limitations on the durable and the volatile nodes) and design choices (e.g., the replica placement scheme) on data availability and the associated system costs (e.g., maintenance traffic). Although integrating these components has the potential to offer a unique combination of high throughput and durability at a low cost, a number of concerns need to be addressed to architect and correctly provision the system. On the other hand, the volatile nodes provide a high-throughput front-end.


On the one hand, the durable storage forms a back end that enables the system to restore the data the volatile nodes may lose. We propose ThriftStore, a storage architecture that integrates two types of components: volatile, aggregated storage and dedicated, yet low-bandwidth durable storage. This paper explores the feasibility of a storage architecture that offers the reliability and access performance characteristics of a high-end system, yet is cost-efficient.
