GStore: On Demand Atomicity in Key-Value Stores

Key-Value data stores have been extremely successful in supporting large applications with huge amount of data and large number of concurrent requests. But Key-Value stores provide transactional guarantees only at the granularity of single rows which is restrictive for some applications. This project aims to build on the Key-Value data stores to provide transactional guarantees over dynamic groups of keys specified by the applications on demand.

Cloud computing has emerged as a preferred platform for deploying scalable web-applications. With the growing scale of these applications and the data associated with them, scalable data management systems form a crucial part of the cloud infrastructure. Key-Value stores – such as Bigtable, PNUTS, Dynamo, and their open source analogues– have been the preferred data stores for applications in the cloud. In these systems, data is represented as Key-Value pairs, and atomic access is provided only at the granularity of single keys. While these properties work well for current applications, they are insufficient for the next generation web applications – such as online gaming, social networks, collaborative editing, and many more – which emphasize collaboration. Since collaboration by definition requires consistent access to groups of keys, scalable and consistent multi-key access is critical for such applications.

We propose the Key Group abstraction that defines a relationship between a group of keys and is the granule for on-demand transactional access. This abstraction allows the Key Grouping protocol to collocate control for the keys in the group to allow efficient access to the group of keys. Using the Key Grouping protocol, we design and implement G-Store which uses a key-value store as an underlying substrate to provide efficient, scalable, and transactional multi key access.

Project Status: 
Active