A Survey on NoSQL Stores
Tóm tắt
Recent demands for storing and querying big data have revealed various shortcomings of traditional relational database systems. This, in turn, has led to the emergence of a new kind of complementary nonrelational data store, named as NoSQL. This survey mainly aims at elucidating the design decisions of NoSQL stores with regard to the four nonorthogonal design principles of distributed database systems: data model, consistency model, data partitioning, and the CAP theorem. For each principle, its available strategies and corresponding features, strengths, and drawbacks are explained. Furthermore, various implementations of each strategy are exemplified and crystallized through a collection of representative academic and industrial NoSQL technologies. Finally, we disclose some existing challenges in developing effective NoSQL stores, which need attention of the research community, application designers, and architects.
Từ khóa
Tài liệu tham khảo
Ahsan Shegufta Bakht, The CAT theorem and performance of transactional distributed systems
Renzo Angles Marcelo Arenas Pablo Barcelo Aidan Hogan Juan Reutter and Domagoj Vrgoc. 2016. Foundations of modern graph query languages. arXiv preprint arXiv:1610.06264. Renzo Angles Marcelo Arenas Pablo Barcelo Aidan Hogan Juan Reutter and Domagoj Vrgoc. 2016. Foundations of modern graph query languages. arXiv preprint arXiv:1610.06264.
Belaramani Nalini Moti, 2006, NSDI, 6
Berge Claude, Graphs and hypergraphs
Harold Boley. 1992. Declarative operations on nets. Computers 8 Mathematics with Applications 23 6--9 601--637. Harold Boley. 1992. Declarative operations on nets. Computers 8 Mathematics with Applications 23 6--9 601--637.
Bronson Nathan, 2013, USENIX Annual Technical Conference. 49--60
Jerzy Brzezinski Cezary Sobaniec and Dariusz Wawrzyniak. 2004. From session causality to causal consistency. In PDP. 152--158. Jerzy Brzezinski Cezary Sobaniec and Dariusz Wawrzyniak. 2004. From session causality to causal consistency. In PDP. 152--158.
Codasyl DBTG, 1971, Conference on Data Systems Languages, ACM
Didona Diego, 2017, Okapi: Causally consistent geo-replication made faster, cheaper and more available. arXiv preprint arXiv:1702.04263.
Du Jiaqing, 2014, Proceedings of the 1st Workshop on Principles and Practice of Eventual Consistency (PaPEC’14)
Escriva Robert, 2014, Warp: Lightweight multi-key transactions for key-value stores. Technical Report
Feinberg Alex, 2011, Proceedings of the 10th IEEE International Conference on Data Engineering (ICDE’11)
Fox Armando, Proceedings of the 7th Workshop on Hot Topics in Operating Systems. IEEE, 174--178
Gallagher Brian, 2006, Matching structure and semantics: A survey on graph-based pattern matching, AAAI FS, 6, 45
Hale Coda, 2010, Retrieved
Pat Helland. 2007. Life beyond distributed transactions: An apostate’s opinion. In CIDR. 132--141. Pat Helland. 2007. Life beyond distributed transactions: An apostate’s opinion. In CIDR. 132--141.
Kleppmann Martin, 2015, A critique of the CAP theorem. arXiv preprint arXiv:1509.05393
Knisley Debra, Graph theoretic models in chemistry and molecular biology. Handbook of Applied Algorithms
Kyrola Aapo, Presented as Part of the 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI’12). 31--46.
Aapo Kyrola and Carlos Guestrin. 2014. GraphChi-DB: Simple design for a scalable graph database system--on just a PC. arXiv preprint arXiv:1403.0701. Aapo Kyrola and Carlos Guestrin. 2014. GraphChi-DB: Simple design for a scalable graph database system--on just a PC. arXiv preprint arXiv:1403.0701.
Lee Sin Yeung, 2000, Tok Wang Ling, and Hua-Gang Li
Levandoski Justin, Transaction processing techniques for modern hardware and the cloud
Levandoski Justin J., 2011, Deuteronomy: Transaction support for cloud data, CIDR, 11, 123
Daniel Mark Lewin. 1998. Consistent hashing and random trees: Algorithms for caching in distributed networks. Master’s thesis Department of EECS MIT. Daniel Mark Lewin. 1998. Consistent hashing and random trees: Algorithms for caching in distributed networks. Master’s thesis Department of EECS MIT.
Li Cheng, 2014, USENIX Annual Technical Conference. 281--292
Liu Jed, 2014, Warranties for faster strong consistency, NSDI, 14, 503
Lloyd Wyatt, 2013, Stronger semantics for low-latency geo-replicated storage, NSDI, 13, 313
Low Yucheng, 2010, Proceedings of the Conference on Uncertainty in Artificial Intelligence 5, 8
Luo Qiong, 2001, Naughton
Mahajan Prince, Technical Report TR-11-22
Maier D., 1989, Proceedings of the International Workshop on Database Programming Languages. 277--287
Marchioni Francesco, Infinispan Data Grid Platform
Massé Mark, REST API Design Rulebook: Designing Consistent RESTful Web Service Interfaces. O’Reilly Media
Mattern Friedemann, 1989, Virtual time and global states of distributed systems, Parallel and Distributed Algorithms, 1, 215
McAfee Andrew, 2012, Big data. The Management Revolution, Harvard Business Review, 90, 61
Meltz Dean, An Introduction to IMS: Your Complete Guide to IBM’s Information Management System
Merkle Ralph C., 1989, Conference on the Theory and Application of Cryptology. Springer, 218--238
Milano Matthew P., 2016, Myers
Vijayan Prabhakaran Ming Wu Xuetian Weng Frank McSherry Lidong Zhou and Maya Haridasan. 2012. Managing large graphs on multi-cores with graph awareness. In USENIX (ATC’12). 41--52. Vijayan Prabhakaran Ming Wu Xuetian Weng Frank McSherry Lidong Zhou and Maya Haridasan. 2012. Managing large graphs on multi-cores with graph awareness. In USENIX (ATC’12). 41--52.
Rao Ananth, Load balancing in structured P2P systems
Raynal Michel, Foundations of Software Technology and Theoretical Computer Science
James, Proceedings of the 3rd International Conference on Very Large Databases. VLDB Endowment, 48--62
Rowstron Antony, 2001, Pastry: Scalable, decentralized object location, and routing for large-scale peer-to-peer systems. In Middleware
Pramod, 2012, Sadalage and Martin Fowler
Hans Sagan. 1994. Space-Filling Curves. Springer-Verlag Berlin. Hans Sagan. 1994. Space-Filling Curves. Springer-Verlag Berlin.
Michael Schaarschmidt Felix Gessert and Norbert Ritter. 2015. Towards automated polyglot persistence. In BTW. 73--82. Michael Schaarschmidt Felix Gessert and Norbert Ritter. 2015. Towards automated polyglot persistence. In BTW. 73--82.
Stonebraker Michael, 1986, The case for shared nothing, IEEE Database Engineering Bulletin, 9, 4
Michael Stonebraker. 2010a. Errors in database systems eventual consistency and the cap theorem. Communications of the ACM BLOG@ACM. https://cacm.acm.org/blogs/blog-cacm/83396-errors-in-database-systems-eventual-consistency-and-the-cap-theorem/fulltext. Michael Stonebraker. 2010a. Errors in database systems eventual consistency and the cap theorem. Communications of the ACM BLOG@ACM. https://cacm.acm.org/blogs/blog-cacm/83396-errors-in-database-systems-eventual-consistency-and-the-cap-theorem/fulltext.
Michael Stonebraker. 2011. Why enterprises are uninterested in NoSQL. Communications of the ACM BLOG@ACM. https://cacm.acm.org/blogs/blog-cacm/99512-why-enterprises-are-uninterested-in-nosql/fulltext. Michael Stonebraker. 2011. Why enterprises are uninterested in NoSQL. Communications of the ACM BLOG@ACM. https://cacm.acm.org/blogs/blog-cacm/99512-why-enterprises-are-uninterested-in-nosql/fulltext.
Michael Stonebraker. 2012. Newsql: An alternative to NoSQL and old SQL for new OLTP apps. Communications of the ACM BLOG@ACM. https://cacm.acm.org/blogs/blog-cacm/109710-new-sql-an-alternative-to-nosql-and-old-sql-for-new-oltp-apps/fulltext. Michael Stonebraker. 2012. Newsql: An alternative to NoSQL and old SQL for new OLTP apps. Communications of the ACM BLOG@ACM. https://cacm.acm.org/blogs/blog-cacm/109710-new-sql-an-alternative-to-nosql-and-old-sql-for-new-oltp-apps/fulltext.
Stonebraker Michael, 2013, SciDB DBMS research at MIT, IEEE Database Engineering Bulletin, 36, 21
Stonebraker Michael, Object Relational DBMSs: The Next Great Wave
Terry Douglas B., Proceedings of the 3rd International Conference on Parallel and Distributed Information Systems. IEEE, 140--149
Wada Hiroshi, 2011, Data consistency properties and the trade-offs in commercial cloud storage: The consumers’ perspective, CIDR, 11, 134
Wu Xingbo, 2015, Proceedings of the 2015 USENIX Conference on Usenix Annual Technical Conference.
Zaniolo C., 1985, D. Beech, S. Cammarata, L. Kerschberg, and D. Maier.
Marek Zawirski Annette Bieniusa Valter Balegas Sérgio Duarte Carlos Baquero Marc Shapiro and Nuno Preguiça. 2013. SwiftCloud: Fault-tolerant geo-replication integrated all the way to the client machine. arXiv preprint arXiv:1310.3107. Marek Zawirski Annette Bieniusa Valter Balegas Sérgio Duarte Carlos Baquero Marc Shapiro and Nuno Preguiça. 2013. SwiftCloud: Fault-tolerant geo-replication integrated all the way to the client machine. arXiv preprint arXiv:1310.3107.