Spanner

ACM Transactions on Computer Systems - Tập 31 Số 3 - Trang 1-22 - 2013
James C. Corbett1, Jay B. Dean1, Michael Epstein1, Andrew Fikes1, Christopher Frost1, J. J. Furman1, Sanjay Ghemawat1, Andrey Gubarev1, Christopher Heiser1, Peter Hochschild1, Wilson C. Hsieh1, Sebastian Kanthak1, Eugene Kogan1, Hongyi Li1, Alexander Lloyd1, Sergey Melnik1, David Mwaura1, David F. Nagle1, Sean Quinlan1, Rajesh Rao1, Lindsay Rolig1, Yasushi Saitō1, Michal Szymaniak1, Christopher M. Taylor1, Ruth Wang1, Dale Woodford1
1Google Inc

Tóm tắt

Spanner is Google’s scalable, multiversion, globally distributed, and synchronously replicated database. It is the first system to distribute data at global scale and support externally-consistent distributed transactions. This article describes how Spanner is structured, its feature set, the rationale underlying various design decisions, and a novel time API that exposes clock uncertainty. This API and its implementation are critical to supporting external consistency and a variety of powerful features: nonblocking reads in the past, lock-free snapshot transactions, and atomic schema changes, across all of Spanner.

Từ khóa


Tài liệu tham khảo

Abouzeid , A. , Bajda-Pawlikowski , K. , Abadi , D. , Silberschatz , A. , and Rasin , A . 2009. Hadoopdb: An architectural hybrid of mapreduce and DBMS technologies for analytical workloads . In Proceedings of the International Conference on Very Large Data Bases. 922--933 . Abouzeid, A., Bajda-Pawlikowski, K., Abadi, D., Silberschatz, A., and Rasin, A. 2009. Hadoopdb: An architectural hybrid of mapreduce and DBMS technologies for analytical workloads. In Proceedings of the International Conference on Very Large Data Bases. 922--933.

10.1145/223784.223787

Amazon. 2012. Amazon dynamodb. http://aws.amazon.com/dynamodb. Amazon. 2012. Amazon dynamodb. http://aws.amazon.com/dynamodb.

Armbrust , M. , Curtis , K. , Kraska , T. , Fox , A. , Franklin , M. , and Patterson , D . 2011. PIQL: Success-tolerant query processing in the cloud . In Proceedings of the International Conference on Very Large Data Bases. 181--192 . Armbrust, M., Curtis, K., Kraska, T., Fox, A., Franklin, M., and Patterson, D. 2011. PIQL: Success-tolerant query processing in the cloud. In Proceedings of the International Conference on Very Large Data Bases. 181--192.

Baker , J. , Bond , C. , Corbett , J. C. , Furman , J. , Khorlin , A. , Larson , J. , Léon , J.-M. , Li , Y. , Lloyd , A. , and Yushprakh , V . 2011. Megastore: Providing scalable, highly available storage for interactive services . In Proceedings of CIDR. 223--234 . Baker, J., Bond, C., Corbett, J. C., Furman, J., Khorlin, A., Larson, J., Léon, J.-M., Li, Y., Lloyd, A., and Yushprakh, V. 2011. Megastore: Providing scalable, highly available storage for interactive services. In Proceedings of CIDR. 223--234.

10.1145/223784.223785

10.1145/1376616.1376645

10.1109/TSE.1985.232196

10.1145/1365815.1365816

Cooper , B. F. , Ramakrishnan , R. , Srivastava , U. , Silberstein , A. , Bohannon , P. , Jacobsen , H.-A. , Puz , N. , Weaver , D. , and Yerneni , R . 2008. PNUTS: Yahoo!’s hosted data serving platform . In Proceedings of the International Conference on Very Large Data Bases. 1277--1288 . Cooper, B. F., Ramakrishnan, R., Srivastava, U., Silberstein, A., Bohannon, P., Jacobsen, H.-A., Puz, N., Weaver, D., and Yerneni, R. 2008. PNUTS: Yahoo!’s hosted data serving platform. In Proceedings of the International Conference on Very Large Data Bases. 1277--1288.

Cowling , J. and Liskov , B . 2012. Granola: Low-overhead distributed transaction coordination . In Proceedings of USENIX ATC. 223--236 . Cowling, J. and Liskov, B. 2012. Granola: Low-overhead distributed transaction coordination. In Proceedings of USENIX ATC. 223--236.

10.1145/1629175.1629198

Douceur J. and Howell J. 2003. Scalable Byzantine-fault-quantifying clock synchronization. Tech. rep. MSR-TR-2003-67 MS Research. Douceur J. and Howell J. 2003. Scalable Byzantine-fault-quantifying clock synchronization. Tech. rep. MSR-TR-2003-67 MS Research.

Douceur , J. R. and Howell , J . 2006. Distributed directory service in the Farsite file system . In Proceedings of the USENIX Symposium on Operating Systems Design and Implementation. 321--334 . Douceur, J. R. and Howell, J. 2006. Distributed directory service in the Farsite file system. In Proceedings of the USENIX Symposium on Operating Systems Design and Implementation. 321--334.

10.1145/945445.945450

Gifford D. K. 1982. Information storage in a decentralized computer system. Tech. rep. CSL-81-8 Xerox PARC. Gifford D. K. 1982. Information storage in a decentralized computer system. Tech. rep. CSL-81-8 Xerox PARC.

10.1145/2043556.2043559

Google. 2008. Protocol buffers --- Google’s data interchange format. https://code.google.com/p/protobuf. Google. 2008. Protocol buffers --- Google’s data interchange format. https://code.google.com/p/protobuf.

10.1145/1132863.1132867

Helland , P. 2007 . Life beyond distributed transactions: An apostate’s opinion . In Proceedings of the Biennial Conference on Innovative Data Systems Research. 132--141 . Helland, P. 2007. Life beyond distributed transactions: An apostate’s opinion. In Proceedings of the Biennial Conference on Innovative Data Systems Research. 132--141.

10.1145/78969.78972

10.1145/279227.279229

10.1145/1753171.1753191

10.1007/BF02242709

10.1109/ICDE.2009.56

10.1145/1217935.1217946

MarkLogic. 2012. Marklogic 5 product documentation. http://community.marklogic.com/docs. MarkLogic. 2012. Marklogic 5 product documentation. http://community.marklogic.com/docs.

10.1145/800221.806730

Melnik , S. , Gubarev , A. , Long , J. J. , Romer , G. , Shivakumar , S. , Tolton , M. , and Vassilakis , T . 2010. Dremel: Interactive analysis of Web-scale datasets . In Proceedings of the International Conference on Very Large Data Bases. 330--339 . Melnik, S., Gubarev, A., Long, J. J., Romer, G., Shivakumar, S., Tolton, M., and Vassilakis, T. 2010. Dremel: Interactive analysis of Web-scale datasets. In Proceedings of the International Conference on Very Large Data Bases. 330--339.

Mills D. 1981. Time synchronization in DCNET hosts. Internet project rep. IEN--173 COMSAT Laboratories. Mills D. 1981. Time synchronization in DCNET hosts. Internet project rep. IEN--173 COMSAT Laboratories.

Oracle. 2012. Oracle total recall. http://www.oracle.com/technetwork/database/focus-areas/storage/total-recall-whitepaper-171749.pdf. Oracle. 2012. Oracle total recall. http://www.oracle.com/technetwork/database/focus-areas/storage/total-recall-whitepaper-171749.pdf.

10.1145/1559845.1559865

Peng , D. and Dabek , F . 2010. Large-scale incremental processing using distributed transactions and notifications . In Proceedings of the USENIX Symposium on Operating Systems Design and Implementation. 1--15 . Peng, D. and Dabek, F. 2010. Large-scale incremental processing using distributed transactions and notifications. In Proceedings of the USENIX Symposium on Operating Systems Design and Implementation. 1--15.

10.1145/320251.320260

Shraer , A. , Reed , B. , Malkhi , D. , and Junqueiera , F . 2012. Dynamic reconfiguration of primary/backup clusters . In Proceedings of USENIX ATC. 425--438 . Shraer, A., Reed, B., Malkhi, D., and Junqueiera, F. 2012. Dynamic reconfiguration of primary/backup clusters. In Proceedings of USENIX ATC. 425--438.

10.1145/2213836.2213954

10.1145/2043556.2043592

Stonebraker M. 2010a. Six SQL urban myths. http://highscalability.com/blog/2010/6/28/voltdb-decapitates -six-sql-urban-myths-and-delivers-internet.html. Stonebraker M. 2010a. Six SQL urban myths. http://highscalability.com/blog/2010/6/28/voltdb-decapitates -six-sql-urban-myths-and-delivers-internet.html.

Stonebraker M. 2010b. Why enterprises are uninterested in NoSQL. http://cacm.acm.org/blogs/blog-cacm/99512-why-enterprises-are-uninterested-in-nosql/fulltext. Stonebraker M. 2010b. Why enterprises are uninterested in NoSQL. http://cacm.acm.org/blogs/blog-cacm/99512-why-enterprises-are-uninterested-in-nosql/fulltext.

Stonebraker , M. , Madden , S. , Abadi , D. J. , Harizopoulos , S. , Hachem , N. , and Helland , P . 2007. The end of an architectural era: (It’s time for a complete rewrite) . In Proceedings of the International Conference on Very Large Data Bases. 1150--1160 . Stonebraker, M., Madden, S., Abadi, D. J., Harizopoulos, S., Hachem, N., and Helland, P. 2007. The end of an architectural era: (It’s time for a complete rewrite). In Proceedings of the International Conference on Very Large Data Bases. 1150--1160.

10.1145/2213836.2213838

Thusoo , A. , Sarma , J. S. , Jain , N. , Shao , Z. , Chakka , P. , Zhang , N. , Antony , S. , Liu , H. , and Murthy , R . 2010. Hive --- A petabyte scale data warehouse using Hadoop . In Proceedings of the International Conference on Data Engineering. 996--1005 . Thusoo, A., Sarma, J. S., Jain, N., Shao, Z., Chakka, P., Zhang, N., Antony, S., Liu, H., and Murthy, R. 2010. Hive --- A petabyte scale data warehouse using Hadoop. In Proceedings of the International Conference on Data Engineering. 996--1005.

VoltDB. 2012. VoltDB resources. http://voltdb.com/resources/whitepapers. VoltDB. 2012. VoltDB resources. http://voltdb.com/resources/whitepapers.