Structure-independent disaster recovery: Concept, architecture and implementations

Science in China Series F: Information Sciences - Volume 52 - Pages 813-823 - 2009
WeiMin Zheng1, BinXing Fang2
1Department of Computer Science and Technology, Tsinghua University, Beijing, China
2Beijing University of Posts and Telecommunications, Beijing, China

Abstract

Disaster recovery (DR) techniques ensure data safety and service continuity under natural and human-made disasters by constructing a highly reliable storage system. Traditional disaster recovery methods are structure-dependent: DR resources are hard to share between different DR systems, which makes them expensive. In this paper we present a structure-independent disaster recovery theory and methods for implementing it. By backing up the whole system rather than just the data, device- and application-independent disaster recovery is achieved. Based on this theory, we further present a parallel recovery model and an on-demand data retrieval method. Some implementation details of a prototype recovery system are also discussed. Because the methods are independent of specific devices and applications, resource sharing can substantially reduce the cost of disaster recovery infrastructure. Experiments show that recovery time is also greatly shortened, with little service degradation.
