We have created Zap, a novel system for transparent migration of
legacy and networked applications. Zap provides a thin virtualization
layer on top of the operating system that introduces pods, which are
groups of processes that are provided a consistent, virtualized view
of the system. This decouples processes in pods from dependencies on
the host operating system and other processes on the system. By
integrating Zap virtualization with a checkpoint-restart mechanism,
Zap can migrate a pod of processes as a unit among machines running
independent operating systems without leaving behind any residual
state after migration.
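
The decoupling described above can be illustrated with a toy model. This is only a sketch under stated assumptions, not Zap's actual mechanism: the names `Pod`, `checkpoint`, and `restart` are hypothetical, and virtual process IDs stand in for the broader virtualized view a pod provides. The point it shows is that if processes only ever see pod-private identifiers, the saved image carries no host-specific state and can be rebound to whatever identifiers the destination machine assigns.

```python
import json
from dataclasses import dataclass, field


@dataclass
class Pod:
    """Toy pod (hypothetical): its processes only ever see virtual PIDs."""
    name: str
    next_vpid: int = 1
    vpid_to_host: dict = field(default_factory=dict)  # virtual PID -> host PID

    def add_process(self, host_pid: int) -> int:
        """Register a host process and return the virtual PID the pod sees."""
        vpid = self.next_vpid
        self.next_vpid += 1
        self.vpid_to_host[vpid] = host_pid
        return vpid

    def host_pid(self, vpid: int) -> int:
        """Translate a virtual PID into the PID used by the current host."""
        return self.vpid_to_host[vpid]


def checkpoint(pod: Pod) -> bytes:
    """Save only host-independent state: the pod name and its virtual PIDs."""
    state = {
        "name": pod.name,
        "next_vpid": pod.next_vpid,
        "vpids": sorted(pod.vpid_to_host),
    }
    return json.dumps(state).encode("utf-8")


def restart(image: bytes, new_host_pids: list) -> Pod:
    """Rebuild the pod elsewhere, rebinding each virtual PID to a new host PID."""
    state = json.loads(image.decode("utf-8"))
    if len(new_host_pids) != len(state["vpids"]):
        raise ValueError("need exactly one host PID per virtual PID")
    pod = Pod(name=state["name"], next_vpid=state["next_vpid"])
    pod.vpid_to_host = dict(zip(state["vpids"], new_host_pids))
    return pod


# Usage: the virtual PIDs survive the move; only the host-side binding changes.
pod = Pod("desktop")
a = pod.add_process(4211)   # host PIDs here are made-up example values
b = pod.add_process(4212)
image = checkpoint(pod)
moved = restart(image, [907, 908])
```

In this sketch the checkpoint image deliberately omits the host PIDs, which mirrors the idea of leaving no residual state on the source machine; a real system would have to virtualize many more resources than process identifiers.
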
We have implemented a Zap prototype in Linux that supports transparent
migration of unmodified applications without any kernel modifications.
Our Linux Zap system extends the novel checkpoint-restart mechanism
from our earlier work on CRAK, a system that provided
process Checkpoint and Restart As a Kernel Module for Linux. We
demonstrate that our Linux Zap prototype can provide general-purpose
process migration functionality with low overhead. Our experimental
results for migrating pods that run a standard user's X Window System
desktop computing environment and pods that run an Apache web
server show that these kinds of pods can be migrated with subsecond
checkpoint and restart latencies.
More Information:
- Kernel-based checkpoint and restart, LWN.net, August 2008.
- Stelios Sidiroglou, Oren Laadan, Carlos R. Pérez, Nicolas Viennot, Jason Nieh, and Angelos D. Keromytis, "ASSURE: Automatic Software Self-healing Using REscue points", Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2009), Washington, DC, March 7-11, 2009, pp. 37-48.
- Shaya Potter, Jason Nieh, and Matthew Selsky, "Secure Isolation of Untrusted Legacy Applications", Proceedings of the Twenty-first Large Installation System Administration Conference (LISA 2007), Dallas, TX, November 11-16, 2007, pp. 117-130.
- Oren Laadan, Ricardo Baratto, Dan Phung, Shaya Potter, and Jason Nieh, "DejaView: A Personal Virtual Computer Recorder", Proceedings of the Twenty-first ACM Symposium on Operating Systems Principles (SOSP 2007), Stevenson, WA, October 14-17, 2007, pp. 279-292.
- Oren Laadan and Jason Nieh, "Transparent Checkpoint-Restart of Multiple Processes on Commodity Operating Systems", Proceedings of the 2007 USENIX Annual Technical Conference, Santa Clara, CA, June 17-22, 2007, pp. 323-336.
- Stelios Sidiroglou, Oren Laadan, Angelos D. Keromytis, and Jason Nieh, "Using Rescue Points to Navigate Software Recovery (Short Paper)", Proceedings of the IEEE Symposium on Security and Privacy, Oakland, CA, May 20-23, 2007, pp. 273-280.
- Shaya Potter and Jason Nieh, "Highly Reliable Mobile Desktop Computing in Your Pocket", Proceedings of the IEEE Computer Society Signature Conference on Software Technology and Applications (COMPSAC 2006), Chicago, IL, September 18-21, 2006, pp. 247-254.
- Shaya Potter and Jason Nieh, "Reducing Downtime Due to System Maintenance and Upgrades", Proceedings of the Nineteenth Large Installation System Administration Conference (LISA 2005), San Diego, CA, December 4-9, 2005, pp. 47-62. (Best Student Paper Award)
- Shaya Potter and Jason Nieh, "Breaking the Ties that Bind: Process Isolation and Migration", ;login:, USENIX Association, 30(6), December 2005, pp. 14-17.
- Oren Laadan, Dan Phung, and Jason Nieh, "Transparent Checkpoint-Restart of Distributed Applications on Commodity Clusters", Proceedings of the 2005 IEEE International Conference on Cluster Computing (Cluster 2005), Boston, MA, September 27-30, 2005.
- Shaya Potter and Jason Nieh, "AutoPod: Unscheduled System Updates with Zero Data Loss", Abstract in Proceedings of the Second IEEE International Conference on Autonomic Computing (ICAC 2005), Seattle, WA, June 13-16, 2005, pp. 367-368.
- Shaya Potter and Jason Nieh, "WebPod: Persistent Web Browsing Sessions with Pocketable Storage Devices", Proceedings of the Fourteenth International World Wide Web Conference (WWW 2005), Chiba, Japan, May 10-14, 2005, pp. 603-612.
- Angelos Stavrou, Angelos D. Keromytis, Jason Nieh, Vishal Misra, and Dan Rubenstein, "MOVE: An End-to-End Solution To Network Denial of Service", Proceedings of the Twelfth Annual Network and Distributed System Security Symposium (NDSS 2005), San Diego, CA, February 2-4, 2005, pp. 81-96.
- Shaya Potter and Jason Nieh, "WebPod: Persistent Web Browsing Sessions with Pocketable Storage Devices", Technical Report CUCS-047-04, Department of Computer Science, Columbia University, November 2004.
- Gong Su, "MOVE: Mobility with Persistent Network Connections", Ph.D. Thesis, Department of Computer Science, Columbia University, October 2004.
- Ricardo Baratto, Shaya Potter, Gong Su, and Jason Nieh, "MobiDesk: Mobile Virtual Desktop Computing", Proceedings of the Tenth Annual ACM International Conference on Mobile Computing and Networking (MobiCom 2004), Philadelphia, PA, September 26-October 1, 2004, pp. 1-15. (Best Student Paper Award)
- Shaya Potter, Jason Nieh, and Dinesh Subhraveti, "Secure Isolation and Migration of Untrusted Legacy Applications", Technical Report CUCS-005-04, Department of Computer Science, Columbia University, January 2004.
- Angelos D. Keromytis, Janak Parekh, Philip N. Gross, Gail Kaiser, Vishal Misra, Jason Nieh, Dan Rubenstein, and Sal Stolfo, "A Holistic Approach to Service Survivability", Proceedings of the 2003 ACM Workshop on Survivable and Self-Regenerative Systems, Fairfax, VA, October 31, 2003, pp. 11-22.
- Angelos D. Keromytis, Janak Parekh, Philip N. Gross, Gail Kaiser, Vishal Misra, Jason Nieh, Dan Rubenstein, and Sal Stolfo, "A Holistic Approach to Service Survivability", Technical Report CUCS-021-03, Department of Computer Science, Columbia University, July 2003.
- Steven Osman, Dinesh Subhraveti, Gong Su, and Jason Nieh, "The Design and Implementation of Zap: A System for Migrating Computing Environments", Proceedings of the Fifth Symposium on Operating Systems Design and Implementation (OSDI 2002), Boston, MA, December 9-11, 2002, pp. 361-376.
- Gong Su and Jason Nieh, "Mobile Communication with Virtual Network Address Translation", Technical Report CUCS-003-02, Department of Computer Science, Columbia University, February 2002.
- Hua Zhong and Jason Nieh, "CRAK: Linux Checkpoint / Restart As a Kernel Module", Technical Report CUCS-014-01, Department of Computer Science, Columbia University, November 2001.
- CRAK source code for the kernel module and some utilities for Linux 2.2.19, 2.4.4 (beta), and 2.4.4 with socket support (alpha).
- Zap Developers Mailing List. Zap and CRAK developers can subscribe to discuss development and deployment issues.