-
- Peter M. Chen
- Computer Science and Engineering Division, Department of Electrical Engineering and Computer Science, University of Michigan
-
- Wee Teck Ng
- Computer Science and Engineering Division, Department of Electrical Engineering and Computer Science, University of Michigan
-
- Subhachandra Chandra
- Computer Science and Engineering Division, Department of Electrical Engineering and Computer Science, University of Michigan
-
- Christopher Aycock
- Computer Science and Engineering Division, Department of Electrical Engineering and Computer Science, University of Michigan
-
- Gurushankar Rajamani
- Computer Science and Engineering Division, Department of Electrical Engineering and Computer Science, University of Michigan
-
- David Lowell
- Computer Science and Engineering Division, Department of Electrical Engineering and Computer Science, University of Michigan
書誌事項
- タイトル別名
-
- surviving operating system crashes
抄録
<jats:p> One of the fundamental limits to high-performance, high-reliability file systems is memory's vulnerability to system crashes. Because memory is viewed as unsafe, systems periodically write data back to disk. The extra disk traffic lowers performance, and the delay period before data is safe lowers reliability. The goal of the Rio (RAM I/O) file cache is to make ordinary main memory safe for persistent storage by enabling memory to survive operating system crashes. Reliable memory enables a system to achieve the best of both worlds: reliability equivalent to a write-through file cache, where every write is instantly safe, and performance equivalent to a pure write-back cache, with no reliability-induced writes to disk. To achieve reliability, we protect memory during a crash and restore it during a reboot (a "warm" reboot). Extensive crash tests show that even without protection, warm reboot enables memory to achieve reliability close to that of a write-through file system. Adding protection makes memory even <jats:italic>safer</jats:italic> than a write-through file system while adding essentially no overhead. By eliminating reliability-induced disk writes, Rio performs 4-22 times as fast as a write-through file system, 2-14 times as fast as a standard Unix file system, and 1-3 times as fast as an optimized system that risks losing 30 seconds of data and metadata. </jats:p>
収録刊行物
-
- ACM SIGPLAN Notices
-
ACM SIGPLAN Notices 31 (9), 74-83, 1996-09
Association for Computing Machinery (ACM)