CSE 290S: Preserving digital information for future generations
- Class will be held in 380 Engineering 2, starting on Thursday April 6th.
- No class on Thursday, May 4th.
As many of you know, I am retiring from UCSC in June 2023, so this is the last class that I will teach as a regular faculty member. This course will cover a research area that I have worked on for nearly twenty years, and will describe the underpinnings of a critically important problem: how can we build systems that reliably and securely preserve information for future generations?
CSE 290S (Spring 2023) will cover the technologies and techniques necessary to preserve digital information for decades to centuries. We will first cover currently available technologies that might be used for archival storage, such as disk, tape, and flash, followed by a discussion of potential new technologies such as glass media and DNA. We will then cover techniques that we must use to ensure that the system can preserve the integrity and security of the bits themselves. We will conclude by touching on issues relating to understanding the stored bits.
- Instructor: Professor Ethan L. Miller
- Quarter: Spring 2023
- When: Tuesday & Thursday 09:50–11:25 AM
- Where: 380 Engineering 2 (moved from Crown Classroom 105)
- Required readings: Papers (see below) will be available online
Each student in the class will:
- Read the papers on the reading list, and be prepared to discuss them in class.
- Present several papers from the reading list. The exact number of papers will depend on the number of enrolled students.
- Complete a term-long project on a topic related to long-term data storage.
- Complete an open-book and open-note take-home final exam. The final exam will be submitted online, and will be due at the end of the final exam slot during exam week.
The approximate week-by-week schedule is listed below. Papers will be added for each week, at least a week in advance. Note that the links to the papers may require access to various digital libraries. All papers will be freely downloadable from a campus-connected computer; if you're off-campus, you might need to use the UCSC VPN.
- Mary Baker, Kimberly Keeton, Sean Martin,
- David S. H. Rosenthal, Thomas Robertson, Tom Lipkis, Vicky Reich, Seth Morabito,
- Mark W. Storer, Kevin Greenan, Ethan L. Miller,
- Existing storage technologies
- Yuhui Deng,
- Nitin Agrawal, Vijayan Prabhakaran, Ted Wobber, John Davis, Mark Manasse, Rina Panigrahy,
- Kazuo Goda, Masaru Kitsuregawa,
- OPTIONAL: Inside Solid State Drives (SSDs)
- OPTIONAL: Solid-State-Drives (SSDs) Modeling
- New storage technologies
- Ian F. Adams, Ethan L. Miller, David S. H. Rosenthal,
- Anderson, et al.,
- Luis Ceze, Jeff Nivala, Karin Strauss, https://doi.org/10.1038/s41576-019-0125-3
- Andromachi Chatzieleftheriou, Ioan Stefanovici, Dushyanth Narayanan, Benn Thomsen, Antony Rowstron,
- Erasure coding
- Cheng Huang, Huseyin Simitci, Yikang Xu, Aaron Ogus, Brad Calder, Parikshit Gopalan, Jin Li, Sergey Yekhanin,
- James S. Plank, Mario Blaum, James L. Hafner,
- Yuchong Hu, Liangfeng Cheng, Qiaori Yao, Patrick P. C. Lee, Weichun Wang, Wei Chen,
- OPTIONAL: Myna Vajha, et al.,
- Building reliable storage systems
NOTE: no class on Thursday, May 4th
- Andreas Haeberlen, Alan Mislove, Peter Druschel,
- Shobana Balakrishnan, et al.,
- REFERENCE: Microsoft's Project Pelican
- Security and integrity
- Mahesh Kallahalla, Erik Riedel, Ram Swaminathan, Qian Wang, Kevin Fu, Plutus: Scalable secure file sharing on untrusted storage, Proceedings of FAST 2003, March 2003.
- Adi Shamir, How to Share a Secret, Communications of the ACM 22(11), November 1979, pages 612–613.
- Jason K. Resch, James S. Plank, AONT-RS: Blending Security and Performance in Dispersed Storage Systems, Proceedings of FAST 2011, February 2011.
- Petros Maniatis, Mary Baker, Secure History Preservation through Timeline Entanglement, Proceedings of the 11th USENIX Security Symposium, August 2002.
- Mark W. Storer, Kevin M. Greenan, Ethan L. Miller, Kaladhar Voruganti, POTSHARDS—A Secure, Long-Term Storage System, ACM Transactions on Storage 5(2), June 2009.
- Sinjoni Mukhopadhyay, Joel Frank, Daniel Bittman, Darrell D. E. Long, Ethan L. Miller, Efficient Reconstruction Techniques for Disaster Recovery in Secret-Split Datastores, Proceedings of MASCOTS 2018, September 2018.
- Maniatis, et al., The LOCKSS peer-to-peer digital preservation system, ACM Transactions on Computer Systems 23(1), February 2005.
- T. Schwarz, E. L. Miller,
- M. W. Storer, K. M. Greenan, E. L. Miller, K. Voruganti,
- J. J. Wylie, et al., Survivable information storage systems, IEEE Computer 33(8), August 2000.
- A. Bessani, et al., DepSky: Dependable and Secure Storage in a Cloud-of-Clouds, ACM Transactions on Storage 9(4), November 2013.
- A. Celesti, M. Fazio, M. Villari, A. Puliafito, Adding long-term availability, obfuscation, and encryption to multi-cloud storage systems, Journal of Network and Computer Applications 59, January 2016.
- J. Braun, et al., LINCOS - A Storage System Providing Long-Term Integrity, Authenticity, and Confidentiality, Proceedings of the 13th ACM Asia Conference on Computer and Communications Security (ASIACCS 2017), 2017.
- OPTIONAL: S. Rhea, et al., Pond: the OceanStore Prototype, Proceedings of FAST ’03: 2nd USENIX Conference on File and Storage Technologies, March 2003.
No additional readings for this week. However, please make sure that you've read all of the readings for the quarter (and refreshed your memory of them!). We will be discussing future directions for the research area, covering unanswered questions as well as possible approaches to answering them.