Authors: Huub Stoffers, Mark van de Sanden, Johan Raber, Tom Langborg, George Tsouloupas, Olivier Rouchon, Florent Marceteau
(SARA, SNIC, CaSToRC, CINES)
Abstract: SARA, SNIC-NSC, CaSToRC and CINES each have strategies in place to store and preserve academic data produced by research projects. Although these strategies are broadly similar, the policies and technologies deployed to manage the data differ in a few respects, which will be detailed. The way each site addresses the increasing need for long-term archival storage, combined with the ever-increasing size and production rate of data, will also be described, as will the data sharing problems that arise in joint research efforts where large data sets need to be shared.
Authors: Olivier Rouchon, Philippe Prat, Mathieu Cloirec
Abstract: During the past twenty years, the long-term preservation of digital information has been a matter of concern for only a few scientific or heritage institutions. These have played a key role in understanding the associated risks and in defining standards in this domain. The best practices address four technological risks which are now commonly agreed upon: loss of knowledge of the content, file format obsolescence, aging media causing data loss, and sudden software or technology changes. These practices have been put in place in institutions dealing with text, images, sound or video, where quality assurance procedures have been developed to guarantee the integrity and accessibility of the data. How this translates to raw, primary data produced by Tier-0 systems will be evaluated as part of this whitepaper.
Authors: Florent Marceteau, Olivier Rouchon, Johan Raber, George Tsouloupas
(CINES, SNIC, CaSToRC)
Abstract: Reliability, performance, costs and return on investment are key factors in the long-term preservation of digital data, and they differ from one technology to another. The different media and technologies used for storage and transfer will be compared, with a particular focus on disks and tapes.
Author: Huub Stoffers
Abstract: “Huygens”, the IBM P6 system in Amsterdam and the current incarnation of the Dutch national supercomputer for the academic community, was one of the DEISA systems and is now a PRACE Tier-1 system in the PRACE 2IP project. In the PRACE preparatory phase it also served as a prototype system. An informal “PRACE_HOME” storage space on Huygens provides a “next stop” for PRACE Tier-0 produced data that have to be preserved for a longer time. The experience reported here is tied to the project of a Dutch investigator, Harm Jonker. The project's simulation produces a large amount of output data to be preserved, and some issues were encountered with the data preservation.
Authors: Johan Raber, Per Lundqvist and Bengt Persson
Abstract: Vagn-Ekman is a dual cluster setup in which each part has a specialized function. Ekman is a large compute cluster located in Stockholm, and Vagn is a storage and post-processing cluster located at NSC in Linköping. To an extent, the Vagn and Ekman clusters resemble future PRACE operations: data produced at a large facility will need to be transferred conveniently to researchers, potentially scattered across Europe, in a manner that safeguards data integrity. The experiences and tools developed during the Vagn-Ekman project can serve as an example of how this data flow can be carried out.
These whitepapers have been prepared by the PRACE Implementation Phase Projects and in accordance with the Consortium Agreements and Grant Agreements n° RI-261557, n° RI-283493, or n° RI-312763.
They solely reflect the opinion of the parties to such agreements on a collective basis in the context of the PRACE Implementation Phase Projects and to the extent foreseen in such agreements. Please note that even though all participants to the PRACE IP Projects are members of PRACE AISBL, these whitepapers have not been approved by the Council of PRACE AISBL and therefore do not emanate from it nor should be considered to reflect PRACE AISBL’s individual opinion.
© 2014 PRACE Consortium Partners. All rights reserved. This document is a project document of a PRACE Implementation Phase project. All contents are reserved by default and may not be disclosed to third parties without the written consent of the PRACE partners, except as mandated by the European Commission contracts RI-261557, RI-283493, or RI-312763 for reviewing and dissemination purposes.
All trademarks and other rights on third party products mentioned in the document are acknowledged as owned by their respective holders.