i v o a . n e t

/ WebHome / IvoaEvents / InterOpMay2005 / IVOA.IvoaCP

IVOA.net


Wiki Home 

Twiki Meta & Help


Astrodata
IVOA
Know
Sandbox
TWiki
Trash

TWiki intro
TWiki tutorial
User registration
Notify me

Working Groups


Data Access
Data Model
GWS
Query Language
Registry
Stds&Procs
Semantics
VOEvent
VOTable

Applications
Astro-RG
Data Curation
Theory

Interest Groups


www.ivoa.net
Documents
Events
Members
XML Schema

IVOA Data Curation and Preservation

This Interest Group is chaired by AlbertoAccomazzi (appointed May 2010), who succeeds BobHanisch (appointed May 2007).



Charter

The Data Curation and Preservation (DCP) Interest Group is established to share best practices and engage IVOA member projects in the long-term curation and preservation of astronomical data. Discussion topics include:

The Interest Group will from time to time produce white papers and/or bring proposed actions to the attention of the IVOA Technical Coordination Group and IVOA Executive.

Mailing-list

Meetings

May 2010, Victoria Interop InterOpMay2010DCP

May 2008, Trieste Interop InterOpMay2008DCP

October 2007, Cambridge Interop InterOpSep2007CP

Members

There is no fixed membership; anyone interested in data curation and preservation is welcome to participate.

People who are currently involved include Bob Hanisch, Francoise Genova, Pepi Fabbiano, Arnold Rots, Wolfgang Voges, and Bob Mann.

Please add your name to this list!

Background

At the IVOA meeting #11 on 23 June 2004, a request was made for the creation of an Interest Group for Data Curation and Preservation. Reagan Moore (SDSC, USA) and Francoise Genova (CDS, France) were tasked with developing the initial description of the activities for the interest group. The goal of the group is to identify both mechanisms for the long-term preservation of astrophysics collections and sustainability procedures to ensure continued access to astrophysics collections that are at risk.

Mechanisms for long-term preservation address challenges related to:

Sustainability procedures for continued support address challenges related to:

The IVOA Data Curation and Preservation interest group will develop:

(1) A white paper discussing the concepts involved in preservation, and providing a description of existing preservation environments.

(2) A white paper discussing the sustainability procedures.

A combined document will then be circulated to the IVOA as an IVOA Working Draft.

The technologies that are available to build preservation environments come from the grid and digital library communities. Examples include:

One expectation is that multiple preservation models may be used, and that interoperability mechanisms will be developed for exchange of data and metadata between preservation systems. This capability will be required if for no other reason than to manage migration between different versions of technology over time.

Interface of the WG with other parts of the IVOA

Preservation environments can be assembled by federating existing collections with deep archives. This means that interactions with existing archives is essential for building a viable system.

Interactions will be needed with the Data Access Layer WG for providing uniform interfaces to the preservation environment, and with the UCD WG to provide relevant discovery information for items in the preservation environment.

Requirements from other WGs

We will seek requirements from other WGs through specification of use cases. A first use case is being defined in collaboration with Interpares (International Research on Permanent Authentic Records in Electronic Systems). The Canadian MOST image collection is being analyzed for preservation requirements. A report will be generated in 1st quarter, 2005. Other examples of interactions with IVOA working groups include integration of preservation environments with processing pipelines, and support for discovery through portals.

Currently the process to provide requirements is by email to the IVOA PE mailing list.

Namespaces

The preservation environment will use at least four name spaces for identifying files:

The preservation environment provides additional name spaces to describe:

By managing each of these name spaces independently from the underlying storage repositories and administrative domains, the preservation environment can control the authenticity and integrity of the collections.

Federation

To ensure preservation against natural disasters, operator error, and malicious users, replicas of the data must be kept with differentiated levels of access. One copy should be accessible by users, a second copy user-accessible copy should be created at a geographically remote site for fault-tolerance, and a third copy should be kept in a deep archive that is not user-accessible. Each of the copies should be managed by an independent metadata catalog. Federated data grids provide this level of support.

The project will track preservation approaches across multiple communities:

Links

Papers

Attachment sort Action Size Date Uploaded by Comment
IVOA-Kyoto.ppt manage 68.5 K 19 May 2005 - 06:04 ReaganMoore Presentation on Data Curation and Preservation
Preservation.pdf manage 46.3 K 19 May 2005 - 07:07 ReaganMoore Interest Group meeting notes

 
 
© 2003-2011 by the contributing authors.  - You are TWikiGuest