Collaboration on Source Catalogue DM, ADQL and SkyNodes

Clive Page cgp at star.le.ac.uk
Wed Dec 21 08:04:24 PST 2005


On Wed, 21 Dec 2005, Jim Gray wrote:

> There is also the cross match that Tanu & Maria implemented in SQL.
> That may be the easiest way to do the implementation if the nodes all
> have a SQL backend.

I'd like to point out that if the data are already in a relational DBMS 
then by far the simplest way to do the cross-match, and in many cases also 
the fastest, is to use R-tree indexing and a spatial join.  I think the 
first astronomical use of this was by Andrea Baruffolo (see 
http://monet.ncsa.uiuc.edu/adass98/Proceedings/baruffoloa1/ ) but it has 
also been extensively tested here and documented on the AstroGrid wiki, 
see: http://wiki.astrogrid.org/bin/view/Astrogrid/DataDocs
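
To make the idea concrete, here is a minimal sketch of the filter-and-refine pattern that a spatial join performs. It is not the code from either reference above. Inside a DBMS the R-tree supplies the filter step; in this sketch a list sorted by declination stands in for the R-tree so that the example needs only the Python standard library. The function names (cross_match, angular_sep_deg) and the (id, ra, dec) row layout are assumptions made for illustration.

```python
import bisect
import math


def angular_sep_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    h = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(h))))


def cross_match(cat_a, cat_b, radius_deg):
    """Return (id_a, id_b, sep_deg) for every pair closer than radius_deg.

    cat_a and cat_b are lists of (id, ra_deg, dec_deg) tuples.
    """
    # Stand-in for the spatial index: catalogue B sorted by declination.
    b_sorted = sorted(cat_b, key=lambda row: row[2])
    b_decs = [row[2] for row in b_sorted]
    matches = []
    for id_a, ra_a, dec_a in cat_a:
        # Filter step: cheap lookup of candidates in the declination band.
        lo = bisect.bisect_left(b_decs, dec_a - radius_deg)
        hi = bisect.bisect_right(b_decs, dec_a + radius_deg)
        for id_b, ra_b, dec_b in b_sorted[lo:hi]:
            # Refine step: exact angular distance, safe across RA = 0/360.
            sep = angular_sep_deg(ra_a, dec_a, ra_b, dec_b)
            if sep <= radius_deg:
                matches.append((id_a, id_b, sep))
    return matches


if __name__ == "__main__":
    a = [("a1", 10.0, 20.0), ("a2", 359.9999, -5.0)]
    b = [("b1", 10.0002, 20.0001), ("b2", 0.0001, -5.0), ("b3", 50.0, 20.0)]
    for id_a, id_b, sep in cross_match(a, b, 1.0 / 3600.0):
        print(id_a, id_b, f"{sep * 3600:.2f} arcsec")
```

The declination band filters in one dimension only, so it passes more candidates to the refine step than a true two-dimensional index would; the structure of the join is otherwise the same.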

Support for spatial indexing is now included in or readily available for 
DB2, Oracle, Informix, Sybase, MySQL, and Postgres, i.e. just about all 
the DBMS widely used in astronomy (with perhaps just one exception, which 
Jim can tell you about :-).

> But, getting objects into a node dominates all other costs (moving stuff
> thru xml is expensive).

Indeed that is a very serious problem.  I wonder if we can't solve this by 
using, instead of XML, some more efficient data format, e.g. one which 
holds tabular data in binary form with just the metadata in plain text. 
There's something called the "FITS table" with exactly these properties 
which perhaps astronomers should investigate :-)
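
As a rough illustration of the size argument, the sketch below serialises the same small table twice: once as VOTable-style XML cells, and once as a plain-text header followed by fixed-width big-endian binary rows. The second layout only imitates the idea behind a FITS binary table; it is not a valid FITS file (real FITS uses 80-character header cards and 2880-byte blocks). The column names, the header syntax, and the function names are all assumptions made for this example.

```python
import struct
from xml.sax.saxutils import escape

# Hypothetical catalogue: (id, ra_deg, dec_deg) for 1000 sources.
ROWS = [(i, 10.0 + i * 1e-4, -5.0 + i * 1e-4) for i in range(1000)]
COLUMNS = [("id", "q"), ("ra", "d"), ("dec", "d")]  # struct codes: int64, float64


def to_xml(rows):
    """Serialise rows as VOTable-style TABLEDATA, one element per cell."""
    parts = ["<TABLEDATA>"]
    for row in rows:
        cells = "".join(f"<TD>{escape(repr(v))}</TD>" for v in row)
        parts.append(f"<TR>{cells}</TR>")
    parts.append("</TABLEDATA>")
    return "\n".join(parts).encode("ascii")


def to_binary(rows):
    """Plain-text column metadata, then fixed-width binary rows."""
    header = "".join(f"{name}={code}\n" for name, code in COLUMNS) + "END\n"
    fmt = ">" + "".join(code for _, code in COLUMNS)  # big-endian, as in FITS
    body = b"".join(struct.pack(fmt, *row) for row in rows)
    return header.encode("ascii") + body


def from_binary(blob):
    """Read the text header to learn the row layout, then unpack the rows."""
    end = blob.index(b"END\n")
    lines = blob[:end].decode("ascii").splitlines()
    fmt = ">" + "".join(line.split("=")[1] for line in lines)
    body = blob[end + len(b"END\n"):]
    return list(struct.iter_unpack(fmt, body))


if __name__ == "__main__":
    xml_bytes = to_xml(ROWS)
    bin_bytes = to_binary(ROWS)
    print("XML bytes:   ", len(xml_bytes))
    print("binary bytes:", len(bin_bytes))
    print("round trip exact:", from_binary(bin_bytes) == ROWS)
```

Beyond the smaller size, the binary rows need no text-to-number parsing on the receiving side, and the floating-point values survive the transfer bit for bit.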

Merry Christmas (translation for those in countries where Christianity is 
not the established religion: Happy Holidays).

-- 
Clive Page
Dept of Physics & Astronomy,
University of Leicester,
Leicester, LE1 7RH,  U.K.


