[utypes] Wild edit towards more principled definitions

Omar Laurino olaurino at head.cfa.harvard.edu
Tue Dec 20 07:27:17 PST 2011


Hi Markus,

Again, thank you for moving this effort forward.

I like your formal approach very much, I think we need it. And I like your
efforts to abstract the data model description.

However, I have some concerns:
1. Your formalization (and the previous draft, to some extent), starts from
the XML schema to define the DM graph and then the Utypes. However, I think
that the XSD should be a final product of the Data Modeling effort.

I think this for several reasons. First of all, you need to be sure that
the XSD is properly representing your Data Model, and this looks more like
a "trial and error" process to me [write the XML schema, see how the graph
looks like, then go back to the XSD and iterate]. In other words, I think
the Data Model comes first, and then you produce all the data products (you
need the Data Model at least in your mind to describe it using XSD,
right?). To be honest, I even think that the DM description document should
be written automatically as a stub, and then the human should simply add
the human relevant information. So, I would go the other way, turning over
the algorithm to go from the Graph to the XSD.

I think this was the approach of the Theory guys, by the way: they start
from the Data Model UML description (in XMI) and then use it to derive all
the "products" (XSD schema, utypes, documentation). I don't think this
approach can work for a standard specification, though, because it assumes
the use of tools (XMI compliant Data Modeling tools) that are hardly
interoperable. I still think the approach is great, but it is an approach
that *uses* a standard definition of Data Models and Utypes, not being the
standard itself. It can be considered a reference implementation that
validates the standard employing a particular set of tools, though. In
other terms, we still need an interoperable description for IVOA Data
Models (which is a subspace of the infinite data modeling space).

2. It seems to me that your model doesn't capture instantiation and
inheritance. In your sample XSD you define both TimeAxisType and
SpectralAxisType, even though SpectralAxis and TimeAxis are two instances
of the same class: your algorithm reflects this, if I am not mistaken. I am
sure you can complicate your algorithm to adapt, but this requires modeling
the Data Model itself, not its representation, so that you work in a "meta
Data Model" space. Also, in your schema there is no room for extensions,
which is quite a hot topic.

3. In your description, you can drop the prefix without losing any
information. Also, let's say I have got two Data Models: they both employ a
SpectralAxis and a TimeAxis to describe the very same information. They
will have two different Utypes, which means that they point to different
concepts! Unless you ignore the prefix, which, again, makes it redundant.

So, I think your formalization works for building the Utype paths, but this
is only part of the deal. We need to define a meta space in which the Data
Models live, we need to provide mechanisms for allowing the clients to spot
instances of known classes in the datasets, and to allow data models to
correctly include and/or extend classes defined in other models (which is
what Data Models already do in a non standard nor formal way).

Another couple of minor concerns: we still don't include the DataTypes into
the picture. Again, I think this is due to the fact that the document
focuses on building the Utype path, so we need to take a step back and fit
the path in the bigger picture of Data Model instances serialization. Also,
there is still no room for versions. You make reference to it, but you seem
to offload the problem.

That said, I would like to send you guys the draft I have been working on
ASAP. I'm validating it by building a reference implementation in the form
of a java library, which helps a lot the design.

Here is, very briefly, the big picture: I am trying to address the three
main problems we have in order to make utypes *much more* useful (they are
useful already) and to match our requirements:
1) Data Model design specification (data modeling of the IVOA Data Model
space, reusable components in terms of inheritance and instantiation,
collections of objects, versioning).

2) Data Model description (utypes, automatic generation of code, etc.)

3) Standardization of abstract serialization strategies based on utypes.
The serialization must allow clients to spot the instances of the objects
they know and to build a tree representation of those they don't know.

The prototype library looks in pretty good shape. As soon as I've got
something useful to show I will send you my results.

Cheers,

Omar.



On Mon, Dec 19, 2011 at 11:34 AM, Omar Laurino <
olaurino at head.cfa.harvard.edu> wrote:

> Hi Markus, All,
>
> I wanted to come up with a draft last week, but I didn't make it. Also I
> haven't had time to review your draft. I will work on the Utypes document
> today and provide you with some feedback.
>
> Thanks for your contribution!
>
> Cheers,
>
> Omar.
>
>
> On Wed, Dec 14, 2011 at 11:52 AM, Mireille Louys <
> mireille.louys at unistra.fr> wrote:
>
>> Hello, Markus, all,
>>
>> Thanks for pushing this effort forward.
>> I 'll have a close look next week.
>> I currently have deep teaching commitments.
>>
>> best wishes , Mireille
>>
>> Markus Demleitner <msdemlei at ari.uni-heidelberg.**de<msdemlei at ari.uni-heidelberg.de>>
>> a écrit :
>>
>>  Dear colleagues,
>>>
>>> As those of you who followed my talk in Pune
>>> (http://docs.g-vo.org/talks/**2011-pune-utypes.pdf<http://docs.g-vo.org/talks/2011-pune-utypes.pdf>)
>>> may have feared (or
>>> hoped; there were some approving gestures back at the talk), I've now
>>> spent some quality time rewriting the chapters 2-4 for the utype
>>> specification.
>>>
>>> ...
>>>
>>>
>>> If, on the other hand, you think I'm mad and you'll never agree to
>>> such extensive changes to the utypes document (or to such a
>>> dumbing-down of data modelling, or whatever), *please* by all means
>>> speak up now.  I'll not be cross.  Promised.  You'll be saving me a
>>> lot of work.
>>>
>>> Finally, of course, if you think only parts of what I've tried are
>>> dead wrong, you're of course welcome to complain (or even fix), too.
>>>
>>> Cheers,
>>>
>>>          Markus
>>>
>>
>> --
>> Mireille Louys, assistant professor at  UDS: ENSPS, Laboratoire ICube et
>> CDS
>> Observatoire de Strasbourg
>> mail to: mireille.louys at unistra.fr
>> Tel: +33 3 68 85 24 34
>> Adress 1: CDS/Observatoire de Strasbourg
>> 11, rue de l'Université
>> 67000 STRASBOURG
>>
>>
>>
>> ______________________________**_________________
>> utypes mailing list
>> utypes at ivoa.net
>> http://www.ivoa.net/mailman/**listinfo/utypes<http://www.ivoa.net/mailman/listinfo/utypes>
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.ivoa.net/pipermail/utypes/attachments/20111220/e18305fe/attachment-0001.html>


More information about the utypes mailing list