Sequences, Lists, and Relational Constraint Expressions

James Gallagher jhrg at mac.com
Mon May 8 08:53:37 PDT 2006


On May 8, 2006, at 6:27 AM, Roy Mendelssohn wrote:

> I think to be useful you must be able to constrain more than lat- 
> lon-depth-time.  You should be able to constrain things like  
> "station", "line", and some other properties of the variables.  I  
> understand that allowing constraints on everything can be  
> difficult. If there were a series of well-formed conventions, than  
> some limit could be agreed to based on the structure of the  
> convention.
>
> Certainly formalizing the DAPPER convention, and also  firming up  
> and comparing the conventions John Caron has recommended woul dbe a  
> good start.

I agree. Originally the DAP had a data type called 'Function' which  
was not a function like f(x) but a Sequence where the columns were  
divided into two groups: Independent and dependent. It was never used  
and I dropped it (just like I dropped List). There's no silver bullet  
here, but I thought the history might be useful in some way. Maybe  
now we're a point where it makes sense to divide things in a sequence  
along these lines?

James
>
> -Roy
>
>
> At 2:45 PM -0600 5/5/06, John Caron wrote:
>> Steve Hankin wrote:
>>> Hi John,
>>>
>>> This dilemma is the result of a mismatch between the "discipline  
>>> neutral" generality of the OPeNDAP DAP and the relatively  
>>> focussed data models that Unidata is working with.  The obvious  
>>> way to address this is through conventions -- just as the CF  
>>> conventions are already needed in order to map the generality of  
>>> netCDF onto the Common Data Model.
>>>
>>> It seems to me that the DAPPER conventions, if fleshed out, could  
>>> address the problem that you pose.  Fully fleshed out DAPPER  
>>> conventions would clarify the process for identifying the  
>>> independent (coordinate) variables, georeferencing, the  
>>> requirement (or not) for a 2-level Sequence, the scope of  
>>> structures that would be permitted inside the sequences, the  
>>> adoption (or not) of CF standard names, etc.   Through the  
>>> conventions one could also impose restrictions (or convey  
>>> information to clients) on the types of relational operations  
>>> that are allowed and/or the variables to which they can be applied.
>>
>> Hmmm, I dont think you could use conventions to alter the  
>> semantics of the data access protocol. That is, operations will  
>> still be allowed, but they may fail because they exceed resource  
>> limits.
>>
>> The problem is we want to model our data with variable length  
>> arrays, and right now we only have Sequences to do that, which  
>> require supporting RCE on all fields. We could move to supporting  
>> a List data type, but that has no RCE on any fields. I want to  
>> support RCE on some fields of a variable length array, because I  
>> can only do that efficiently on some of the fields. (This is true  
>> even with an RDB, namely the fields that you have indexed).
>>
>> More abstractly, this is an example where I (ideally) want to  
>> express what opendap calls are practical. The client should be  
>> able to exmine the DDS and decide how to best get the info it  
>> wants. Because of the generality of opendap (i assume following  
>> relational theory) i cant do that.
>>
>> So you are suggesting that instead of indicating "what is  
>> practical" in the DDS, we should do it at some layer above that,  
>> eg the "Dapper convention layer". Thats probably what we have to  
>> do if we cant do it in the DDS. I wanted to see if anyone else had  
>> other ideas before we follow that path, though.
>>
>>>
>>>    - steve
>>>
>>> =================================================
>>>
>>> John Caron wrote:
>>>
>>>> Hi James, et al:
>>>>
>>>> One problem I have in implementing opendap sequences is that you  
>>>> have to support relational constraint expressions (RCE) on all  
>>>> of the fields of a sequence. This only scales if you use a DBMS.  
>>>> I would prefer to support RCE on some of the fields, eg the  
>>>> space and time coordinates and station ids.
>>>> Im wondering if anyone has this problem, and if there is  
>>>> anything to do about it?
>
>
> -- 
> **********************
> "The contents of this message do not reflect any position of the  
> Government or NOAA."
> **********************
> Roy Mendelssohn
> Supervisory Operations Research Analyst
> NOAA/NMFS
> Environmental Research Division
> Southwest Fisheries Science Center
> 1352 Lighthouse Avenue
> Pacific Grove, CA 93950-2097
>
> e-mail: Roy.Mendelssohn at noaa.gov (Note new e-mail address)
> voice: (831)-648-9029
> fax: (831)-648-8440
> www: http://www.pfeg.noaa.gov/
>
> "Old age and treachery will overcome youth and skill."
>

--
James Gallagher                jgallagher at opendap.org
OPeNDAP, Inc                   406.723.8663



More information about the Opendap-tech mailing list