Dapper in-situ conventions spec available

Joe Sirott Joe.Sirott at noaa.gov
Wed Oct 18 12:57:05 PDT 2006


John Caron wrote:
>>>
>>> 3. has_data attribute.  to use the spec effectively, particularly in 
>>> a time series sense, we have found that any parameter in the inner 
>>> sequence should always be there, but often it will not be observed 
>>> while other parameters were.  Rather than having to look at the data 
>>> itself to see if it is totally missing, do we want a "has_data" 
>>> attribute required?
>>
>> Yes, this is an issue that we too have encountered with our Web 
>> interface to Dapper. The problem is that the DDS contains the union 
>> of all of the variables in the dataset, even if a given variable is 
>> only measured in a few time series or profiles. One example is the 
>> World Ocean Database 2005. A chlorophyll variable is contained in the 
>> DDS, but very few of the millions of profiles in the database 
>> actually contain measurements for this variable. So any user who is 
>> looking for chlorophyll measurements will have to sift through many 
>> profiles before finding a profile with valid data.
>>
>> Adding a has_data as an optional variable attribute sounds like a 
>> good idea to me.
>
>
> presumably you would want to be able to select on has_data ?
... or on any other attribute. That can't be done in the current version 
of Dapper, though.

>
> another possibility that I've seen used somewhere is that you show N 
> generic data variables, and add N fields that say which variable this 
> sample is, eg:
>
>   Sequence {
>    time;
>    float sample1, sample2, ...
>    String sample1Name, sample2Name, ...
>    String unit1, unit2, ...
>   } obs;
>
> Better would be if you basically had an enumerated type:
>
>   Structure {
>    int id;
>    String name, long_name, units ...
>   } quantity[nq];
>
>   Sequence {
>    time;
>    float sample1, sample2, ...
>    int quantity1, quantity2, ...  // quantity id
>   } obs;
>
> If you can select on the quantityi values, you can winnow out the 
> chlorophyll data.
> I realize this is a bit complicated.
> John
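For concreteness, here is a rough client-side sketch of the enumerated-type 
layout described above, in Python. Everything in it is an assumption made 
for illustration: the Quantity class, the sample table, the units and the 
select_quantity helper are hypothetical and are not part of Dapper or of 
any OPeNDAP client.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Quantity:
    """One row of the hypothetical quantity[nq] lookup table."""
    id: int
    name: str
    units: str


# Made-up lookup table standing in for the Structure quantity[nq].
QUANTITIES = [
    Quantity(0, "temperature", "degC"),
    Quantity(1, "salinity", "psu"),
    Quantity(2, "chlorophyll", "ug/l"),
]

# Made-up observations standing in for the Sequence obs.
# Each entry is (time, [(quantity_id, sample_value), ...]).
OBS = [
    (0.0, [(0, 12.5), (1, 35.1)]),
    (1.0, [(0, 12.4), (2, 0.8)]),
    (2.0, [(1, 35.0)]),
]


def select_quantity(obs, quantities, name):
    """Return (time, value) pairs for samples whose quantity id matches name."""
    wanted_ids = {q.id for q in quantities if q.name == name}
    return [
        (time, value)
        for time, samples in obs
        for quantity_id, value in samples
        if quantity_id in wanted_ids
    ]


chlorophyll = select_quantity(OBS, QUANTITIES, "chlorophyll")
```

Only the second observation carries a chlorophyll sample, so the selection 
skips the other two without inspecting their values.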
Seems like it would be a lot easier to just tag a variable with an 
attribute ... also would be easier to postprocess a dataset that doesn't 
already provide the information (as is true of most in-situ datasets)...
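
A minimal sketch of what that postprocessing step might look like, in 
Python. It assumes the records of one profile or time series have already 
been read into a list of dicts, and that missing values show up as None, 
NaN or a known fill value. The function names and the -999.0 fill value 
are hypothetical, not something Dapper or the conventions spec defines.

```python
import math


def is_missing(value, fill_value=None):
    """Treat None, NaN, and the optional fill value as missing."""
    if value is None:
        return True
    if isinstance(value, float) and math.isnan(value):
        return True
    return fill_value is not None and value == fill_value


def compute_has_data(records, variables, fill_value=None):
    """Map each variable name to True if any record holds a real value."""
    has_data = {name: False for name in variables}
    for record in records:
        for name in variables:
            # Skip variables already known to have data.
            if has_data[name]:
                continue
            if not is_missing(record.get(name), fill_value):
                has_data[name] = True
    return has_data


# Made-up profile: temperature was observed, chlorophyll never was.
profile = [
    {"depth": 0.0, "temperature": 12.5, "chlorophyll": -999.0},
    {"depth": 10.0, "temperature": 12.1, "chlorophyll": -999.0},
]

flags = compute_has_data(
    profile, ["temperature", "chlorophyll"], fill_value=-999.0
)
```

The resulting flags could then be written out as a has_data attribute on 
each variable, so a client would not have to scan the data itself.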

- Joe


