Discussion JPorter/RNottrott on minimum viable systems for
LTER Data catalog
As a starting point in LTER NIS design/implementation, we discussed
four systems that could be used to implement a successor to the
existing LTER Catalog of Core Data Sets, 1) table-based, 2) Harvest-Web
page retrieval and indexing system, 3) LTER Catalog entry and
4) Use NASA Global Change Master Directory.
- Table-based
- link Hartman table
to actual data
- query capabilities
- none
- directory tree capabilities
- requires selection of suitable data categories and consistent
coding
- one dataset might fall into multiple categories
- can be done using multiple entries in the table per dataset
- ease of establishment
- can be done by one person over network
- ease of update
- difficult, requires re-accessing WWW sites individually
- could be semi-automated using Webcrawler-like software to
detect new URLS
- Harvest-Web
page retrieval and indexing system
- query capabilities
- free text only
- directory tree capability
- none
- ease of establishment
- not clear at this point (to be explored)
- may be automated
- central site accesses site harvester
- ease of update
- automated
- RNott and KBaker have looked at some existing Harvest systems
- LTER Catalog entry (direct successor to the Core Data Set Catalog)
- create our own system focusing on very minimal metadata
(catagories, title of dataset, authors, URL(s))
- query capabilities
- text search based on title and keywords
- directory tree
- based on catagories/keywords
- ease of establishment
- requires sites to input
- or could be done by central site where sufficient metadata
is online
- ease of updates
- requires manual updates by sites
- could be facilitated by WWW forms
- comments
- need to make sure keywords etc. match up with other similar
systems (e.g. GCMD)
- Use NASA Global Change Master Directory (GCMD)
- work with GCMD on developing modified data input and query
forms that give the appearance of a stand-alone LTER system (Porter
and Nottrott have worked on this with NASA in the past. As a result,
present LTER Core Data Set Catalog entries are in GCMD)
- query capabilities
- both full-text and fielded searches
- directory-tree
- use facilities under development with GCMD
- ease of establishment
- would require sites to input appropriate descriptions, either
via automated creation of DIF's or using WWW or PC-based forms
- GCMD would provide curation in the selection of themes, topics
and terms
- ease of update
- sites would need to submit updates when datasets were added
or changed
- Comments: requires cooperative relationship be developed
with GCMD