ICSM ISO 19115-1 Metadata Best Practice Guide

Metadata Identifier ★★★★★

In order for machines and people to distinguish a metadata record from all others a Metadata record MD_Metadata needs a unique identifier.

   
Element Name metadataIdentifier
Parent MD_Metadata
Class/Type MD_Identification
Governance Common ICSM
Purpose Linkage, Identification
Audience machine resource - ⭑ ⭑ ⭑
  general - ⭑
  resource manager - ⭑ ⭑ ⭑
  specialist - ⭑ ⭑
Metadata type administrative
ICSM Level of Agreement ⭑ ⭑ ⭑ ⭑

Definition

The persistent unique alphanumeric identifier for the metadata record that describes a spatial resource.

ISO Obligation

Discussion

Every metadata record needs a unique identifier so as to provide certainty as to the identity of the record and to provide a primary key for linkages. This element should hold UUID, preferably in the form of a resolvable URI (either standalone or in combination with another element e.g. <mcc:codespace>). This provides the machine-readable (and human) ability to determine if this is the same or different to other metadata records. It is useful in machine to machine activities such as metadata harvesting. It is also at times useful to data managers and others to determine if a record is the same as another.

This identifier must never change, irrespective of where that metadata record is stored. This allows linkages to a metadata record to persist.

The equivalent of this field was recommended as mandatory in the previous ANZLIC Standard. Common practice is to record a UUID in this field.

Best Practice Recommendations

Therefore - this element should be populated in all metadata records. It is Highly Recommended that every metadata record should have one and only one metadata identifier (MD_Metadata > mdb:metadataIdentifier > MD_Identifier). The child element <mcc:code> should contain the persistent uuid for the metadata record. While the child element <mcc:codespace> should contain the reference URL path by which, when combined with the uuid, this metadata record can be retrieved. It is beneficial to populate other sibling elements as needed such as <mcc:authority> to indicated the authority that minted the uuid.

From class MD_Identifier

Also Consider

Outstanding Issues

CORE ISSUE: In ISO the identifier for the Metadata can differ from that of the data. This is not necessarily the case in other metadata standards such as Dublin Core or DCAT.

Resolvable URIs Of note - the ISO conceptual reference model for ISO 19115-1 available at https://www.isotc211.org/hmmg/HTML/ConceptualModels/EARoot/EA1/EA12/EA2/EA4095.htm. This includes a description that differs slightly from previous description - “Unique Identifier and onlineResource for this metadata record” In many ways it would be preferable if this field were populated with a URI that ultimately resolved to a point of truth for the metadata record. Some issues remain in software that incorrectly prohibit characters needed by URIs. The nested tag that actually holds the UUID character string is mcc:code. This is noted in the UML for this element: “NOTE: avoid characters that are not legal in URLs”. This would suggest that URIs of type URL are to be allowed. While I would not currently recommend URIs in this field, due to software limitations, it is a practice that would likely be encouraged in the future when linked data is more prevalent. A recommended solution is to populate codespace with the path by which, when combined with the uuid, the metadata can be retrieved. In practice metadataLinkage (Link) often contains the online linkage (to the point of truth metadata record).

Geonetwork usage By default GeoNetwork calculates a unique value for the subelement code and populates the codespace element with “urn::uuid”. These fields are not editable once generated.

Other Discussion

DCAT Notes DCAT 2 makes several recommendations about the use of identifiers. From the specification’s Dereferencable identifiers section:

DCAT 2 suggests some ways of improving non-HTTP dereferenceable identifiers:

The concept captured by DCAT 2 using adms:schemaAgency or dct:creator is the authority in this specification.

From data.gov.au The fileIdentifier for a metadata record must never change, irrespective of where that metadata record is stored. Should be system generated. In CKANs case the UUID is common to dataset and metadata record, and takes the UUID with it across new systems. Automatically generated unique ID. Decided against DOI as unique ID already generated in CKAN. DOI records created in ANDS can be leveraged for those who want them given data.gov.au metadata to be harvested by ANDS.

Crosswalk considerations

ISO19139

In iso19115-1 Data type CI_ResponsibleParty (iso19115:2004) changed to type CI_Responsibility such as in MD_Metadata.contact, MD_DataIdentification.pointOfContact, or CI_Citation.citedResponsibleParty

Dublin core / CKAN / data.gov.au

Maps to metadata URI

DCAT

Maps to dct:identifier or adms:identifier. From DCAT 2: “The property adms:identifier can express other locally minted identifiers or external identifiers, like DOI, ELI, arΧiv for creative works and ORCID, VIAF, ISNI for actors such as authors and publishers, as long as the identifiers are globally unique and stable.” and “adms:schemaAgency is used when the authority has no URI associated.”

RIF-CS

Maps to Key Identifier

Examples

Other

data.gov.au

http://data.gov.au/dataset/559708e5-480e-4f94-8429-c49571e82761

XML

<mdb:MD_Metadata>
....
   <mdb:metadataIdentifier>
   <mcc:MD_Identifier>
     <mcc:code>
      <gco:CharacterString>
      314eb989-3771-4c24-a399-d22631973279
      </gco:CharacterString>
     </mcc:code>
     <mcc:codeSpace>
      <gco:CharacterString>
      https://geodata.nz/geonetwork/srv/metadata/
      </gco:CharacterString>
     </mcc:codeSpace>
   </mcc:MD_Identifier>
 </mdb:metadataIdentifier>
....
</mdb:MD_Metadata>

\pagebreak

UML diagrams

Recommended elements highlighted in yellow

MDIdentifier

\pagebreak