void:Dataset a owl:Class

Subclasses

Database

GBOLDataSet

DataFile

Annotations

skos:definition The fundamental concept of VoID is the dataset. A dataset is a set of RDF triples that are published, maintained or aggregated by a single provider. Unlike RDF graphs, which are purely mathematical constructs [RDF-CONCEPTS], the term dataset has a social dimension: we think of a dataset as a meaningful collection of triples, that deal with a certain topic, originate from a certain source or process, are hosted on a certain server, or are aggregated by a certain custodian. Also, typically a dataset is accessible on the Web, for example through resolvable HTTP URIs or through a SPARQL endpoint, and it contains sufficiently many triples that there is benefit in providing a concise summary.

Since most datasets describe a well-defined set of entities, datasets can also be seen as a set of descriptions of certain entities, which often share a common URI prefix (such as http://dbpedia.org/resource/).

In VoID, a dataset is modeled as an instance of the void:Dataset class. Such a void:Dataset instance is a single RDF resource that represents the entire dataset, and thus allows us to easily make statements about the entire dataset and all its triples.

The relationship between a void:Dataset instance and the concrete triples contained in the dataset is established through access information, such as the address of a SPARQL endpoint where the triples can be accessed.
subDomain DocumentCore
skos:editorialNote #Fields we could include from FAIR
#Fields taken from FAIR
#dcterms:isPartOf IRI;
#fdp-o:metadataIssued xsd:dateTime;
#fdp-o:metadataModified xsd:dateTime;
#dcterms:issued xsd:dateTime;
#dcterms:modified xsd:dateTime;
#dcterms:rights IRI;
#dcterms:conformsTo IRI;
#fdp-o:metadataIdentifier @:metadataID;
#dcterms:publisher @:agent+;
#dcat:landingPage IRI;
#dcat:theme IRI+;
#dcat:keyword xsd:string*;
#Links to distribution
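
The definition above models a dataset as a single void:Dataset resource that is tied to its triples through access information. A minimal sketch of such a description, built with plain Python string handling and serialized as Turtle, might look like the following. The prefix bindings (in particular `dc:` bound to the Dublin Core terms namespace), the `http://example.org/...` URIs, and the helper names `iri`, `literal` and `dataset_to_turtle` are assumptions made for illustration; this page does not show them.

```python
# Prefix bindings are assumptions: this page does not list them.
PREFIXES = {
    "void": "http://rdfs.org/ns/void#",
    "dc": "http://purl.org/dc/terms/",
}


def iri(value):
    """Wrap an absolute IRI in angle brackets for Turtle."""
    return f"<{value}>"


def literal(value):
    """Quote a simple single-line string as a Turtle literal."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def dataset_to_turtle(subject, properties):
    """Serialize one void:Dataset resource as Turtle text.

    `properties` is a list of (prefixed predicate, already-formatted object).
    """
    lines = [f"@prefix {prefix}: <{ns}> ." for prefix, ns in PREFIXES.items()]
    lines.append("")
    lines.append(f"{iri(subject)} a void:Dataset ;")
    body = [f"    {predicate} {obj}" for predicate, obj in properties]
    lines.append(" ;\n".join(body) + " .")
    return "\n".join(lines)


# Hypothetical dataset: one resource stands for the whole collection of
# triples, and the SPARQL endpoint is the access information.
turtle = dataset_to_turtle(
    "http://example.org/dataset/demo",
    [
        ("dc:title", literal("Demo dataset")),
        ("dc:hasVersion", literal("1.0")),
        ("void:sparqlEndpoint", iri("http://example.org/sparql")),
    ],
)
print(turtle)
```

The sketch sets dc:hasVersion because it is the only property listed with cardinality 1:1; everything else shown is optional.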

Properties

| property | description | cardinality | type |
| --- | --- | --- | --- |
| dc:license | A legal document giving official permission to do something with this dataset. It must be an IRI that resolves to the text of the license. | 0:1 | IRI |
| dc:hasVersion | A textual description of the version of the document; please note the version property in GBOLDataSet. | 1:1 | xsd:string |
| void:uriSpace | This property can be used to state that all entity URIs in a dataset start with a given string. In other words, they share a common “URI namespace”. | 0:1 | IRI |
| void:uriRegexPattern | In cases where a simple string prefix match is insufficient, this property can be used. It expresses a regular expression pattern that matches the URIs of the dataset's entities. | 0:1 | xsd:string |
| wv:waiver | Best practice is to use the URI of a waiver legal document as the value of this property. | 0:1 | xsd:string |
| wv:norms | Norms are not legally binding but represent the general principles or "code of conduct" adopted by a community for access and use of resources. | 0:1 | IRI |
| dc:language | The language of this resource. | 0:N | xsd:string |
| dc:description | A textual description of the dataset. | 0:1 | xsd:string |
| void:uriLookupEndpoint | A protocol endpoint for simple URI lookups for this dataset. | 0:1 | IRI |
| void:sparqlEndpoint | An optional SPARQL endpoint for querying the dataset. | 0:1 | IRI |
| void:dataDump | If available, the RDF dump of the dataset. | 0:1 | IRI |
| rdfs:comment | A comment on the dataset. | 0:1 | xsd:string |
| dc:title | The name of the dataset. | 0:1 | xsd:string |
| rdfs:label | Used to provide a human-readable version of a resource's name. | 0:1 | xsd:string |
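
The void:uriSpace and void:uriRegexPattern properties both describe which entity URIs belong to a dataset: the first by a shared string prefix, the second by a regular expression. A minimal sketch of how a consumer might test a URI against either value is given below. The function name `in_dataset`, the example pattern, and the choice of `re.search` (so a pattern must anchor itself with `^` if it needs to) are assumptions for illustration; this page does not prescribe matching semantics.

```python
import re


def in_dataset(uri, uri_space=None, uri_regex_pattern=None):
    """Return True if `uri` matches the declared URI space or regex pattern.

    Both arguments are optional, mirroring the 0:1 cardinality of
    void:uriSpace and void:uriRegexPattern. With neither given, nothing
    can be concluded, so the function returns False.
    """
    if uri_space is not None and uri.startswith(uri_space):
        return True
    if uri_regex_pattern is not None and re.search(uri_regex_pattern, uri):
        return True
    return False


# Prefix match, using the DBpedia prefix mentioned in the definition.
dbpedia_space = "http://dbpedia.org/resource/"
print(in_dataset("http://dbpedia.org/resource/Berlin", uri_space=dbpedia_space))

# Regex match for a case a single prefix cannot express (hypothetical pattern).
pattern = r"^http://example\.org/(people|places)/"
print(in_dataset("http://example.org/people/42", uri_regex_pattern=pattern))
```

A prefix check is cheaper and easier to reason about, so the regular expression is worth reaching for only when entity URIs do not share one common prefix.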