Skip to content

How to add new datasets?

To add new datasets to the data catalog, an automated request system has been enabled through GitHub Issues and csv's files.

Each of the datasets that you want to add to the data catalog must be attached as a different csv file. The template for this file can be found at the following link: Dataset Template.

Plantilla de archivo csv

The Dataset Template

The dataset template is a file in csv format that can be divided into two parts depending on its content.

The first part collects general information about the dataset:

Description
DATASET-NAME Dataset name.
SUPER-NODE Name of the super node to which it belongs.
TEF-NODE Name of the TEF node to which it belongs.
SITE Name of the site to which it belongs.
DATA-MODEL Link to the data model that complies. For example, the smart data model of firmware https://github.com/smart-data-models.
SAMPLING-TIME Sampling time of the dataset (Realtime, X h/min/seg).
HISTORICAL Historical data availability (From XXX/No).
OWNER Owner of the dataset.
GET-ACCESS Contact information to access the dataset.

The second part collects information about the fields (metadata) of the dataset:

Description
ATTRIBUTE Variable name.
TYPE Type of variable (Integer, Float, String, Boolean, etc.).
UNITS (SI) Units of the variable in the International System.
DESCRIPTION/COMMENTS Description/Explanation of the variable.

Warning

  • If you do not have information to complete a field, DO NOT delete it, LEAVE IT EMPTY.

  • If a field is a web link, copy it in plain text, not as a hyperlink.

Example: A filled template

The following image shows an example of a filled template:

filled_template

Making the request (GitHub Issues)

A GitHub account is required.

To make the request to add new datasets to the data catalog, you must create an issue using the New dataset request template and follow the instructions.

From the data catalog page and clicking on the Add New Datasets button you will directly access to adding a new issue.

Add new dataset

In the Github Issues tab, click Get started in the New dataset request issue.

new issue

This will be the template that you must fill out.

issue template

Filling the issue

  1. Add a Title: Only replace the <site_name> with the name of the site to which the dataset belongs (e.g. Valencia).
  2. Attach files: Select the > Add files here. line and insert the csv by clicking on the paper clip in the top bar. All the files you attach must appear one under the other within this Attached csv files section.

    attach files

  3. Comments: If you have any comments you can add them in the corresponding section.

  4. Submit new issue: Once everything is complete, publish the request.

    submit issue

Reviewing the request

Submitted the request, it will be processed automatically. Notifying in the same issue the start of the process and ending with the closing of the issue and the creation of a pull-request with the requested changes.

issue closed

This pull request will be reviewed by the site administrators and, if everything is correct, it will be accepted and the information will be added to the data catalogue. If not, the user will be notified to make the necessary corrections.