Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider rethinking IDs #46

Open
tkw1536 opened this issue May 27, 2020 · 1 comment
Open

Consider rethinking IDs #46

tkw1536 opened this issue May 27, 2020 · 1 comment
Labels
seminar Raised during the MathData Seminar

Comments

@tkw1536
Copy link
Collaborator

tkw1536 commented May 27, 2020

Suggestion by @florian-rabe:

Every dataset should have

  • an id (ideally a user-chosen short string)
  • a set of properties, whose concatenated string renderings define unique identifiers for the items.

The former should be the primary id of the dataset.
The pair of those two should be the id of each item.
Every property should have an id that is unique within the dataset.

The triple of of dataset, item, and property id should be the id of each datum.

These id's should be used both for internal references across datasets and for citations from the outside.

@tkw1536 tkw1536 added the seminar Raised during the MathData Seminar label May 27, 2020
@tkw1536
Copy link
Collaborator Author

tkw1536 commented May 27, 2020

The current behavior is that each item gets an RFC4122 random UUID. It is guaranteed unique across MathDataHub. These UUIDs are a direct consequence of our table structure and have no meaning attached.

I agree that we might want author-defined local IDs of items within datasets. I am not sure if those should be the primary ID used by the system.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
seminar Raised during the MathData Seminar
Projects
None yet
Development

No branches or pull requests

1 participant