Preparing a dataset in order to publish on an aggregator like Canadensys or GBIF can be quite fastidious, but data cleaning and standardisation are the key points to good quality and useful datasets. Fortunately, a lot of tools and protocols have been develop through time to help preparing biodiversity datasets.
This workshop aims to give curators, collection managers, students, and everyone willing to share their data on GBIF, a set of useful tools and methodologies to go from their in-house database to the publication of data on GBIF.
We will even go further, and discover how to search and use data published on Canadensys, which is now based on the Atlas of Living Australia framework, and on GBIF.
In particular, the workshop will focus on:
- Data cleaning with Open Refine
- How to map data to DarwinCore
- How to prepare a DarwinCore Archive on an IPT
- GBIF tools: Data Validator, Species matching, Datasets Metrics
- How to search and use the Canadensys and GBIF Explorers
- Open Data usage