Processes for validating and improving data quality prior to publication often require either separate tools or manual intervention. Besides consuming extra time and resources, these approaches can be difficult if not impossible when working with large datasets—or publishing data in languages other than English.
In this project, SiB Colombia (in Spanish, the Colombian Biodiversity Information System) will work with the U.S.-based collections collaboration VertNet to translate the interface and documentation for the Darwin Core Data Migrator Toolkit. The collaboration between these two GBIF Participants will result in a version of the tool that fills an important technical gap for the numerous Spanish-speaking staff across the GBIF community.
By generating automatic data quality check and improvement reports on datasets, the Darwin Core Data Migrator Toolkit reflects VertNet’s long-standing experience in developing and automating routines to monitor and improve data quality. The team also hopes that the project can act as the pilot for future cooperation between stakeholders elsewhere around the world interested in procedures for improving biodiversity data quality early and often.