The network that publishes occurrence records through GBIF spans hundreds of publishing institutions worldwide. Data holders manage their content in spreadsheets or databases and use publishing tools to expose those data for querying and access over the internet. The existence of each dataset, together with the technical protocols required to access it, is recorded in the GBIF registry.
Aggregators, such as the global GBIF portal and national GBIF data portals, crawl the registered datasets and build indexes that let users search and access content efficiently across datasets.
This page briefly describes the architecture of the global GBIF portal and the operations it performs when crawling and indexing occurrence data for user search and download.
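To make the flow from publishing to registration, crawling, and indexing concrete, here is a minimal in-memory sketch in Python. It is an illustration only and does not reflect the actual GBIF registry schema, crawler, or index. The names `Endpoint`, `Dataset`, `Registry`, `crawl`, `build_index`, `search`, and `fake_fetch` are hypothetical, as are the protocol labels and example URLs. Network access is replaced by a stub so the example runs on its own.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Endpoint:
    """How a dataset can be accessed (illustrative, not the GBIF registry schema)."""
    protocol: str  # illustrative label only
    url: str


@dataclass
class Dataset:
    key: str
    title: str
    endpoints: list = field(default_factory=list)


class Registry:
    """Records which datasets exist and how to reach them."""

    def __init__(self):
        self._datasets = {}

    def register(self, dataset):
        self._datasets[dataset.key] = dataset

    def datasets(self):
        return list(self._datasets.values())


def crawl(registry, fetch):
    """Yield (dataset_key, record) for every record behind a registered endpoint.

    `fetch` is any callable that takes an Endpoint and returns a list of
    record dicts; a real crawler would speak the endpoint's protocol here.
    """
    for dataset in registry.datasets():
        for endpoint in dataset.endpoints:
            for record in fetch(endpoint):
                yield dataset.key, record


def build_index(crawled):
    """Group crawled records by lower-cased scientific name."""
    index = defaultdict(list)
    for dataset_key, record in crawled:
        index[record["scientificName"].lower()].append((dataset_key, record))
    return index


def search(index, scientific_name):
    """Return all (dataset_key, record) pairs matching the name, across datasets."""
    return index.get(scientific_name.lower(), [])


# Stub standing in for remote data sources (assumed example data).
FAKE_SOURCES = {
    "https://example.org/herbarium.zip": [
        {"scientificName": "Quercus robur", "country": "DK"},
    ],
    "https://example.org/survey": [
        {"scientificName": "Puma concolor", "country": "US"},
        {"scientificName": "Quercus robur", "country": "SE"},
    ],
}


def fake_fetch(endpoint):
    return FAKE_SOURCES[endpoint.url]


registry = Registry()
registry.register(Dataset("herbarium", "Example herbarium",
                          [Endpoint("archive", "https://example.org/herbarium.zip")]))
registry.register(Dataset("survey", "Example field survey",
                          [Endpoint("xml-query", "https://example.org/survey")]))

index = build_index(crawl(registry, fake_fetch))
hits = search(index, "Quercus robur")
print([dataset_key for dataset_key, _ in hits])
```

The point of the sketch is the separation of concerns: the registry only knows where data live, the crawler only retrieves records, and the index is what makes a single search span many datasets.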