Data papers

Peer-reviewed data papers describing biodiversity datasets published through the GBIF network.

82 publications for Data papers

  • Mauri A, Strona G, San-Miguel-Ayanz J (2017)

    EU-Forest, a high-resolution tree occurrence dataset for Europe

    Scientific Data 4 160123.

    We present EU-Forest, a dataset that integrates and extends by almost one order of magnitude the publicly available information on European tree species distribution. The core of our dataset (~96% of the occurrence records) came from an unpublished, large database harmonising forest plot surveys from National Forest Inventories on an INSPIRE-compliant 1 km×1 km grid. These new data can potentially benefit several disciplines, including forestry, biodiversity conservation, palaeoecology, plant ecology, the bioeconomy, and pest management.


  • Nualart N, Ibáñez N, Luque P, Pedrol J, Vilar L, Guàrdia R (2017)

    Dataset of herbarium specimens of threatened vascular plants in Catalonia

    PhytoKeys 77 41-62.

    This data paper describes a dataset of specimens of threatened Catalonian vascular plants conserved in five public Catalonian herbaria (BC, BCN, HGI, HBIL and MTTE). Catalonia is an administrative region of Spain with a high diversity of autochthonous plants, including 199 taxa in IUCN threatened categories (EX, EW, RE, CR, EN and VU). This dataset includes 1,618 records collected from the 17th century to the present. For each specimen, the species name, locality indication, collection date, collector, ecology and revision label are recorded. More than 94% of the taxa are represented in the herbaria...

    Keywords: Catalonia, Cormophyta, conservation, herbarium, northeastern Iberian Peninsula, specimen, threatened flora


  • Philippe A, Plumejeaud-Perreau C, Jourde J, Pineau P, Lachaussée N, Joyeux E et al. (2017)

    Building a database for long-term monitoring of benthic macrofauna in the Pertuis-Charentais (2004-2014)

    Biodiversity Data Journal 5 e10288.

    Background: Long-term benthic monitoring is rewarding in terms of science, but labour-intensive, whether in the field, the laboratory, or behind the computer. Building and managing databases require multiple skills, including consistency over time as well as organisation via a systematic approach. Here, we introduce and share our spatially explicit benthic database, comprising 11 years of benthic data. It is the result of intensive benthic sampling that has been conducted on a regular grid (259 stations) covering the intertidal mudflats of the Pertuis-Charentais (Marennes-Oléron Bay and...

    Keywords: Intertidal mudflats, Pertuis-Charentais, annelids, benthic macrofauna, database management, molluscs, monitoring, shorebirds


  • Alexander N, Massei G, Wint W (2016)

    The European Distribution of Sus Scrofa. Model Outputs from the Project Described within the Poster – Where are All the Boars? An Attempt to Gain a Continental Perspective

    Open Health Data 4(1).

    Wild boar is a host of a number of arthropod-vectored diseases and its numbers are on the rise in mainland Europe. The species potentially impacts ecosystems, humans and farming practices and so its distribution is of interest to policy makers in a number of fields beyond that of the primarily epidemiological goal of this study. Three statistical model outputs describing the distribution and abundance of the species Sus scrofa (Wild boar) are included in this data package. The extent of this dataset covers continental Europe. These data were presented as a poster [1] at the conference Genes,...

    Keywords: Abundance, Distribution, Europe, Random Forest, Statistical modelling, Sus scrofa


  • De Grave S (2016)

    Global Caridean Shrimp Fauna

    Freshwater Metadata Journal 1-5.

    The dataset consists of all geographical records of freshwater shrimps, prior to 2013. Records have various levels of accuracy.

    Keywords: Atyidae, Caridea, Decapoda, Palaemonidae, distribution


  • Escribano N, Galicia D, Ariño A, Escala C (2016)

    Long-term data set of small mammals from owl pellets in the Atlantic-Mediterranean transition area

    Scientific Data 3 160085.

    We describe the pellet sampling data set from the Vertebrate Collection of the Museum of Zoology of the University of Navarra. This data set compiles all information about small mammals obtained from the analysis of owl pellets. The collection consists of skulls, mandibles, and some skeletons of 36 species of more than 72,000 georeferenced specimens. These specimens come from the Iberian Peninsula, although most samples were collected in Navarra, a highly diverse transitional area of 10,000 square kilometres sitting across three biogeographical regions. The collection spans more than forty...


  • Escribano N, Galicia D, Ariño A, Escala C (2016)

    Mammals in the MZNA Vertebrate Collection of University of Navarra, Spain

    ZooKeys 634 137-150.

    In this paper five datasets are described that provide information about records of mammals in the Vertebrate Collection of the Museum of Zoology of the University of Navarra (MZNA-VERT). The datasets contain 3,466 records belonging to 20 species of mammals sampled across the transition zone between the Atlantic and Mediterranean biogeographical regions (north Iberian Peninsula). The datasets include both distributional data (georeferenced records) and basic biometric data of most of the vouchered specimens stored in the museum facilities. The samples originated mainly within research...

    Keywords: Iberian Peninsula, Mammals, biogeographical regions, biometry, occurrence, specimens


  • Estrada-Peña A, de la Fuente J, Wisz M, Estrada-Peña A, Fuente J, Meller L et al. (2016)

    Species interactions in occurrence data for a community of tick-transmitted pathogens

    Scientific Data 3 160056.

    Interactions between tick species, their realized range of hosts, the pathogens they carry and transmit, and the geographic distribution of species in the Western Palearctic were determined based on evidence published between 1970–2014. These relationships were linked to remotely sensed features of temperature and vegetation and used to extract the network of interactions among the organisms. The resulting datasets focused on niche overlap among ticks and hosts, species interactions, and the fraction of the environmental niche in which tick-borne pathogens may circulate as a result of...


  • Dauby G, Zaiss R, Blach-Overgaard A, Catarino L, Damen T, Deblauwe V et al. (2016)

    RAINBIO: a mega-database of tropical African vascular plants distributions

    PhytoKeys 74 1-18.

    The tropical vegetation of Africa is characterized by high levels of species diversity but is undergoing important shifts in response to ongoing climate change and increasing anthropogenic pressures. Although our knowledge of plant species distribution patterns in the African tropics has been improving over the years, it remains limited. Here we present RAINBIO, a unique comprehensive mega-database of georeferenced records for vascular plants in continental tropical Africa. The geographic focus of the database is the region south of the Sahel and north of Southern Africa, and the majority of...

    Keywords: Herbarium specimens, biodiversity assessment, cultivated species, digitization, georeferencing, habit, native species, taxonomic backbone, tropical forests


  • Kano Y, Musikasinthorn P, Iwata A, Tun S, Yun L, Win S et al. (2016)

    A dataset of fishes in and around Inle Lake, an ancient lake of Myanmar, with DNA barcoding, photo images and CT/3D models

    Biodiversity Data Journal 4 e10539.

    Background: Inle (Inlay) Lake, an ancient lake of Southeast Asia, is located at the eastern part of Myanmar, surrounded by the Shan Mountains. Detailed information on fish fauna in and around the lake has long been unknown, although its outstanding endemism was reported a century ago. New information: Based on the fish specimens collected from markets, rivers, swamps, ponds and ditches around Inle Lake as well as from the lake itself from 2014 to 2016, we recorded a total of 948 occurrence data (2120 individuals), belonging to 10 orders, 19 families, 39 genera and 49 species. Amongst them, 13...

    Keywords: 3D model, COI, CT scan, GBIF, Inle Lake, Myanmar, Shan State, alien, endemic species, freshwater fishes, mitochondrial DNA