Mediated Machine Vision - About

Flammulina P.Karst. Observed in Sweden. Photo by Thomas Stjernegaard Jeppesen via Danish Mycological Society, fungal records database.

This demo uses images from the Danish Mycological Society's SvampeAtlas to model how the GBIF community can help improve species identification suggestions and mediate machine vision training datasets for AI modellers.

Machine vision technology can already suggest identifications for images of tens of thousands of species across a wide range of taxonomic groups. iNaturalist, for example, suggests species IDs to users in real time as they create observation records. Rapid and ongoing advances in the training of artificial intelligence (AI) will soon make it commonplace to detect species in video feeds or to use the camera in a mobile device to search for species-related content on the Internet.

The GBIF network has an important role to play in advancing and improving AI with respect to biodiversity data, cross-disciplinary collaboration and citation practice—not least because the GBIF infrastructure holds tens of millions of records associated with one or more images of labelled species, one of the largest datasets of its kind in the world.

As a community, GBIF has implemented not only key technical practices around data standards, but also important social and cultural improvements, including the adoption of open licences, guidance on data citation, and the development of a DOI-based system for tracking reuse of data. GBIF is now applying these lessons alongside an expert team, assisting research that increases the power and availability of machine vision while seeking to improve understanding and accepted practice regarding the use of GBIF-mediated data in training for AI.

Training datasets are critical to achieving species recognition capability in any machine vision system. These datasets compile representative images containing explicit, verifiable identifications of the species they include. High-powered computers run algorithms to analyse the imagery, building complex models that characterize the defining features for each species or taxonomic group. Researchers can then apply the models to new images, offering predictions as to which species or group they likely contain.

Current research in machine vision is exploring:

  • the use of location and date information to further improve model results
  • identification methods beyond species-level into attribute, character, trait, or part-level ID, with an eye toward human interpretability
  • expertise modeling for improved determination of “research grade” images and metadata
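
As a rough illustration of the first point above, one simple way to use location and date information is to weight a model's per-species scores by a prior that reflects how likely each species is at the place and time of the observation. The sketch below is only an assumption about how such a combination could look, not a method described by the project: the function name `rerank_with_prior`, the `floor` parameter, and all the numbers are hypothetical.

```python
def rerank_with_prior(vision_scores, prior, floor=1e-6):
    """Weight image-based scores by a location/date prior and renormalise.

    vision_scores: dict mapping species name -> score from an image model.
    prior: dict mapping species name -> prior probability for the
           observation's place and date (hypothetical values here).
    floor: small value used for species missing from the prior, so that
           an unexpected species is down-weighted rather than ruled out.
    """
    weighted = {
        species: score * max(prior.get(species, 0.0), floor)
        for species, score in vision_scores.items()
    }
    total = sum(weighted.values())
    if total == 0:
        raise ValueError("all weighted scores are zero")
    return {species: value / total for species, value in weighted.items()}


# Illustrative numbers only: the image alone cannot separate the two
# species, but the (made-up) prior favours one of them.
vision_scores = {"Flammulina velutipes": 0.5, "Galerina marginata": 0.5}
prior = {"Flammulina velutipes": 0.8, "Galerina marginata": 0.2}
reranked = rerank_with_prior(vision_scores, prior)
```

With these inputs the tie between the two candidates is broken in favour of the species the prior considers more likely, while the scores still sum to one.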

We propose the following approach:

  1. To assist in developing and refining machine vision models, GBIF will provide AI training datasets and ensure that licensing and citation practices are respected. Each of these datasets receives a DOI, and all contributing datasets will receive credit and attribution for their use through GBIF's DOI citation system.

  2. To assist application developers, Google and Visipedia will build openly licensed models and publish tutorials on how to adapt them for local use.

  3. Together, the project partners will strive to ensure responsible and transparent use of AI training datasets to close the gap between machine vision scientists and application developers. We will also prepare taxonomic trees that clarify the confidence with which machine vision models can identify a taxon rank based on an image’s visual characteristics.
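
The idea of a taxonomic tree that expresses identification confidence by rank can be sketched as follows. This is a minimal illustration under stated assumptions, not the project's actual method: species-level probabilities are summed up the tree, and the most specific taxon whose total reaches a threshold is reported. The function name, the threshold, the simplified lineages (kingdom, order, genus, species), and the probabilities are all hypothetical.

```python
from collections import defaultdict


def most_specific_confident_taxon(species_probs, lineage, threshold=0.8):
    """Return the deepest taxon whose summed probability meets the threshold.

    species_probs: dict mapping species name -> model probability.
    lineage: dict mapping species name -> list of taxa ordered from most
             general to most specific, ending with the species itself.
    Returns None when no taxon reaches the threshold.
    """
    totals = defaultdict(float)
    depth = {}
    for species, prob in species_probs.items():
        for level, taxon in enumerate(lineage[species]):
            totals[taxon] += prob   # probability mass accumulates upward
            depth[taxon] = level    # larger level means a more specific rank
    confident = [taxon for taxon, total in totals.items() if total >= threshold]
    if not confident:
        return None
    # Prefer the most specific rank; break ties by higher total probability.
    return max(confident, key=lambda taxon: (depth[taxon], totals[taxon]))


# Simplified, illustrative lineages and made-up probabilities.
lineage = {
    "Flammulina velutipes": ["Fungi", "Agaricales", "Flammulina", "Flammulina velutipes"],
    "Flammulina elastica": ["Fungi", "Agaricales", "Flammulina", "Flammulina elastica"],
    "Galerina marginata": ["Fungi", "Agaricales", "Galerina", "Galerina marginata"],
}
species_probs = {
    "Flammulina velutipes": 0.55,
    "Flammulina elastica": 0.35,
    "Galerina marginata": 0.10,
}
best = most_specific_confident_taxon(species_probs, lineage, threshold=0.8)
```

In this example no single species reaches the threshold, but the two Flammulina species together do, so the sketch falls back to a genus-level identification rather than guessing a species.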
