This is a test site. The production site with full data is available at GBIF.org
{{nav.loginGreeting}}
  • 获取数据
      • 发生记录
      • GBIF API
      • 物种
      • 数据集
      • 发生记录快照
      • 托管门户
      • 趋势
  • How-to
    • 共享数据

      • 快速入门指南
      • 数据集类别
      • 数据托管
      • 标准
      • 成为发布者
      • 数据质量
      • 数据论文
    • Use data

      • 精选的数据使用
      • 引用指南
      • GBIF citations
      • Citation widget
      • Guides and documentation
  • 工具
    • 发布

      • IPT
      • 数据验证器
      • GeoPick
      • New data model
      • GRSciColl
      • 建议一个数据集
      • Metabarcoding data toolkit
    • 数据获取和使用

      • 托管门户
      • Scientific collections
      • 数据处理
      • Derived datasets
      • rgbif
      • pygbif
      • SQL下载
      • 工具目录
    • GBIF实验室

      • 物种匹配
      • 名称解析器
      • 序列ID
      • 相对观测趋势
      • GBIF数据博客
  • Community
    • Network

      • 参与者网络
      • 节点
      • 发布者
      • Network contacts
      • 社群论坛
      • 生物多样性知识联盟
    • 志愿者

      • 指导
      • 大使
      • 翻译人员
      • 公民科学
    • Activities

      • Capacity development
      • 方案和项目
      • 培训和学习资源
      • 数据使用俱乐部
      • 生命图集
  • 关于
    • GBIF机构内部

      • GBIF是什么机构?
      • 成为会员
      • 管理
      • Strategic framework
      • 工作方案
      • 资助机构
      • 合作伙伴
      • 发行说明
      • 联系信息
    • 新闻与宣传

      • 新闻
      • 通讯和列表
      • 活动
      • 奖项
      • 科学评论
      • 数据使用
      • Thematic communities
  • User profile

Mediated Machine Vision - About

Flammulina P.Karst.
Flammulina P.Karst. Observed in Sweden. Photo by Thomas Stjernegaard Jeppesen via Danish Mycological Society, fungal records database.

This demo showcases the use of images from the Danish Mycological Society's SvampeAtlas and models how the GBIF community can serve as a mechanism for improvements in suggesting species identification and mediating machine vision training datasets to AI modellers.

Machine vision technology can already provide suggestions for identifying images for tens of thousands of species across a wide range of taxonomic groups—witness iNaturalist, which suggests species IDs to users in real-time as they create observation records. Rapid and ongoing advances in training of artificial intelligence (AI) will soon make the detection of species in video feeds or the use the camera in a mobile device to search for species-related content on the Internet commonplace.

The GBIF network has an important role to play in advancing and improving AI with respect to biodiversity data, cross-disciplinary collaboration and citation practice—not least because the GBIF infrastructure holds tens of millions of records associated with one or more images of labelled species, one of the largest datasets of its kind in the world.

As a community, GBIF has implemented not only key technical practices around data standards, but also important social and cultural improvements, including the adoption of open licences, guidance on data citation, and the development of a DOI-based system for tracking reuse of data. Currently applying these lessons alongside an expert team, GBIF is assisting research to increase machine vision's power and availability while seeking to improve understanding and accepted practice regarding the use of GBIF-mediated data in training for AI.

Training datasets are critical to achieving species recognition capability in any machine vision system. These datasets compile representative images containing explicit, verifiable identifications of the species they include. High-powered computers run algorithms to analyse the imagery, building complex models that characterize the defining features for each species or taxonomic group. Researchers can then apply the models to new images, offering predictions as to which species or group they likely contain.

Current research in machine vision is exploring:

  • the use of location and date information to further improve model results
  • identification methods beyond species-level into attribute, character, trait, or part-level ID, with an eye toward human interpretability
  • expertise modeling for improved determination of “research grade” images and metadata

We propose the following approach:

  1. To assist in developing and refining machine vision models, GBIF will provide AI training datasets and ensure that licensing and citation practices are respected. Each of these datasets receive a DOI, and all of contributing datasets will receive credit and attribution of uses through GBIF's DOI citation system.

  2. To assist application developers, Google and Visipedia will build openly-licensed models and publish tutorials on how to adapt them for local use.

  3. Together, the project partners will strive to ensure responsible and transparent use of AI training datasets to close the gap between machine vision scientists, application developers. We will also prepare taxonomic trees that clarify and clarify the confidence with which machine vision models can identify a taxon rank based on an image’s visual characteristics.

GBIF是什么? API 常见问题解答 电子通讯 隐私政策 使用条款与协议 引用 行为准则 致谢
联系我们 GBIF Secretariat Universitetsparken 15 DK-2100 Copenhagen Ø Denmark
GBIF is a Global Core Biodata Resource