We’re sorry, but GBIF doesn’t work properly without JavaScript enabled.
Our website has detected that you are using an outdated insecure browser that will prevent you from using the site. We suggest you upgrade to a modern browser.
{{nav.loginGreeting}}
  • Get data
      • Occurrences
      • GBIF API
      • Species
      • Datasets
      • Trends
  • How-to
    • Share data

      • Quick-start guide
      • Dataset classes
      • Data hosting
      • Standards
      • Become a publisher
      • Data quality
      • Data papers
    • Use data

      • Featured data use
      • Citation guidelines
      • GBIF citations
      • Citation widget
  • Tools
    • Publishing

      • IPT
      • Data validator
      • Scientific Collections
      • Suggest a dataset
    • Users

      • Data processing
      • Derived datasets
      • rgbif
      • MAXENT
      • Tools catalogue
    • GBIF labs

      • Species matching
      • Name parser
      • Sequence ID
      • Relative observation trends
      • GBIF data blog
  • Community
    • Network

      • Participant network
      • Nodes
      • Publishers
      • Network contacts
      • Community forum
      • alliance for biodiversity knowledge
    • Volunteers

      • Mentors
      • Ambassadors
      • Translators
      • Citizen scientists
    • Activities

      • Capacity enhancement
      • Programmes & projects
      • Training and learning resources
      • Data Use Club
      • Living Atlases
  • About
    • Inside GBIF

      • What is GBIF?
      • Become a member
      • Governance
      • Funders
      • Partnerships
      • Release notes
      • Implementation plan
      • Contacts
    • News & outreach

      • News
      • Newsletters and lists
      • Events
      • Ebbe Nielsen Challenge
      • Young Researchers Award
      • Science Review
  • User profile

Global Register of Introduced and Invasive Species - United States (Contiguous)

Dataset homepage

Citation

Simpson A, Sellers E, Pagad S (2022). Global Register of Introduced and Invasive Species - United States (Contiguous). Version 1.4. Invasive Species Specialist Group ISSG. Checklist dataset https://doi.org/10.5066/p95xl09q accessed via GBIF.org on 2022-05-23.

Description

This is version 2.1 of the dataset published to GBIF by the Invasive Species Specialist Group (ISSG) on behalf of the U.S. Geological Survey on October 12, 2020, at https://www.gbif.org/dataset/6b64ef7e-82f7-47a3-8ddb-ec6794ea07d6. Like that checklist, V2.1 presents validated and verified national checklists of introduced (alien) and invasive alien species at the sub-country level. The other two related checklists for the United States, published separately as V1.0, are for the States of Alaska and Hawaii.

Differences between V1.0 and V2.1 (this dataset): SIZE: V1.0 - 5,006 accepted names; V2.1 - 8,654 accepted names and two unranked hybrids. FIELDS: V1.0: 14 fields - 9 in Taxon Core, 4 in Species Distribution Extension, 1 in Species Profile Extension; V2.1: 20 fields in Taxon Core, 39 in Occurrence Extension, 9 in Species Distribution Extension, 8 in Vernacular Names Extension, 8 in Literature Reference Extension, and 6 in Species Profile Extension. OTHER DIFFERENCES: V2.1 provides: a broader inclusion of arthropods; approximate dates of introduction (where available); 4,693 references; improved disambiguation of scientific names; biocontrol species information (where applicable); taxonomic synonyms, where available, in taxonRemarks field; unique occurrenceIDs; no habitat information.

OVERVIEW: Introduced (non-native) species that becomes established may eventually become invasive, so tracking introduced species provides a baseline for effective modeling of species trends and interactions, geospatially and temporally. The umbrella dataset, called United States Register of Introduced and Invasive Species (US-RIIS), is comprised of three lists, one each for Alaska (AK, with 532 records), Hawaii (HI, with 6,075 records), and the conterminous (contiguous) 48 United States (L48, or Lower 48), (this dataset, with 8,656 records). Each list includes introduced (non-native), established (reproducing) taxa that: are, or may become, invasive (harmful) in the locality; are not known to be harmful there; and/or have been used for biological control in the locality.

To be included in the Global Register of Introduced and Invasive Species - United States (Contiguous), or GRIIS-L48 (with L48 meaning the Lower 48 Conterminous United States), a taxon must be non-native everywhere in the locality and established (reproducing) anywhere in the locality. Native pest species are not included.

Each record has information on taxonomy, dates of introduction (where available; currently for 40%), invasion status (invasive or introduced), and citations for the authoritative sources from which this information is drawn. The umbrella dataset US-RIIS builds on a previous dataset, A Comprehensive List of Non-Native Species Established in Three Major Regions of the U.S.: Version 3.0 (Simpson et al., 2020, https://doi.org/10.5066/p9e5k160). There are 15,264 records in the umbrella list (US-RIIS) and 12,981 unique scientific names. The US-RIIS is derived from 5,951 authoritative sources, was reviewed by or based on input from 30 invasive species scientists, and continues to be updated. Publication of version 2.0 (USGS' versioning) of the US-RIIS is anticipated (but not guaranteed) in approximately 12 months.

ACKNOWLEDGEMENTS: Many thanks to these additional Reviewers/Contributors: Alexander Salazar, Miami University, Ohio; Alma Solis, Smithsonian Institution; Andrew P. Landsman, National Park Service; Bethany Bradley, University of Massachusetts, Amherst; Bruce Cutler, University of Kansas; Cayla Morningstar, USGS-NAS; Chris Taliga, USDA-ARS PLANTS database; Connor Davidson Crouch, Northern Arizona University; Danielle Froelich, SWCA Environmental Consultants; Darrell Ubick, Cal Academy; Faith Campbell, Center for Invasive Species Prevention; Gerry Moore, USDA-ARS PLANTS database; Matt Bowser, US Fish and Wildlife Service; Matt Neilson, USGS-NAS; Michael Gates, USDA-ARS; Nancy Khan, Smithsonian Institution; Rachel Neville, Invasive Species Consultant; Roy Van Driesche, University of Massachusetts-Amherst; Shyama Pagad, IUCN Invasive Species Specialists Group; Terry L. Whitworth, Washington State University; Thomas Henry, USDA-ARS; Vickie Brewster, USDA-APHIS; Warren Wagner, Smithsonian Institution. Our sincere apologies to the many contributors whom we may not have mentioned. We appreciate your assistance improving the quality of this dataset.

Purpose

We anticipate the GRIIS-L48 will be used to 1) refine species lists for horizon scanning (established species should not be included), 2) create introduced species lists for smaller areas within the conterminous U.S., and 3) tag species occurrence data records with an introduced status where the species may not have been initially reported as introduced. The GRIIS-L48 also contributes to the Global Register of Introduced and Invasive Species (http://griis.org/about.php).

Sampling Description

Study Extent

Temporal: data collection began in 2012 and is ongoing. Publication has been annual since 2018.

Sampling

Methods used to create the integrated dataset have varied somewhat over time. Additional fields have been added from those that were used at the start of the project. Initially, each region (AK, HI, L48) had a lead data searcher and the lists were combined afterwards within one Excel spreadsheet. The interpretation of the content of the citation fields follow the rules established by ITIS and described here: http://www.itis.gov/submit_guidlines.html. Each lead data searcher was trained in the methods to discern an authoritative source; how to extract scientific names from the source; how to generate from the source and other reference databases such as ITIS all of the information in the spreadsheet columns. There are two spreadsheets to populate, plus a data dictionary sheet. The United States (Contiguous) Checklist dataset shared with GBIF contains these fields: locality - Required; from Darwin Core; value = "Conterminous 48 United States". scientificName - Required; modified from Darwin Core; an accepted species, subspecies, variety, form, or hybrid name, preferably validated in ITIS, or secondarily, in GBIF, or otherwise by another taxonomic authority; Genus names alone are not allowed; does not include scientificNameAuthorship. scientificNameAuthorship - Optional; from Darwin Core; as provided by the taxonomic authority or the reference. vernacularName - Required; from Darwin Core; generally provided by ITIS, the authority, or another reliable source; may be a very generic term where specific common names are not in use. taxonRank - Required; from Darwin Core; as provided by the taxonomic authority or derived by data searcher. establishmentMeans - Required; from Darwin Core; controlled vocabulary: introduced (alien, exotic, non-native, nonindigenous), introduced: assisted colonization, native (indigenous), native: reintroduced, uncertain (unknown, cryptogenic), vagrant (casual). degreeOfEstablishment - Required; from Darwin Core; an assertion provided by or derived from the authority describing the established or invasive status of the taxa within the region; controlled vocabulary (smaller for this dataset than from Darwin Core, due to the nature of the dataset): established (category C3), meaning reproducing in the locality; invasive (category D2), meaning causing, or likely to cause, harm; widespread invasive (category E), meaning at least locally abundant and harmful ecologically, economically, or to health. occurrenceStatus - Required; from Darwin Core; value = "present". associatedTaxa - Optional; from Darwin Core; only contains content if a biocontrol species. Consists of "biocontrol for: " followed by a text description of the name of the organism(s) that the species was introduced to control. eventRemarks - Optional; from Darwin Core; consists of "approx. date of introduction: " and a date or text string describing when the taxon is believed to have been introduced to the locality. Also includes brief (Author (YYYY)) value if the associatedReferences authority is not the authority providing the date of introduction. taxonRemarks - Optional; from Darwin Core; only included if the scientific name has many synonyms or misspellings, or, if the name provided by the authority is an unaccepted synonym of a valid/accepted name in a taxonomic authority; Multiple names are separated by a semicolon and a space. kingdom - Required; from Darwin Core; one of seven names as specified in ITIS or another naming authority; controlled vocabulary: Animalia, Bacteria, Chromista, Fungi, Plantae, Protozoa, Virus. phylum - Required; from Darwin Core; as provided by the taxonomic authority; occasionally = "undefined". class - Required; from Darwin Core; as provided by the taxonomic authority; occasionally = "undefined". order - Required; from Darwin Core; as provided by the taxonomic authority; occasionally = "undefined". family - Required; from Darwin Core; as provided by the taxonomic authority; occasionally = "undefined". taxonomicStatus - Required; from Darwin Core; value = "Accepted " followed by the acronym of the taxonomic authority providing the taxonID. taxonID - Required; from Darwin Core; a unique number for the taxa, as provided by the taxonomic authority. Where possible, a URN. Otherwise a stable URL. associatedReferences - Required; from Darwin Core; the abbreviated name for the author of the resource asserting the taxa's non-native species status. Generally, but not always, followed by (year), followed by a URL back to an online version of the reference, or by a full reference. eventDate - Required; from Darwin Core; the date of creation of the record, in the format YYYY-MM-DD; manually generated so errors are possible. NOTE: if earliest occurrence for the species is known or suspected, it is provided in the eventRemarks field. modified - Required; from Darwin Core; date of latest edit/change made to the record; is manually generated, so errors are possible. occurrenceRemarks - Optional; from Darwin Core; text field with comments about the record, generally extracted from the text of the associatedReference or other references. occurrenceID - Required; from Darwin Core; an identifier for the Organism instance (at the record level); a controlled vocabulary that is manually generated using an ordinal number that is not repeated, prefixed by the umbrella dataset's abbreviated name (USRIIS), the record's locality (L48), and the record's kingdom; since the content of this field is not auto-generated, it is subject to human error. Deleted occurrenceIDs are not reused.

Quality Control

Incoming data is error checked. Scientific names are validated against the Integrated Taxonomic Information System (ITIS), then the Global Biodiversity Information Facility (GBIF), and other taxonomic authorities when necessary. Other data fields are assumed correct. The validity of the assertion of non-native status itself may subsequently be questioned and the entry deleted or moved to a watch list if: the scientific name is not in ITIS and not reaffirmed by other authoritative taxonomic sources; other authoritative sources state the species is not present or is native or cryptic; irreconcilable typographical errors in scientific names are found; entries are found to be duplicate or synonomous taxonomic names; nominate subspecies names are removed if they are the only subspecies in the list and the parent binomial is already in the regional list.

Method steps

  1. Tools used are described in the following steps:
  2. Microsoft Word to assist in formatting data.
  3. Microsoft Excel to hold the data and sort and display it.
  4. Integrated Taxonomic Information System (ITIS) to validate taxonomic names.
  5. Global Biodiversity Information Facility to validate taxonomic names not found in ITIS.
  6. Microsoft OneDrive to store the data.
  7. Adobe Pro to extract text from image-based pdfs.
  8. Atlassian Jira to track update and review tasks.
  9. Sublime Text to assist with text formatting and metadata eml generation.
  10. Various browsers (MS Edge, Google Chrome, Mozilla Firefox, Apple Safari), species databases, and search engines, to discover authoritative introduced species lists for AK, HI, and L48.
  11. PROVENANCE: Our team tracks the provenance of the dataset as any modifications, transformations, edits, or decisions to accept or reject data points are made. Provenance is essential to the dataset, which consists entirely of assertions of the presence and non-native status of a species in an area.
  12. This tracking involves the inclusion of an authority or data source with each assertion that a species is introduced and established in a given area. The actual authorities' lists and manuscripts themselves are also saved digitally as written proof of the individual assertions and to justify, as necessary, each species' inclusion in the list.
  13. To assist with tracking the evolution of a data record, there are "Acquisition Date" and "modified" fields (which are not auto-generated and so subject to human error). The "modified" field is a Darwin Core term for the date of the last modification of the record (YYYY-MM-DD), and the cumulative modifications to the record are described in the "Update Remarks" field, separated within the field by commas (for modifications made on the same date) and semicolons (separating modifications made on different dates).
  14. The dataset and the reference authorities are maintained in Microsoft OneDrive. The list is worked on in MS Excel and the file's history can be used to 'roll back' to previous versions, if necessary. A versionHistory.txt file published online along with the data describes major changes and differences in each published version of the list. Past versions of the precursor to this dataset are maintained and made available as part of the deprecated .zip files at: https://doi.org/10.5066/P9E5K160.
  15. ANALYZE CRITERIA: Assertions of introduced and established status for a species are accepted if the authority is a trusted source, such as a governmental or government-affiliated, non-governmental organization, international biodiversity organization, academic institution, or a taxonomic expert. If a list is found on the Web and judged as authoritative, it is downloaded for future reference. Authorities, in addition to being cited within each record, are accumulated in an AuthorityReferences spreadsheet. These criteria are part of our documentation. When there is disagreement among experts about establishment, taxonomy, native status, and approximate dates of introduction a wider search of available information is used to determine the best resolution. Cryptic species (of unknown origin) are not included in the GRIIS-L48. Taxa with conflicting status or taxonomy among different sources, that cannot be reconciled through further research, are moved to an informal watch list (available upon request).

Additional info

During the data cleaning of this species checklist (that can be downloaded as part of the dataaset at https://doi.org/10.5066/P95XL09Q): 1) 'Other Names' has been mapped to 'taxonRemarks' with the preface 'Other names: ' 2) 'locality' values were translated during data cleaning: L48 = Conterminous (or Contiguous) 48 United States 3) 'establishmentMeans' values were generated and derived as either 'introduced (alien, exotic, non-native, nonindigenous)' if not a biocontrol species or 'introduced: assisted colonization' if there was an 'associatedTaxa' value present 4) 'Introduced or Invasive' was mapped to 'degreeOfEstablishment' and to 'isInvasive' 5) 'isHybrid' values were calculated from 'scientificName' and 'vernacularName' values during data cleaning 6) 'associatedReferences' values were generated during data cleaning, by merging 'Authority' and 'associatedReferences'. If not available online, a full reference was extracted from the MasterListOfAuthorities. 7) 'taxonID' was generated during data cleaning, from the authority listed in 'taxonomicStatus' 8) 'associatedTaxa' values, where provided, were prefixed during data cleaning with 'Biocontrol species of: ' 9) 'eventRemarks' was generated from 'Approximate Introduction Date' with the prefix 'Approx. date of introduction: ' added during data cleaning. 10) 'eventDate' reflects the date the record was first added to the GRIIS-L48. NOTE: if earliest occurrence for the species is known or suspected, it is provided in the eventRemarks field. 11) Content for these fields (where appropriate) was globally added during data cleaning: administrativeArea, basisOfRecord, continent, country, countryCode, datasetID, datasetName, dcterms:bibliographicCitation, dcterms:language, dcterms:license, dcterms:references, dcterms:rightsHolder, dcterms:source, dcterms:subject, dcterms:temporal, dcterms:type, establishmentMeans, higherGeography, institutionCode, occurrenceStatus.

Taxonomic Coverages

All taxa are included, from Kingdoms Animalia, Bacteria, Chromista, Fungi, Plantae, Protozoa, and Virus. In the umbrella dataset, there are 12,981 unique scientific names, mostly taxon rank species, but also subspecies, variety, form, and hybrid. There are 8,656 names in the Conterminous 48 United States (GRIIS-L48) dataset. Just two Classes comprise 88% of the GRIIS-L48: 46% Magnoliopsida (dicotyledonous plants) and 42% Insecta (insects).
  1. Animalia
    common name: Animals rank: kingdom
  2. Bacteria
    common name: Bacteria rank: kingdom
  3. Chromista
    common name: Chromists rank: kingdom
  4. Fungi
    common name: Mushrooms rank: kingdom
  5. Plantae
    common name: Plants rank: kingdom
  6. Protozoa
    common name: Protists rank: kingdom
  7. Virus
    common name: Viruses rank: kingdom
  8. Magnoliopsida
    common name: Dicots rank: class
  9. Insecta
    common name: Insects rank: class

Geographic Coverages

The Conterminous 48 United States, also known as the Contiguous 48 United States.

Bibliographic Citations

Contacts

Annie Simpson
originator
position: Biologist and Information Scientist
United States Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4281
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0001-8338-5134
Elizabeth Sellers
originator
position: Data Manager
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4385
email: esellers@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0003-4676-2994
Shyama Pagad
originator
position: Deputy Chair, Information
IUCN Invasive Species Specialists Group
Department of Biological Sciences, University of Auckland
Auckland
1072
Auckland
NZ
Telephone: +64 210754381
email: s.pagad@auckland.ac.nz
homepage: http://www.issg.org
userId: http://orcid.org/0000-0002-5225-6580
Annie Simpson
metadata author
position: Biologist and Information Scientist
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4281
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0001-8338-5134
Annie Simpson
author
position: Biologist and Information Scientist
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4281
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0001-8338-5134
Elizabeth Sellers
author
position: Data Manager
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4385
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0003-4676-2994
Shyama Pagad
author
position: Deputy Chair, Information
IUCN Invasive Species Specialists Group
University of Auckland, School of Biological Sciences
Auckland
1072
Auckland
NZ
Telephone: +64 210754381
email: s.pagad@auckland.ac.nz
homepage: http://www.issg.org
userId: http://orcid.org/0000-0002-5225-6580
Annie Simpson
administrative point of contact
position: Biologist and Information Scientist
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4281
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0001-8338-5134
What is GBIF? API FAQ Newsletter Privacy Terms and agreements Citation Code of Conduct Acknowledgements
Contact GBIF Secretariat Universitetsparken 15 DK-2100 Copenhagen Ø Denmark