Global Register of Introduced and Invasive Species - United States (Contiguous) (ver.2.0, 2022)
Citation
Simpson A, Sellers E, Pagad S (2023). Global Register of Introduced and Invasive Species - United States (Contiguous) (ver.2.0, 2022). Invasive Species Specialist Group ISSG. Checklist dataset https://doi.org/10.5066/p9kfftod accessed via GBIF.org on 2024-12-15.Description
This is the latest version of the dataset initially published to GBIF by the Invasive Species Specialist Group (ISSG) on behalf of the U.S. Geological Survey on October 12, 2020, at https://www.gbif.org/dataset/6b64ef7e-82f7-47a3-8ddb-ec6794ea07d6. Like that checklist, this version presents validated and verified national checklists of introduced (alien) and invasive alien species at the sub-country level. The other two related checklists for the United States, also newly published separately as V2.0, are for the States of Alaska and Hawaii.
Differences between two previous versions and ver.2.0, 2022 (this dataset): SIZE: the first version V1.0 - 5,006 accepted names (arthropods were not included); the previous version - 8,654 accepted names and two unranked hybrids; ver.2.0, 2022 (this dataset) - 8,525 accepted names and two unranked hybrids. OTHER DIFFERENCES: the previous version provided: a broader inclusion of arthropods; approximate dates of introduction (where available); 4,693 references; improved disambiguation of scientific names; biocontrol species information (where applicable); taxonomic synonyms, where available, in taxonRemarks field; unique occurrenceIDs; no habitat information; ver.2.0, 2022 (this dataset) adds pathway and habitat information, where available, more precise management of names and synonyms (and so is smaller than the previous version), and additional data on approximate dates of introduction.
OVERVIEW: Introduced (non-native) species that becomes established may eventually become invasive, so tracking introduced species provides a baseline for effective modeling of species trends and interactions, geospatially and temporally. The umbrella dataset, called United States Register of Introduced and Invasive Species (US-RIIS), is comprised of three lists, one each for Alaska (AK, with 545 records), Hawaii (HI, with 5,628 records), and the conterminous (or lower 48) United States (L48, with 8,527 records, this dataset). Each list includes introduced (non-native), established (reproducing) taxa that: are, or may become, invasive (harmful) in the locality; are not known to be harmful there; and/or have been used for biological control in the locality.
To be included in the Global Register of Introduced and Invasive Species - United States (Contiguous), or GRIIS-L48 (with L48 meaning the Lower 48 Conterminous United States), a taxon must be non-native everywhere in the locality and established (reproducing) anywhere in the locality. Native pest species are not included.
Each record has information on taxonomy, a vernacular name, establishment means designation (introduced unintentionally, or assisted colonization), degree of establishment (established, invasive, or widespread invasive), hybrid status, pathway of introduction (where available), habitat (where available), whether a biocontrol species, dates of introduction (where available; currently 46% of the records for the conterminous United States), associated taxa (where applicable), native and introduced distributions (where available), and citations for the authoritative source(s) from which this information is drawn. The umbrella dataset US-RIIS builds on a previous dataset, A Comprehensive List of Non-Native Species Established in Three Major Regions of the U.S.: Version 3.0 (Simpson et al., 2020, https://doi.org/10.5066/p9e5k160).
There are 14,700 records in the master list (USRIISv2_MasterList) and 12,571 unique scientific names. The list is derived from more than 5,800 authoritative sources (USRIISv2_AuthorityReferences) and was reviewed by (or based on input from) more than 30 taxonomic experts and invasive species scientists.
Many thanks to these reviewers and contributors: Coauthors Pam Fuller (USGS Emeritus), Kevin Faccenda (University of Hawaii), Neal Evenhuis (Bishop Museum), Janis Matsunaga (Hawaii Department of Agriculture), and Matt Bowser (US-Fish and Wildlife Service); contributors Rachael Blake (data science), National Socio-Environmental Synthesis Center (SESYNC); M. Lourdes Chamorro (Curculionidae), USDA-ARS Entomology; Meghan C. Eyler (data reviewer), US Fish & Wildlife Service; Danielle Froelich (Hawaiian botany), SWCA Environmental Consultants; Thomas Henry (Heteroptera), USDA-ARS Entomology; Sam James (Annelida), Maharishi University; Nancy Khan (Hawaiian botany), Smithsonian Institution; Alex Konstantinov (Chrysomelidae), USDA-ARS Entomology; Andrew P. Landsman (Arachnida), National Park Service, C&O Canal National Historical Park; Christopher Lepczyk (Vertebrata), Auburn University; Sandy Liebhold (Coleoptera), USDA-FS; Steven Lingafelter (Cerambycidae), USDA-APHIS; Walter Meshaka (Herpetology), State Museum of Pennsylvania; Gary L. Miller (Aphididae), USDA-ARS Entomology; Allen Norrbom (Tephritidae), USDA-ARS Entomology; Shyama Pagad (global invasive species), IUCN SSC Invasive Species Specialists' Group; John Reynolds (Annelida), Oligochaetology Laboratory; Alexander Salazar (Lycosidae), Miami University, Ohio; Elizabeth A. Sellers (data manager), USGS; Derek Sikes (Alaskan invertebrates), University of Alaska; Bruce A. Snyder (Annelida), Georgia College and State University; Alma Solis (Pyralid moths), USDS-ARS at the Smithsonian Institution; Rebecca Turner (data manager), Scion Inc., New Zealand; Darrell Ubick (Arachnida), Cal Academy; Warren Wagner (Hawaiian botany), Smithsonian Institution; Mark Wetzel (Annelida), Illinois Natural History Survey; and James D. Young (Lepidoptera), USDA-APHIS-PPQ-PHP. Our apologies to the many contributing experts we may have inadvertently omitted.
Purpose
We anticipate the GRIIS-L48 will be used to 1) refine species lists for horizon scanning (established species should not be included), 2) create introduced species lists for smaller areas within the conterminous U.S., and 3) tag species occurrence data records with an introduced status where the species may not have been initially reported as introduced. The GRIIS-L48 also contributes to the Global Register of Introduced and Invasive Species (http://griis.org/about.php).
Sampling Description
Study Extent
Temporal: data collection began in 2012 and is ongoing. Although publication has been annual since 2018, it is likely that version 3.0 will take 2 years or more to complete.Sampling
Methods used to create the integrated dataset have varied somewhat over time. Additional fields have been added from those that were used at the start of the project. Initially, each region (AK, HI, L48) had a lead data searcher and the lists were combined afterwards within one Excel spreadsheet. The interpretation of the content of the citation fields follow the rules established by ITIS and described here: http://www.itis.gov/submit_guidlines.html. Each lead data searcher was trained in the methods to discern an authoritative source; how to extract scientific names from the source; how to generate from the source and other reference databases such as ITIS all of the information in the spreadsheet columns. There are two spreadsheets to populate (MasterList and AuthorityReferences), plus a data dictionary sheet (DataDictionary). The US-RIIS dataset shared with GBIF contains these fields: locality - Required; L48 = conterminous United States of America; scientificName - Required; modified from Darwin Core; an accepted species, subspecies, variety, form, or hybrid name, preferably validated in GBIF, or secondarily, in ITIS, or otherwise by another taxonomic authority; Genus names alone are not allowed; does not include scientificNameAuthorship. scientificNameAuthorship - Optional; from Darwin Core; as provided by the taxonomic authority or the reference. vernacularName - Required; from Darwin Core; generally provided by ITIS, the authority, or another reliable source; may be a very generic term where specific common names are not in use. taxonRank - Required; from Darwin Core; as provided by the taxonomic authority or derived by data searcher; species, subspecies, variety, form, unranked. establishmentMeans - Required; from Darwin Core; controlled vocabulary: introduced (alien, exotic, non-native, nonindigenous); introduced: assisted colonization; (or both terms). degreeOfEstablishment - Required; from Darwin Core; an assertion provided by or derived from the authority describing the established or invasive status of the taxa within the region; controlled vocabulary (smaller for this dataset than from Darwin Core, due to the nature of the dataset): established (category C3), meaning reproducing in the locality; invasive (category D2), meaning causing, or likely to cause, harm; widespread invasive (category E), meaning at least locally abundant and harmful ecologically, economically, or to health. occurrenceStatus - Required; from Darwin Core; mapped value = "present". associatedTaxa - Optional; from Darwin Core; consists of a descriptor of the relationship (e.g., biocontrol for:, infests:, parasite of:, etc.) followed by a text description of the name of the organism(s) that the species was introduced to control. eventRemarks - Optional; from Darwin Core; consists of "approx. date of introduction: " and a date or text string describing when the taxon is believed to have been introduced to the locality. Also includes brief (Author (YYYY)) value if the associatedReferences authority is not the authority providing the date of introduction. taxonRemarks - Optional; from Darwin Core; only included if the scientific name has many synonyms or misspellings, or, if the name provided by the authority is an unaccepted synonym of a valid/accepted name in a taxonomic authority; Multiple names are separated by a semicolon and a space. kingdom - Required; from Darwin Core; one of seven names as specified in GBIF or another naming authority; controlled vocabulary: Animalia, Bacteria, Chromista, Fungi, Plantae, Protozoa, Viruses. phylum - Required; from Darwin Core; as provided by the taxonomic authority; occasionally = "undefined". class - Required; from Darwin Core; as provided by the taxonomic authority; occasionally = "undefined". order - Required; from Darwin Core; as provided by the taxonomic authority; occasionally = "undefined". family - Required; from Darwin Core; as provided by the taxonomic authority; occasionally = "undefined". taxonomicStatus - Required; from Darwin Core; value = "Accepted " followed by the acronym of the taxonomic authority providing the taxonID. taxonID - Required; from Darwin Core; a unique number for the taxa, as provided by the taxonomic authority. Where possible, a URN. Otherwise a stable URL. associatedReferences - Required; from Darwin Core; the abbreviated name for the author of the resource asserting the taxa's non-native species status in the locality; Generally, but not always, followed by (year), followed by a URL back to an online version of the reference, or by a full reference. eventDate - Required; from Darwin Core; the date of creation of the record, in the format YYYY-MM-DD; manually generated so errors are possible; NOTE: if earliest occurrence for the species is known or suspected, it is provided in the eventRemarks field, not eventDate. modified - Required; from Darwin Core; date of latest edit/change made to the record; is manually generated, so errors are possible. occurrenceRemarks - Optional; from Darwin Core; text field with comments about the record, generally extracted from the text of the associatedReference or other references. occurrenceID - Required; from Darwin Core; an identifier for the Organism instance (at the record level); a controlled vocabulary that is manually generated using an ordinal number that is not repeated, prefixed by the umbrella dataset's abbreviated name (USRIIS), the record's locality (L48), and the record's kingdom; since the content of this field is not auto-generated, it is subject to human error. Deleted occurrenceIDs are not reused.Quality Control
Incoming data is error checked. Scientific names are validated against the Global Biodiversity Information Facility (GBIF), then the Integrated Taxonomic Information System (ITIS), and other taxonomic authorities when necessary. Other data fields are assumed correct. The validity of the assertion of non-native status itself may subsequently be questioned and the entry deleted or moved to a watch list if: the scientific name is not in GBIF or ITIS and not reaffirmed by other authoritative taxonomic sources; other authoritative sources state the species is not present or is native or cryptic; irreconcilable typographical errors in scientific names are found; entries are found to be duplicate or synonomous taxonomic names.Method steps
- Tools used are described in the following steps:
- Microsoft Word to assist in formatting data.
- Microsoft Excel to hold the data and sort and display it.
- Global Biodiversity Information Facility to validate taxonomic names.
- Integrated Taxonomic Information System (ITIS) to validate taxonomic names not found in GBIF.
- Microsoft OneDrive to store the data.
- Adobe Pro to extract text from image-based pdfs.
- Sublime Text to assist with text formatting and metadata eml generation.
- Various browsers (MS Edge, Google Chrome, Mozilla Firefox, Apple Safari), species databases, and search engines, to discover authoritative introduced species lists for L48.
- PROVENANCE: Our team tracks the provenance of the dataset as any modifications, transformations, edits, or decisions to accept or reject data points are made. Provenance is essential to the dataset, which consists entirely of assertions of the presence and non-native status of a species in an area.
- This tracking involves the inclusion of an authority or data source with each assertion that a species is introduced and established in a given area. The actual authorities' lists and manuscripts themselves are also saved digitally as written proof of the individual assertions and to justify, as necessary, each species' inclusion in the list.
- To assist with tracking the evolution of a data record, there are "Acquisition Date" and "modified" fields (which are not auto-generated and so subject to human error). The "modified" field is a Darwin Core term for the date of the last modification of the record (YYYY-MM-DD), and the cumulative modifications to the record are described in the "Update Remarks" field, separated within the field by commas (for modifications made on the same date) and semicolons (separating modifications made on different dates).
- The dataset and the reference authorities are maintained in Microsoft OneDrive. The list is worked on in MS Excel and the file's history can be used to 'roll back' to previous versions, if necessary. A versionHistory.txt file published online along with the data describes major changes and differences in each published version of the list. Past versions of the precursor to this dataset are maintained and made available as part of the deprecated .zip files at: https://doi.org/10.5066/P9E5K160, and the previous superseded version of the US-RIIS are maintained at https://doi.org/10.5066/P95XL09Q. All versions of the US-RIIS are accessible here: https://doi.org/10.5066/P92DN8D8.
- ANALYZE CRITERIA: Assertions of introduced and established status for a species are accepted if the authority is a trusted source, such as a governmental or government-affiliated, non-governmental organization, international biodiversity organization, academic institution, or a taxonomic expert. If a list is found on the Web and judged as authoritative, it is downloaded for future reference. Authorities, in addition to being cited within each record, are accumulated in an AuthorityReferences spreadsheet. These criteria are part of our documentation. When there is disagreement among experts about establishment, taxonomy, native status, and approximate dates of introduction a wider search of available information is used to determine the best resolution. Cryptic species (of unknown origin) are not included in the GRIIS-L48. Taxa with conflicting status or taxonomy among different sources, that cannot be reconciled through further research, are moved to an informal watch list (available upon request).
Additional info
During the data cleaning of this species checklist (that can be downloaded as part of the dataset at https://doi.org/10.5066/P9KFFTOD): 1) 'locality' values were translated during data cleaning: L48 = Conterminous United States of America 2) 'eventDate' reflects the date the record was first added to the GRIIS-L48. NOTE: if earliest occurrence for the species is known or suspected, it is provided in the eventRemarks field not the eventDate field. 3) Content for these fields (where appropriate) was globally added during data cleaning: administrativeArea, basisOfRecord, occurrenceStatus, continent, country, countryCode, datasetID, datasetName, dcterms:bibliographicCitation, dcterms:language, dcterms:license, dcterms:references, dcterms:rightsHolder, dcterms:source, dcterms:subject, dcterms:temporal, dcterms:type, higherGeography, institutionCode, occurrenceStatus, stateProvince.Taxonomic Coverages
-
Animaliacommon name: Animals rank: kingdom
-
Bacteriacommon name: Bacteria rank: kingdom
-
Chromistacommon name: Chromists rank: kingdom
-
Fungicommon name: Mushrooms rank: kingdom
-
Plantaecommon name: Plants rank: kingdom
-
Protozoacommon name: Protists rank: kingdom
-
Virusescommon name: Viruses rank: kingdom
-
Magnoliopsidacommon name: Dicots rank: class
-
Insectacommon name: Insects rank: class
Geographic Coverages
Bibliographic Citations
Contacts
Annie Simpsonoriginator
position: Biologist and Information Scientist
United States Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4281
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0001-8338-5134
Elizabeth Sellers
originator
position: Data Manager
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4385
email: esellers@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0003-4676-2994
Shyama Pagad
originator
position: Deputy Chair, Information
IUCN Invasive Species Specialists Group
Department of Biological Sciences, University of Auckland
Auckland
1072
Auckland
NZ
Telephone: +64 210754381
email: s.pagad@auckland.ac.nz
homepage: http://www.issg.org
userId: http://orcid.org/0000-0002-5225-6580
Annie Simpson
metadata author
position: Biologist and Information Scientist
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4281
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0001-8338-5134
Annie Simpson
author
position: Biologist and Information Scientist
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4281
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0001-8338-5134
Elizabeth Sellers
author
position: Data Manager
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4385
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0003-4676-2994
Shyama Pagad
author
position: Deputy Chair, Information
IUCN Invasive Species Specialists Group
University of Auckland, School of Biological Sciences
Auckland
1072
Auckland
NZ
Telephone: +64 210754381
email: s.pagad@auckland.ac.nz
homepage: http://www.issg.org
userId: http://orcid.org/0000-0002-5225-6580
Annie Simpson
administrative point of contact
position: Biologist and Information Scientist
U.S. Geological Survey
12201 Sunrise Valley Dr. Mailstop 302
Reston
20192
Virginia
US
Telephone: +1 703 648 4281
email: US-RIIS@usgs.gov
homepage: https://www.usgs.gov/programs/science-analytics-and-synthesis-sas
userId: http://orcid.org/0000-0001-8338-5134