MIxS Checklist · ERC000028

ENA prokaryotic pathogen minimal sample checklist

Minimum information required for a prokaryotic pathogen sample

20 fields · 6 required · mixsv1.0.0
SeqDesk offers this GSC MIxS environment checklist when you define studies and samples, so contributors enter exactly the metadata the standard expects. The catalog is kept in sync with the upstream GSC MIxS / ENA checklist definitions (registry v2), last updated 2026-06-03, and is published through our registry API. Source: https://www.ebi.ac.uk/ena/browser/view/ERC000028.

Collection event information

isolation_source · Required
isolation_source

describes the physical, environmental and/or local geographical source of the biological sample from which the sample was derived

text
lat_lon · Optional
lat_lon

geographical coordinates of the location where the specimen was collected

text
collected_by · Optional
collected_by

name of persons or institute who collected the specimen

text
collection date · Required
collection_date

The date the sample was collected with the intention of sequencing, either as an instance (single point in time) or as an interval. If no exact time is available, the date/time can be right-truncated; all of the following are valid ISO 8601 times: 2008-01-23T19:23:10+00:00; 2008-01-23T19:23:10; 2008-01-23; 2008-01; 2008.

text
geographic location (region and locality) · Optional
geographic_location_region_and_locality

The geographical origin of the sample as defined by the specific region name followed by the locality name.

text
identified_by · Optional
identified_by

name of the expert who identified the specimen taxonomically

text
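
The collection_date description above lists five right-truncated ISO 8601 forms. As a minimal sketch, a contributor-side check for those forms could look like the following. The function name `is_valid_collection_date` is hypothetical and not part of SeqDesk or ENA. The sketch covers single points in time only (the interval notation is not specified on this page), accepts only the `+hh:mm` / `-hh:mm` offset style shown in the examples, and does not verify calendar validity (for instance, it would accept 2008-02-31).

```python
import re

# Right-truncated forms taken from the collection_date description:
# YYYY, YYYY-MM, YYYY-MM-DD, YYYY-MM-DDThh:mm:ss, optionally with a UTC offset.
_COLLECTION_DATE_RE = re.compile(
    r"\d{4}"                                    # year
    r"(?:-(?:0[1-9]|1[0-2])"                    # optional month 01-12
    r"(?:-(?:0[1-9]|[12]\d|3[01])"              # optional day 01-31
    r"(?:T(?:[01]\d|2[0-3]):[0-5]\d:[0-5]\d"    # optional time hh:mm:ss
    r"(?:[+-](?:[01]\d|2[0-3]):[0-5]\d)?"       # optional offset such as +00:00
    r")?)?)?"
)


def is_valid_collection_date(value: str) -> bool:
    """Return True if value matches one of the truncated forms listed above."""
    return _COLLECTION_DATE_RE.fullmatch(value) is not None


if __name__ == "__main__":
    for candidate in ("2008-01-23T19:23:10+00:00", "2008-01", "23/01/2008"):
        print(candidate, is_valid_collection_date(candidate))
```

A pattern check like this only catches formatting mistakes early; the authoritative validation remains whatever ENA applies at submission time.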

sample collection

environmental_sample · Optional
environmental_sample

identifies sequences derived by direct molecular isolation from a bulk environmental DNA sample (by PCR with or without subsequent cloning of the product, DGGE, or other anonymous methods) with no reliable identification of the source organism

select · 2 options

Organism characteristics

mating_type · Optional
mating_type

mating type of the organism from which the sequence was obtained; mating type is used for prokaryotes, and for eukaryotes that undergo meiosis without sexually dimorphic gametes

text
geographic location (country and/or sea) · Required
geographic_location_country_andor_sea

The geographical origin of where the sample was collected from, with the intention of sequencing, as defined by the country or sea name. Country or sea names should be chosen from the INSDC country list (http://insdc.org/country.html).

select · 294 options

host description

host health state · Required
host_health_state

Health status of the host at the time of sample collection.

select · 14 options
lab_host · Optional
lab_host

scientific name of the laboratory host used to propagate the source organism from which the sample was obtained

text
host scientific name · Required
host_scientific_name

Scientific name of the natural (as opposed to laboratory) host to the organism from which sample was obtained.

text

Pointer to physical material

bio_material · Optional
bio_material

Unique identifier that references the biological material from which the sample was obtained and that ideally exists in a curated collection (e.g. stock centres, seed banks, DNA banks). The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the voucher id (institution_code:collection_code:voucher_id). Please note institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/

text
culture_collection · Optional
culture_collection

Unique identifier that references the culture (e.g. live microbial and viral cultures and cell lines) from which the sample has been obtained and that has been deposited in a curated culture collection. The ID needs to provide an institution code and the culture id, with an optional collection code, in the following structure: institution_code:(collection_code:)voucher_id, where the part in parentheses is optional. Please note institution codes (and optional collection codes) are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/

text
specimen_voucher · Optional
specimen_voucher

Unique identifier that references the physical specimen that remains after the sequence has been obtained and that ideally exists in a curated collection. The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the voucher id (institution_code:collection_code:voucher_id). Please note institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/

text
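
The three identifier fields in this group share the colon-separated structure institution_code:collection_code:voucher_id, with the collection code present only if available. The sketch below shows one way to split such a value. `MaterialId` and `parse_material_id` are hypothetical names, the example values are made up, and the code assumes that none of the parts contains a colon. It does not check the codes against the INSDC controlled vocabulary.

```python
from typing import NamedTuple, Optional


class MaterialId(NamedTuple):
    institution_code: str
    collection_code: Optional[str]  # None when no collection code is given
    voucher_id: str


def parse_material_id(value: str) -> MaterialId:
    """Split 'institution:collection:voucher' or 'institution:voucher'.

    Assumption: individual parts never contain ':' themselves.
    """
    parts = [part.strip() for part in value.split(":")]
    if len(parts) not in (2, 3) or not all(parts):
        raise ValueError(
            "expected institution_code:collection_code:voucher_id "
            f"(collection code optional), got {value!r}"
        )
    if len(parts) == 2:
        institution, voucher = parts
        return MaterialId(institution, None, voucher)
    institution, collection, voucher = parts
    return MaterialId(institution, collection, voucher)


if __name__ == "__main__":
    # "INST", "COLL" and "V-001" are placeholder values, not real INSDC codes.
    print(parse_material_id("INST:COLL:V-001"))
    print(parse_material_id("INST:V-001"))
```

Returning a named tuple keeps the optional collection code explicit, so downstream code can distinguish a missing code from an empty string.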

Infraspecies information

isolate · Required
isolate

individual isolate from which the sample was obtained

text
sub_species · Optional
sub_species

name of sub-species of organism from which sample was obtained

text
sub_strain · Optional
sub_strain

name or identifier of a genetically or otherwise modified strain from which sample was obtained, derived from a parental strain (which should be annotated in the strain field)

text
serovar · Optional
serovar

serological variety of a species (usually a prokaryote) characterized by its antigenic properties

text
strain · Optional
strain

Name of the strain from which the sample was obtained.

text
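
Six of the 20 fields in this checklist are marked Required. As a rough sketch, a pre-submission check for those six could be written as below. It assumes a sample is held as a plain dict keyed by the field keys shown on this page; that representation and the function name `missing_required_fields` are illustrative assumptions, not SeqDesk's or ENA's actual data model.

```python
# Field keys marked "Required" in this checklist (ERC000028).
REQUIRED_FIELDS = (
    "isolation_source",
    "collection_date",
    "geographic_location_country_andor_sea",
    "host_health_state",
    "host_scientific_name",
    "isolate",
)


def missing_required_fields(sample: dict) -> list:
    """Return the required field keys that are absent or blank in sample."""
    return [
        name
        for name in REQUIRED_FIELDS
        if not str(sample.get(name) or "").strip()
    ]


if __name__ == "__main__":
    # Placeholder record with only two of the six required fields filled in.
    sample = {"isolate": "isolate-1", "collection_date": "2008-01-23"}
    print(missing_required_fields(sample))
```

The check treats whitespace-only values as missing and reports the gaps in checklist order, which makes the result easy to show back to a contributor.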