
MIxS Checklist · ERC000033

ENA virus pathogen reporting standard checklist

Minimum information about a virus pathogen. A checklist for reporting metadata of virus pathogen samples associated with genomic data. This minimum metadata standard was developed by the COMPARE platform for submission of virus surveillance and outbreak data (such as Ebola) as well as virus isolate information.

36 fields · 10 required · mixs v1.0.0
SeqDesk offers this GSC MIxS environment checklist when you define studies and samples, so contributors enter exactly the metadata the standard expects. The catalog is kept in sync with the upstream GSC MIxS / ENA checklist definitions (registry v2), last updated 2026-06-03, and is published through our registry API. Source: https://www.ebi.ac.uk/ena/browser/view/ERC000033.

sample collection: methods, storage and transport

sample storage conditions · Optional
sample_storage_conditions

Conditions under which the sample was stored, usually storage temperature, duration and location. In a soil context: explain how and for how long the soil sample was stored before DNA extraction (fresh/frozen/other).

text

Human surveillance data

subject exposure · Optional
subject_exposure

Exposure of the subject to infected humans or animals, such as poultry, wild birds or swine. If multiple exposures are applicable, state them separated by semicolons. Example: poultry; wild bird

text
type exposure · Optional
type_exposure

Setting within which the subject is exposed to animals, such as farm, slaughterhouse, food preparation. If multiple exposures are applicable, please state their type in the same order in which you reported the exposure in the field 'subject exposure'. Example: backyard flock; confined animal feeding operation

text
personal protective equipment · Optional
personal_protective_equipment

Use of personal protective equipment, such as gloves or gowns, during any type of exposure. Example: mask

text
hospitalisation · Optional
hospitalisation

Was the subject confined to a hospital as a result of virus infection or problems occurring secondary to virus infection?

select · 2 options
illness duration · Optional
illness_duration

The number of days the illness lasted. Example: 4

text
illness symptoms · Optional
illness_symptoms

The symptoms that have been reported in relation to the illness, such as cough, diarrhea, fever, headache, malaise, myalgia, nausea, runny_nose, shortness_of_breath, sore_throat. If multiple symptoms are applicable, state them separated by semicolons.

text

Collection event information

collection date · Required
collection_date

The date the sample was collected with the intention of sequencing, either as an instance (single point in time) or as an interval. If no exact time is available, the date/time can be right-truncated, i.e. all of these are valid ISO 8601-compliant times: 2008-01-23T19:23:10+00:00; 2008-01-23T19:23:10; 2008-01-23; 2008-01; 2008.

text
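The right-truncation rule for 'collection date' can be checked mechanically. The following is a minimal sketch, assuming only single time points need to be accepted (intervals are not handled) and that the five examples in the description are representative; `is_valid_collection_date` is a hypothetical helper, not part of any ENA or SeqDesk tooling.

```python
import re

# Hypothetical pattern: a year, optionally followed by month, day, time and
# UTC offset, where each finer component requires the coarser one before it.
_COLLECTION_DATE = re.compile(
    r"\d{4}"                                  # year
    r"(-(0[1-9]|1[0-2])"                      # optional month
    r"(-(0[1-9]|[12]\d|3[01])"                # optional day
    r"(T([01]\d|2[0-3]):[0-5]\d:[0-5]\d"      # optional time hh:mm:ss
    r"(Z|[+-]([01]\d|2[0-3]):[0-5]\d)?"       # optional UTC offset
    r")?)?)?"
)


def is_valid_collection_date(value: str) -> bool:
    """Return True if value looks like a right-truncated ISO 8601 time point."""
    return _COLLECTION_DATE.fullmatch(value) is not None


# The five examples from the field description.
for example in ("2008-01-23T19:23:10+00:00", "2008-01-23T19:23:10",
                "2008-01-23", "2008-01", "2008"):
    print(example, is_valid_collection_date(example))
```

The pattern checks shape only: it does not verify that a day exists in the given month, so a value such as 2008-02-31 would still pass.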
geographic location (latitude) · Optional
geographic_location_latitude

The geographical origin of the sample as defined by latitude. The values should be reported in decimal degrees and in the WGS84 system.

text
geographic location (longitude) · Optional
geographic_location_longitude

The geographical origin of the sample as defined by longitude. The values should be reported in decimal degrees and in the WGS84 system.

text
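Because latitude and longitude are free-text fields, a contributor can easily enter a value that is not a decimal-degree number. The sketch below is one possible pre-submission check, assuming the usual bounds of ±90 for latitude and ±180 for longitude; the function names are hypothetical.

```python
def parse_decimal_degrees(value: str, limit: float) -> float:
    """Parse a decimal-degree string and check it lies within [-limit, +limit]."""
    number = float(value)  # raises ValueError for non-numeric text
    # The chained comparison is False for NaN, so NaN is rejected as well.
    if not -limit <= number <= limit:
        raise ValueError(f"{value!r} is outside the range ±{limit}")
    return number


def parse_latitude(value: str) -> float:
    return parse_decimal_degrees(value, 90.0)


def parse_longitude(value: str) -> float:
    return parse_decimal_degrees(value, 180.0)


print(parse_latitude("52.52"), parse_longitude("13.405"))
```

A degrees-minutes-seconds value such as 52°31'N fails at the `float` conversion, which is the intended behaviour for a decimal-degrees field.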
geographic location (region and locality) · Optional
geographic_location_region_and_locality

The geographical origin of the sample as defined by the specific region name followed by the locality name.

text

internal environment

subject exposure duration · Optional
subject_exposure_duration

Duration of the exposure of the subject to an infected human or animal. If multiple exposures are applicable, please state their duration in the same order in which you reported the exposure in the field 'subject exposure'. Example: 1 day; 0.33 days

text
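The fields 'subject exposure', 'type exposure' and 'subject exposure duration' are parallel semicolon-separated lists that must be reported in the same order. The sketch below shows one way to split and align them, assuming a sample is held as a plain dictionary keyed by field name; `split_multi` and `align_exposures` are hypothetical helpers.

```python
def split_multi(value: str) -> list[str]:
    """Split a semicolon-separated field into trimmed, non-empty entries."""
    return [part.strip() for part in value.split(";") if part.strip()]


def align_exposures(sample: dict[str, str]) -> list[dict[str, str]]:
    """Pair each reported exposure with its type and duration by position."""
    exposures = split_multi(sample.get("subject_exposure", ""))
    rows = [{"subject_exposure": exposure} for exposure in exposures]
    for key in ("type_exposure", "subject_exposure_duration"):
        values = split_multi(sample.get(key, ""))
        if not values:
            continue  # the field is optional and was left empty
        if len(values) != len(exposures):
            raise ValueError(
                f"{key} has {len(values)} entries, expected {len(exposures)}"
            )
        for row, value in zip(rows, values):
            row[key] = value
    return rows


# Values taken from the examples in the field descriptions.
sample = {
    "subject_exposure": "poultry; wild bird",
    "type_exposure": "backyard flock; confined animal feeding operation",
    "subject_exposure_duration": "1 day; 0.33 days",
}
for row in align_exposures(sample):
    print(row)
```

Raising on a length mismatch is a design choice for this sketch; a submission form might instead flag the field and let the contributor correct it.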

sample collection

sample capture status · Optional
sample_capture_status

Reason for the sample collection.

select · 7 options

Organism characteristics

geographic location (country and/or sea) · Required
geographic_location_country_andor_sea

The geographical origin of where the sample was collected from, with the intention of sequencing, as defined by the country or sea name. Country or sea names should be chosen from the INSDC country list (http://insdc.org/country.html).

select · 294 options

host disorder

host disease outcome · Optional
host_disease_outcome

Disease outcome in the host.

select · 3 options

host description

host common name · Required
host_common_name

Common name of the host, e.g. human.

text
host subject id · Required
host_subject_id

A unique, de-identified identifier by which each subject can be referred to, e.g. #131.

text
host age · Optional
host_age

Age of the host at the time of sampling. The relevant scale depends on species and study, e.g. it could be seconds for amoebae or centuries for trees.

text
host health state · Required
host_health_state

Health status of the host at the time of sample collection.

select · 14 options
host sex · Required
host_sex

Gender or sex of the host.

select · 17 options
lab_host · Optional
lab_host

Scientific name of the laboratory host used to propagate the source organism from which the sample was obtained.

text
host scientific name · Required
host_scientific_name

Scientific name of the natural (as opposed to laboratory) host of the organism from which the sample was obtained.

text

Virus isolate information

virus identifier · Optional
virus_identifier

Unique laboratory identifier assigned to the virus by the investigator. A strain name is not sufficient, since it might not be unique due to various passages of the same virus. Format: up to 50 alphanumeric characters

text
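The stated format for 'virus identifier' (up to 50 alphanumeric characters) translates directly into a pattern check. This is a minimal sketch under a strict reading of that sentence: ASCII letters and digits only, with no separators such as hyphens or underscores. Whether ENA itself enforces the format this strictly is an assumption, and `is_valid_virus_identifier` is a hypothetical name.

```python
import re

# Strict reading of "up to 50 alphanumeric characters": 1 to 50 ASCII
# letters or digits and nothing else. This strictness is an assumption.
_VIRUS_IDENTIFIER = re.compile(r"[A-Za-z0-9]{1,50}")


def is_valid_virus_identifier(value: str) -> bool:
    return _VIRUS_IDENTIFIER.fullmatch(value) is not None


print(is_valid_virus_identifier("LAB0042p3"))   # letters and digits only
print(is_valid_virus_identifier("LAB-0042"))    # contains a hyphen
```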

General collection event information

collector name · Required
collector_name

Name of the person who collected the specimen. Example: John Smith

text
collecting institution · Required
collecting_institution

Name of the institution to which the person collecting the specimen belongs. Format: Institute Name, Institute Address

text
receipt date · Optional
receipt_date

Date on which the sample was received. Format: YYYY-MM-DD. Please provide the highest precision possible. If the sample was received by the institution and not collected, the 'receipt date' must be provided instead. Either the 'collection date' or 'receipt date' must be provided. If available, provide both dates.

text
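The 'receipt date' description combines a format (YYYY-MM-DD) with a cross-field rule: at least one of 'collection date' and 'receipt date' must be present. The sketch below encodes just those two statements, assuming a sample is a dictionary keyed by field name; it does not model the separate Required flag on 'collection date', and `check_dates` is a hypothetical helper.

```python
import re
from datetime import date


def check_dates(sample: dict[str, str]) -> list[str]:
    """Return a list of problems with the collection/receipt date fields."""
    problems = []
    collection = sample.get("collection_date", "").strip()
    receipt = sample.get("receipt_date", "").strip()

    # Cross-field rule: at least one of the two dates must be given.
    if not collection and not receipt:
        problems.append("provide 'collection date' or 'receipt date'")

    if receipt:
        # The regex enforces the zero-padded YYYY-MM-DD shape; fromisoformat
        # then rejects dates that do not exist in the calendar.
        try:
            if not re.fullmatch(r"\d{4}-\d{2}-\d{2}", receipt):
                raise ValueError("wrong shape")
            date.fromisoformat(receipt)
        except ValueError:
            problems.append(f"receipt date {receipt!r} is not a valid YYYY-MM-DD date")
    return problems


print(check_dates({"receipt_date": "2008-01-25"}))
print(check_dates({}))
```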

Serology detection

definition for seropositive sample · Optional
definition_for_seropositive_sample

The cutoff value used by an investigator in determining that a sample was seropositive.

text
serotype (required for a seropositive sample) · Optional
serotype_required_for_a_seropositive_sample

Serological variety of a species characterised by its antigenic properties. For Influenza, the HA subtype should be the letter H followed by a number from 1 to 16, and the NA subtype should be the letter N followed by a number from 1 to 9, unless a novel subtype is identified. If only one of the subtypes has been tested, use the format H5Nx or HxN1. Example: H1N1

text
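The Influenza naming rule in the serotype description can be expressed as a small pattern. This is a minimal sketch, assuming only the standard ranges given in the text (H1 to H16, N1 to N9) plus the `x` placeholder for an untested subtype; novel subtypes, which the description explicitly allows, are reported as non-standard rather than invalid. The function name is hypothetical.

```python
import re

# H followed by 1-16 or x, then N followed by 1-9 or x.
_INFLUENZA_SUBTYPE = re.compile(r"H(1[0-6]|[1-9]|x)N([1-9]|x)")


def is_standard_influenza_subtype(value: str) -> bool:
    """True for values such as H1N1, H5Nx or HxN1; False otherwise."""
    match = _INFLUENZA_SUBTYPE.fullmatch(value)
    if match is None:
        return False
    # At least one of the two subtypes must actually have been tested.
    return not (match.group(1) == "x" and match.group(2) == "x")


for value in ("H1N1", "H5Nx", "HxN1", "H17N1", "HxNx"):
    print(value, is_standard_influenza_subtype(value))
```

A False result is best treated as a prompt for manual review, since it may simply indicate a novel subtype or a serotype of a virus other than Influenza.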

Infraspecies information

isolate · Required
isolate

Individual isolate from which the sample was obtained.

text
strain · Optional
strain

Name of the strain from which the sample was obtained.

text

Associated host information

host habitat · Optional
host_habitat

Natural habitat of the avian or mammalian host.

select · 7 options
isolation source host-associated · Optional
isolation_source_hostassociated

Name of host tissue or organ sampled for analysis. Example: tracheal tissue

text
host description · Optional
host_description

Other descriptive information relating to the host.

text

host details

gravidity · Optional
gravidity

Whether or not the subject is gravid. If so, report date due or date post-conception and specify which of these two dates is being reported.

text
host behaviour · Optional
host_behaviour

Natural behaviour of the host.

select · 4 options

Environmental information

isolation source non-host-associated · Optional
isolation_source_nonhostassociated

Describes the physical, environmental and/or local geographical source of the biological sample from which the sample was derived. Example: soil

text
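The ten fields marked Required in this checklist can be collected into a simple completeness check. The sketch below uses the field keys exactly as listed on this page and assumes a sample is a dictionary keyed by those names; `missing_required` is a hypothetical helper, not a SeqDesk or ENA function.

```python
# Keys of the ten fields marked Required in this checklist.
REQUIRED_FIELDS = (
    "collection_date",
    "geographic_location_country_andor_sea",
    "host_common_name",
    "host_subject_id",
    "host_health_state",
    "host_sex",
    "host_scientific_name",
    "collector_name",
    "collecting_institution",
    "isolate",
)


def missing_required(sample: dict[str, str]) -> list[str]:
    """Return the required keys that are absent or blank, in checklist order."""
    return [key for key in REQUIRED_FIELDS if not sample.get(key, "").strip()]


partial = {"collection_date": "2008-01-23", "host_common_name": "human"}
print(missing_required(partial))
```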