MIxS Checklist · ERC000033
Minimum information about a virus pathogen. A checklist for reporting metadata of virus pathogen samples associated with genomic data. This minimum metadata standard was developed by the COMPARE platform for submission of virus surveillance and outbreak data (such as Ebola) as well as virus isolate information.
sample_storage_conditionsConditions at which sample was stored, usually storage temperature, duration and location. In soil context: Explain how and for how long the soil sample was stored before DNA extraction (fresh/frozen/other).
subject_exposureExposure of the subject to infected human or animals, such as poultry, wild bird or swine. If multiple exposures are applicable, please state them separated by semicolon. Example: poultry; wild bird
type_exposureSetting within which the subject is exposed to animals, such as farm, slaughterhouse, food preparation. If multiple exposures are applicable, please state their type in the same order in which you reported the exposure in the field 'subject exposure'. Example: backyard flock; confined animal feeding operation
personal_protective_equipmentUse of personal protective equipment, such as gloves, gowns, during any type of exposure. Example: mask
hospitalisationWas the subject confined to a hospital as a result of virus infection or problems occurring secondary to virus infection?
illness_durationThe number of days the illness lasted. Example: 4
illness_symptomsThe symptoms that have been reported in relation to the illness, such as cough, diarrhea, fever, headache, malaise, myalgia, nausea, runny_nose, shortness_of_breath, sore_throat. If multiple exposures are applicable, please state them separated by semicolon.
collection_dateThe date the sample was collected with the intention of sequencing, either as an instance (single point in time) or interval. In case no exact time is available, the date/time can be right truncated i.e. all of these are valid ISO8601 compliant times: 2008-01-23T19:23:10+00:00; 2008-01-23T19:23:10; 2008-01-23; 2008-01; 2008.
geographic_location_latitudeThe geographical origin of the sample as defined by latitude. The values should be reported in decimal degrees and in WGS84 system
geographic_location_longitudeThe geographical origin of the sample as defined by longitude. The values should be reported in decimal degrees and in WGS84 system
geographic_location_region_and_localityThe geographical origin of the sample as defined by the specific region name followed by the locality name.
subject_exposure_durationDuration of the exposure of the subject to an infected human or animal. If multiple exposures are applicable, please state their duration in the same order in which you reported the exposure in the field 'subject exposure'. Example: 1 day; 0.33 days
sample_capture_statusReason for the sample collection.
geographic_location_country_andor_seaThe geographical origin of where the sample was collected from, with the intention of sequencing, as defined by the country or sea name. Country or sea names should be chosen from the INSDC country list (http://insdc.org/country.html).
host_disease_outcomeDisease outcome in the host.
host_common_namecommon name of the host, e.g. human
host_subject_ida unique identifier by which each subject can be referred to, de-identified, e.g. #131
host_ageage of host at the time of sampling; relevant scale depends on species and study, e.g. could be seconds for amoebae or centuries for trees
host_health_stateHealth status of the host at the time of sample collection.
host_sexGender or sex of the host.
lab_hostscientific name of the laboratory host used to propagate the source organism from which the sample was obtained
host_scientific_nameScientific name of the natural (as opposed to laboratory) host to the organism from which sample was obtained.
virus_identifierUnique laboratory identifier assigned to the virus by the investigator. Strain name is not sufficient since it might not be unique due to various passsages of the same virus. Format: up to 50 alphanumeric characters
collector_nameName of the person who collected the specimen. Example: John Smith
collecting_institutionName of the institution to which the person collecting the specimen belongs. Format: Institute Name, Institute Address
receipt_dateDate on which the sample was received. Format:YYYY-MM-DD. Please provide the highest precision possible. If the sample was received by the institution and not collected, the 'receipt date' must be provided instead. Either the 'collection date' or 'receipt date' must be provided. If available, provide both dates.
definition_for_seropositive_sampleThe cut off value used by an investigatior in determining that a sample was seropositive.
serotype_required_for_a_seropositive_sampleSerological variety of a species characterised by its antigenic properties. For Influenza, HA subtype should be the letter H followed by a number between 1-16 unless novel subtype is identified and the NA subtype should be the letter N followed by a number between 1-9 unless novel subtype is identified. If only one of the subtypes have been tested then use the format H5Nx or HxN1. Example: H1N1
isolateindividual isolate from which the sample was obtained
strainName of the strain from which the sample was obtained.
host_habitatNatural habitat of the avian or mammalian host.
isolation_source_hostassociatedName of host tissue or organ sampled for analysis. Example: tracheal tissue
host_descriptionOther descriptive information relating to the host.
gravidityWhether or not the subject is gravid. If so, report date due or date post-conception and specify which of these two dates is being reported.
host_behaviourNatural behaviour of the host.
isolation_source_nonhostassociatedDescribes the physical, environmental and/or local geographical source of the biological sample from which the sample was derived. Example: soil