MIxS Checklist · ERC000053
Minimum information required for reporting samples associated with the Tree of Life Programme (https://www.sanger.ac.uk/programme/tree-of-life/).
latitude_start: Latitude of the location where the sampling event started, e.g. each CTD cast, net tow, or bucket collection is a distinct event. Format: ##.####, decimal degrees; North = +, South = -; use WGS 84 for GPS data. Example: -24.6666.
longitude_start: Longitude of the location where the sampling event started, e.g. each CTD cast, net tow, or bucket collection is a distinct event. Format: ###.####, decimal degrees; East = +, West = -; use WGS 84 for GPS data. Example: -096.1012.
latitude_end: Latitude of the location where the sampling event ended, e.g. each CTD cast, net tow, or bucket collection is a distinct event. Format: ##.####, decimal degrees; North = +, South = -; use WGS 84 for GPS data. Example: -24.6643.
longitude_end: Longitude of the location where the sampling event ended, e.g. each CTD cast, net tow, or bucket collection is a distinct event. Format: ###.####, decimal degrees; East = +, West = -; use WGS 84 for GPS data. Example: -096.1171.
organism_part: The part of the organism's anatomy, or the substance arising from an organism, from which the biomaterial was derived; excludes cells.
lifestage: The age class or life stage of the organism at the time of collection.
relationship: Indicates whether the specimen has a known relationship to another specimen (e.g. parent, child, sibling or other kind of relationship).
sample_symbiont_of: Reference from the symbiont to its host sample. The referenced sample should already be registered in INSDC, e.g. ERSxxxxxx.
symbiont: Used to separate host and symbiont metadata within a symbiont system, where host species are indicated as 'N' and symbionts are indicated as 'Y'.
sample_collection_method: The method employed for collecting the sample. Can be provided in the form of a PMID, DOI, URL or text.
sample_coordinator_affiliation: The university, institution, or society affiliation of the sample coordinator.
sample_same_as: Reference to sample(s) that are equivalent. The referenced sample(s) should already be registered in INSDC. This should be formatted as one of the following: a single sample, e.g. ERSxxxxxx, or a comma-separated list, e.g. ERSxxxxxx,ERSxxxxxx.
sample_derived_from: Reference to parental sample(s) or original run(s) that the assembly is derived from. The referenced samples or runs should already be registered in INSDC. This should be formatted as one of the following: a single sample/run, e.g. ERSxxxxxx; a comma-separated list, e.g. ERSxxxxxx,ERSxxxxxx; or a range, e.g. ERSxxxxxx-ERSxxxxxx.
project_name: Name of the project within which the sequencing was organized.
barcoding_center: Center where DNA barcoding was/will be performed.
tolid: A ToLID (Tree of Life ID) is a unique and easy-to-communicate sample identifier that provides species recognition, differentiates between specimens of the same species and adds taxonomic context. ToLIDs are issued by id.tol.sanger.ac.uk. They are endorsed by the Earth BioGenome Project (EBP) and should be assigned to any sample associated with the EBP.
collected_by: Name of the persons or institute who collected the specimen.
collection_date: The date the sample was collected with the intention of sequencing, either as an instance (single point in time) or interval. If no exact time is available, the date/time can be right-truncated, i.e. all of these are valid ISO 8601 compliant times: 2008-01-23T19:23:10+00:00; 2008-01-23T19:23:10; 2008-01-23; 2008-01; 2008.
geographic_location_latitude: The geographical origin of the sample as defined by latitude. The values should be reported in decimal degrees and in the WGS84 system.
geographic_location_longitude: The geographical origin of the sample as defined by longitude. The values should be reported in decimal degrees and in the WGS84 system.
geographic_location_region_and_locality: The geographical origin of the sample as defined by the specific region name followed by the locality name.
identified_by: Name of the expert who identified the specimen taxonomically.
elevation: The elevation of the sampling site as measured by the vertical distance from mean sea level.
habitat: Description of the location of the sample material. Please use EnvO terms where possible: https://www.ebi.ac.uk/ols/ontologies/envo
identifier_affiliation: The university, institution, or society responsible for identifying the specimen.
original_collection_date: For use if the specimen is from a zoo, botanic garden, culture collection etc. and has a known original date of collection. If no exact time is available, the date/time can be right-truncated, i.e. all of these are valid ISO 8601 compliant times: 2008-01-23T19:23:10+00:00; 2008-01-23T19:23:10; 2008-01-23; 2008-01; 2008.
original_geographic_location: For use if the specimen is from a zoo, botanic garden or culture collection etc. and has a known origin elsewhere. Please record the general description of the original collection location. This should be formatted as a country, optionally followed by more specific locations ordered from least to most specific and separated by a | character, e.g. "United Kingdom | East Anglia | Norfolk | Norwich | University of East Anglia | UEA Broad".
original_geographic_location_latitude: For use if the specimen is from a zoo, botanic garden or culture collection etc. and has a known origin elsewhere. Please record the geographic location where the specimen or sample was originally taken as defined by latitude. The values should be reported in decimal degrees and in the WGS84 system.
original_geographic_location_longitude: For use if the specimen is from a zoo, botanic garden or culture collection etc. and has a known origin elsewhere. Please record the geographic location where the specimen or sample was originally taken as defined by longitude. The values should be reported in decimal degrees and in the WGS84 system.
sample_coordinator: The name of the sample coordinator.
sex: Sex of the organism from which the sample was obtained.
geographic_location_country_andor_sea: The geographical origin of where the sample was collected from, with the intention of sequencing, as defined by the country or sea name. Country or sea names should be chosen from the INSDC country list (http://insdc.org/country.html).
collecting_institution: Name of the institution to which the person collecting the specimen belongs. Format: Institute Name, Institute Address
gal: The name (or acronym) of the genome acquisition lab responsible for the sample.
specimen_id: Unique identifier used to link all data for the recorded specimen.
gal_sample_id: Unique name assigned to the sample by the genome acquisition lab.
proxy_voucher: For use if voucher material needs to be made from a specimen that is different from the one submitted for sequencing. Please record the unique identifier that references the physical specimen and that ideally exists in a curated collection. The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the voucher id (institution_code:collection_code:voucher_id). Please note that institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/
proxy_biomaterial: For use if voucher material needs to be made from a material that is different from the one submitted for sequencing. Please record the unique identifier that references the biomaterial and that ideally exists in a curated collection (e.g. stock centres, seed banks, DNA banks). The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the material id (institution_code:collection_code:material_id). Please note that institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/
bio_material: Unique identifier that references the biological material from which the sample was obtained and that ideally exists in a curated collection (e.g. stock centres, seed banks, DNA banks). The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the voucher id (institution_code:collection_code:voucher_id). Please note that institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/
specimen_voucher: Unique identifier that references the physical specimen that remains after the sequence has been obtained and that ideally exists in a curated collection. The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the voucher id (institution_code:collection_code:voucher_id). Please note that institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/
culture_or_strain_id: Living, culturable, named laboratory strain that the sequenced material is derived from.
depth: The vertical distance below local surface, e.g. for sediment or soil samples depth is measured from sediment or soil surface, respectively. Depth can be reported as an interval for subsurface samples.
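
Several of the fields above prescribe a value format (decimal-degree coordinates, right-truncated ISO 8601 dates, INSDC accession references, and colon-separated voucher identifiers). The following Python sketch shows one way a submitter might pre-check values before upload. It is an illustration only, not the validation performed by ENA: all function names are hypothetical, the accession pattern assumes an "ERS" prefix followed by digits as in the ERSxxxxxx examples, date intervals are not handled, and institution and collection codes are not checked against the INSDC controlled vocabulary.

```python
import re
from datetime import datetime

# Plain decimal notation only (no exponents), optional sign.
DECIMAL_RE = re.compile(r"^[+-]?\d+(\.\d+)?$")

# Right-truncated ISO 8601: year, then optional month, day, time, UTC offset.
DATE_RE = re.compile(
    r"^(\d{4})(?:-(\d{2})(?:-(\d{2})"
    r"(?:T(\d{2}):(\d{2})(?::(\d{2}))?(?:Z|[+-]\d{2}:\d{2})?)?)?)?$"
)

# Assumption: accessions look like "ERS" + digits; a single accession,
# a comma-separated list, or a range joined by a hyphen.
ACCESSION_REF_RE = re.compile(r"^ERS\d+(?:(?:,ERS\d+)+|-ERS\d+)?$")


def _is_decimal_in_range(value: str, limit: float) -> bool:
    """True if value is plain decimal text within [-limit, +limit]."""
    if not DECIMAL_RE.match(value):
        return False
    return -limit <= float(value) <= limit


def is_valid_latitude(value: str) -> bool:
    """Decimal degrees, North = +, South = -."""
    return _is_decimal_in_range(value, 90.0)


def is_valid_longitude(value: str) -> bool:
    """Decimal degrees, East = +, West = -."""
    return _is_decimal_in_range(value, 180.0)


def is_valid_date(value: str) -> bool:
    """Accept 2008, 2008-01, 2008-01-23, and full date/time forms."""
    match = DATE_RE.match(value)
    if not match:
        return False
    # Fill truncated components with defaults, then let datetime
    # reject impossible values such as month 13 or day 31 in February.
    defaults = (1, 1, 1, 0, 0, 0)
    parts = [int(g) if g is not None else d
             for g, d in zip(match.groups(), defaults)]
    try:
        datetime(*parts)
    except ValueError:
        return False
    return True


def is_valid_accession_reference(value: str) -> bool:
    """Single accession, comma-separated list, or range (no spaces)."""
    return bool(ACCESSION_REF_RE.match(value))


def is_valid_voucher_id(value: str) -> bool:
    """institution_code:voucher_id or institution_code:collection_code:voucher_id."""
    parts = value.split(":")
    return len(parts) in (2, 3) and all(p.strip() for p in parts)
```

For instance, `is_valid_longitude("-096.1012")` and `is_valid_date("2008-01")` both pass under these assumptions, while `is_valid_accession_reference("ERS1, ERS2")` fails because of the space after the comma.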