MIxS Checklist · ERC000053

Tree of Life Checklist

Minimum information required for reporting samples associated with the Tree of Life Programme (https://www.sanger.ac.uk/programme/tree-of-life/).

42 fields · 10 required · mixsv1.0.0
SeqDesk offers this GSC MIxS environment checklist when you define studies and samples, so contributors enter exactly the metadata the standard expects. The catalog is kept in sync with the upstream GSC MIxS / ENA checklist definitions (registry v2), last updated 2026-06-03, and is published through our registry API. Source: https://www.ebi.ac.uk/ena/browser/view/ERC000053.

Marine Event

Latitude Start · Optional
latitude_start

Latitude of the location where the sampling event started, e.g. each CTD cast, net tow, or bucket collection is a distinct event. Format: ##.####, Decimal degrees; North= +, South= -; Use WGS 84 for GPS data. Example: -24.6666.

text
Longitude Start · Optional
longitude_start

Longitude of the location where the sampling event started, e.g. each CTD cast, net tow, or bucket collection is a distinct event. Format: ###.####, Decimal degrees; East= +, West= -; Use WGS 84 for GPS data. Example: -096.1012.

text
Latitude End · Optional
latitude_end

Latitude of the location where the sampling event ended, e.g. each CTD cast, net tow, or bucket collection is a distinct event. Format: ##.####, Decimal degrees; North= +, South= -; Use WGS 84 for GPS data. Example: -24.6643.

text
Longitude End · Optional
longitude_end

Longitude of the location where the sampling event ended, e.g. each CTD cast, net tow, or bucket collection is a distinct event. Format: ###.####, Decimal degrees; East= +, West= -; Use WGS 84 for GPS data. Example: -096.1171.

text
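
The four Marine Event coordinates share one format hint (signed decimal degrees with four decimal places). As a rough sketch of how a submitter might pre-check values before upload, the helper below applies that hint plus the usual WGS 84 ranges. The function name and the strict four-decimal rule are assumptions made for illustration; the checklist itself stores these fields as free text.

```python
import re

# Hypothetical patterns derived from the "##.####" and "###.####" format hints.
LAT_RE = re.compile(r"[+-]?\d{1,2}\.\d{4}")
LON_RE = re.compile(r"[+-]?\d{1,3}\.\d{4}")


def is_valid_coordinate(value: str, kind: str) -> bool:
    """Return True if value matches the format hint and lies within range.

    kind is "latitude" or "longitude" (an assumption of this sketch).
    """
    if kind == "latitude":
        pattern, limit = LAT_RE, 90.0
    else:
        pattern, limit = LON_RE, 180.0
    if pattern.fullmatch(value) is None:
        return False
    return abs(float(value)) <= limit
```

With this sketch, the examples from the field descriptions (-24.6666 as a latitude, -096.1012 as a longitude) pass, while a latitude of 96.1012 is rejected as out of range.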

Part and developmental stage of organism

organism part · Required
organism_part

The part of organism's anatomy or substance arising from an organism from which the biomaterial was derived, excludes cells.

text
lifestage · Required
lifestage

The age class or life stage of the organism at the time of collection.

select · 23 options

Organism characteristics: ecosystem

relationship · Optional
relationship

Indicates whether the specimen has a known relationship to another specimen (e.g. parental, child, sibling or other kind of relationship).

text
sample symbiont of · Optional
sample_symbiont_of

Reference to host sample from symbiont. The referenced sample should already be registered in INSDC. E.g. ERSxxxxxx

text
symbiont · Optional
symbiont

Used to separate host and symbiont metadata within a symbiont system, where host species are indicated as 'N' and symbionts are indicated as 'Y'.

select · 2 options

sample collection: methods, storage and transport

sample collection method · Optional
sample_collection_method

The method employed for collecting the sample. Can be provided in the form of a PMID, DOI, URL or text.

text

sample collection: site related

sample coordinator affiliation · Optional
sample_coordinator_affiliation

The university, institution, or society affiliation of the sample coordinator.

text

sample collection: core sample properties

sample same as · Optional
sample_same_as

Reference to sample(s) that are equivalent. The referenced sample(s) should already be registered in INSDC. This should be formatted as one of the following. A single sample e.g. ERSxxxxxx OR a comma separated list e.g. ERSxxxxxx,ERSxxxxxx

text
sample derived from · Optional
sample_derived_from

Reference to parental sample(s) or original run(s) that the assembly is derived from. The referenced samples or runs should already be registered in INSDC. This should be formatted as one of the following. A single sample/run e.g. ERSxxxxxx OR a comma separated list e.g. ERSxxxxxx,ERSxxxxxx OR a range e.g. ERSxxxxxx-ERSxxxxxx

text
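
The reference formats described for sample_same_as (single accession or comma-separated list) and sample_derived_from (which additionally allows a range) can be checked with a small pattern. This is only a sketch: the accession pattern below is an assumption standing in for the "ERSxxxxxx" placeholder, and the function name is hypothetical. It also rejects spaces after commas, which the checklist text does not explicitly forbid.

```python
import re

# Assumption: accessions look like ERS/DRS/SRS (samples) or ERR/DRR/SRR (runs)
# followed by at least six digits. The checklist only shows "ERSxxxxxx".
ACCESSION = r"[EDS]R[SR]\d{6,}"
LIST_RE = re.compile(rf"{ACCESSION}(,{ACCESSION})*")
RANGE_RE = re.compile(rf"{ACCESSION}-{ACCESSION}")


def is_valid_reference(value: str, allow_range: bool = False) -> bool:
    """Check a single accession, a comma-separated list, or (optionally) a range."""
    if LIST_RE.fullmatch(value):
        return True
    return allow_range and RANGE_RE.fullmatch(value) is not None
```

Under these assumptions, sample_same_as would be checked with the default arguments and sample_derived_from with allow_range=True.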

non-sample terms: study or project

project name · Required
project_name

Name of the project within which the sequencing was organized.

text

non-sample terms

barcoding center · Optional
barcoding_center

Center where DNA barcoding was/will be performed.

text
tolid · Optional
tolid

A ToLID (Tree of Life ID) is a unique and easy-to-communicate sample identifier that provides species recognition, differentiates between specimens of the same species and adds taxonomic context. ToLIDs are issued by id.tol.sanger.ac.uk. They are endorsed by the Earth BioGenome Project (EBP) and should be assigned to any sample associated with the EBP.

text

Collection event information

collected_by · Required
collected_by

Name of the person(s) or institute that collected the specimen.

text
collection date · Required
collection_date

The date the sample was collected with the intention of sequencing, either as an instance (single point in time) or interval. In case no exact time is available, the date/time can be right truncated i.e. all of these are valid ISO8601 compliant times: 2008-01-23T19:23:10+00:00; 2008-01-23T19:23:10; 2008-01-23; 2008-01; 2008.

text
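
The right-truncated ISO 8601 forms listed for collection_date can be tried one after another with the standard library. The sketch below is illustrative only: the function name is hypothetical, it covers the five example shapes for a single point in time and not intervals, and strptime is somewhat lenient (it accepts unpadded months and days, for instance).

```python
from datetime import datetime

# Formats matching the five examples in the field description,
# from least to most specific.
_FORMATS = [
    "%Y",
    "%Y-%m",
    "%Y-%m-%d",
    "%Y-%m-%dT%H:%M:%S",
    "%Y-%m-%dT%H:%M:%S%z",
]


def is_valid_collection_date(value: str) -> bool:
    """Return True if value parses under one of the truncated formats."""
    for fmt in _FORMATS:
        try:
            datetime.strptime(value, fmt)
            return True
        except ValueError:
            continue
    return False
```

The same check would apply to original_collection_date, whose description lists the same example values.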
geographic location (latitude) · Optional
geographic_location_latitude

The geographical origin of the sample as defined by latitude. The values should be reported in decimal degrees and in the WGS84 system.

text
geographic location (longitude) · Optional
geographic_location_longitude

The geographical origin of the sample as defined by longitude. The values should be reported in decimal degrees and in the WGS84 system.

text
geographic location (region and locality) · Required
geographic_location_region_and_locality

The geographical origin of the sample as defined by the specific region name followed by the locality name.

text
identified_by · Optional
identified_by

Name of the expert who identified the specimen taxonomically.

text
elevation · Optional
elevation

The elevation of the sampling site as measured by the vertical distance from mean sea level.

text
habitat · Required
habitat

Description of the location of the sample material. Please use EnvO terms where possible: https://www.ebi.ac.uk/ols/ontologies/envo

text
identifier_affiliation · Optional
identifier_affiliation

The university, institution, or society responsible for identifying the specimen.

text
original collection date · Optional
original_collection_date

For use if the specimen is from a zoo, botanic garden, culture collection etc. and has a known original date of collection. In case no exact time is available, the date/time can be right truncated i.e. all of these are valid ISO8601 compliant times: 2008-01-23T19:23:10+00:00; 2008-01-23T19:23:10; 2008-01-23; 2008-01; 2008.

text
original geographic location · Optional
original_geographic_location

For use if the specimen is from a zoo, botanic garden or culture collection etc. and has a known origin elsewhere. Please record the general description of the original collection location. This should be formatted as a country and optionally include more specific locations ranging from least to most specific separated by a | character, e.g. "United Kingdom | East Anglia | Norfolk | Norwich | University of East Anglia | UEA Broad".

text
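
The pipe-separated layout described for original_geographic_location (country first, then progressively more specific places) can be split into its levels as follows. This is a minimal sketch with a hypothetical function name; it does not check the country against any controlled list.

```python
def parse_location(value: str) -> list[str]:
    """Split a 'country | region | ...' string into its levels.

    The first element is taken to be the country; any further elements
    run from least to most specific. Empty levels are rejected.
    """
    parts = [part.strip() for part in value.split("|")]
    if any(not part for part in parts):
        raise ValueError(f"empty location level in {value!r}")
    return parts
```

Applied to the example in the field description, the first element returned is "United Kingdom" and the last is "UEA Broad".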
original geographic location (latitude) · Optional
original_geographic_location_latitude

For use if the specimen is from a zoo, botanic garden or culture collection etc. and has a known origin elsewhere. Please record the geographic location where the specimen or sample was originally taken as defined by latitude. The values should be reported in decimal degrees and in the WGS84 system.

text
original geographic location (longitude) · Optional
original_geographic_location_longitude

For use if the specimen is from a zoo, botanic garden or culture collection etc. and has a known origin elsewhere. Please record the geographic location where the specimen or sample was originally taken as defined by longitude. The values should be reported in decimal degrees and in the WGS84 system.

text

sample collection

sample coordinator · Optional
sample_coordinator

The name of the sample coordinator.

text

Organism characteristics

sex · Required
sex

Sex of the organism from which the sample was obtained.

text
geographic location (country and/or sea) · Required
geographic_location_country_andor_sea

The geographical origin of where the sample was collected from, with the intention of sequencing, as defined by the country or sea name. Country or sea names should be chosen from the INSDC country list (http://insdc.org/country.html).

select · 294 options

General collection event information

collecting institution · Required
collecting_institution

Name of the institution to which the person collecting the specimen belongs. Format: Institute Name, Institute Address

text
GAL · Optional
gal

The name (or acronym) of the genome acquisition lab responsible for the sample.

select · 55 options

Pointer to physical material

specimen_id · Optional
specimen_id

Unique identifier used to link all data for the recorded specimen.

text
GAL_sample_id · Optional
gal_sample_id

Unique name assigned to the sample by the genome acquisition lab.

text
proxy voucher · Optional
proxy_voucher

For use if voucher material needs to be made from a specimen that is different from the one submitted for sequencing. Please record the unique identifier that references the physical specimen and that ideally exists in a curated collection. The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the voucher id (institution_code:collection_code:voucher_id). Please note institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/

text
proxy biomaterial · Optional
proxy_biomaterial

For use if voucher material needs to be made from a material that is different from the one submitted for sequencing. Please record the unique identifier that references the biomaterial and that ideally exists in a curated collection (e.g. stock centres, seed banks, DNA banks). The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the material id (institution_code:collection_code:material_id). Please note institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/

text
bio_material · Optional
bio_material

Unique identifier that references the biological material from which the sample was obtained and that ideally exists in a curated collection (e.g. stock centres, seed banks, DNA banks). The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the voucher id (institution_code:collection_code:voucher_id). Please note institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/

text
specimen_voucher · Optional
specimen_voucher

Unique identifier that references the physical specimen that remains after the sequence has been obtained and that ideally exists in a curated collection. The ID should have the following structure: name of the institution (institution code) followed by the collection code (if available) and the voucher id (institution_code:collection_code:voucher_id). Please note institution codes and collection codes are taken from a controlled vocabulary maintained by the INSDC: https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/

text
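
The four identifier fields above (proxy_voucher, proxy_biomaterial, bio_material, specimen_voucher) share the colon-separated structure institution_code:collection_code:id, with the collection code given only if available. A minimal parsing sketch follows; the type and function names are hypothetical, the codes in the usage note are placeholders, and treating a two-part value as "institution code plus id" is an assumption about how the optional collection code is omitted. It does not look codes up in the INSDC biocollections vocabulary.

```python
from typing import NamedTuple, Optional


class CollectionId(NamedTuple):
    institution_code: str
    collection_code: Optional[str]  # None when no collection code is given
    item_id: str  # voucher id or material id


def parse_collection_id(value: str) -> CollectionId:
    """Split 'institution:collection:id' or 'institution:id' into parts."""
    parts = [part.strip() for part in value.split(":")]
    if len(parts) == 3:
        institution, collection, item = parts
    elif len(parts) == 2:
        institution, item = parts
        collection = None
    else:
        raise ValueError(f"expected 2 or 3 colon-separated parts: {value!r}")
    if not institution or not item or collection == "":
        raise ValueError(f"empty component in {value!r}")
    return CollectionId(institution, collection, item)
```

For example, parse_collection_id("INST:COLL:12345") yields all three components, while parse_collection_id("INST:12345") leaves collection_code as None.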

Infraspecies information

culture_or_strain_id · Optional
culture_or_strain_id

Living, culturable, named laboratory strain that the sequenced material is derived from.

text

Environmental information

depth · Optional
depth

The vertical distance below local surface, e.g. for sediment or soil samples depth is measured from sediment or soil surface, respectively. Depth can be reported as an interval for subsurface samples.

text