Skip to main content

πŸ’‘ Illustrative Examples of Evidence Matrix Columns

The tables below show examples of how PEG evidence matrix columns can be named and formatted.
These examples are not mandatory fields β€” they are provided to demonstrate recommended naming patterns, data formats, and reporting styles.

  • Projects may define additional or alternative columns, we recommend following these general conventions.
  • Metadata should provide comprehensive information to understand the data type, provenance, and scale used for each column.
Evidence CategoryColumn headerData FormatDescriptionRequirementExample data
GWASGWAS_pvalueExponent or βˆ’log10P-value of the primary variant in the source GWAS. Specify whether exponent (e.g. 4Γ—10⁻⁹) or βˆ’log10 scale in the metadata file.optional4Γ—10⁻⁹
ProximityPROX_nearest_genebooleanIndicates whether the variant is the nearest gene. Details on how distance is derived (e.g. to TSS, to gene footprint) should be documented in the metadata.optionalN
QTLQTL_eQTL_pancreas_pvalueexponent or -log10Significance value for eQTL association in pancreas tissue.optional0.01
QTLQTL_eQTL_pancreas_CIrangeConfidence interval for the eQTL effect. Define confidence level (e.g. 95%) in metadata.optional[1.2, 2.5]
FunctionalFUNC_CADDfloatCADD functional prediction score. Specify genome build and release in the metadata.optional15.62
Fine-mappingFM_credible_set_IDstringIdentifier of the credible set variant from fine-mapping.optionalchr10:114754071:T:C
Fine-mappingFM_PIPfloatPosterior inclusion probability (PIP) from fine-mapping.optional0.98
ColocCOLOC_PPH4floatColocalisation posterior probability that both traits share a causal variant (PPH4).optional0.85