Skip to main content
Version: 0.0.1

📋 PEGASUS Evidence Matrix Standard

Genomic Identifiers

Column headerData formatDescriptionRequirementExample data
Primary Variant IDchr:bp:ref:altThe variant to which variant-centric evidence relates. Used as the primary row ID; may be a lead variant, a variant in LD, or a fine-mapped SNP (defined in metadata).mandatorychr10:114754071:T:C
rsIDrs[]The rsID of the primary variant.optionalrs1234
VAR_[xyz]bespokeAdditional variant ID columns. Custom names must follow VAR_[xyz] and be defined in the metadata file.optionalbespoke

Evidence — General Pattern

All variant-centric evidence columns are optional. However, we suggest to include at least one variant-centric evidence to support variant-gene relationship.

We define a general reporting pattern:

Column headerData FormatDescriptionRequirementExample data
Category_[xyz]BesopkeMost headers follow the format Category_(stream)_[xyz].

Category is mandatory;
stream is used only if it differs from the category;
[xyz] can be any user-defined label.

e.g. GWAS_pvalue, EXP_AdiposeTissue_TPM, QTL_eQTL_pancreas,TPWAS_TWAS_pvalue. The category must be from the controlled list and defined in the metadata file.
optionalvariant-centric evidence examples;

gene-centric evidence examples

These are not strict requirements. Different categories may call for different types of data, and users can adapt them as needed. For guidance, we provide reference guidelines for the general evidence categories. Each category — variant-centric, gene-centric, comes with suggested naming patterns and example formats.

Integration Evidence — General Pattern

Column headerData FormatDescriptionRequirementExample data
INT_[details]Bespoke

Headers may follow the format INT_[details] (or INT alone).

INT indicates integration evidence; [details] is a user-defined suffix when multiple integrations are reported.

For multi-word field names, use CamelCase (e.g., CredibleSetId).

Provenance and integration specifics can differ by row; capture them in the metadata file and, if they vary within the dataset, also in the data file.

optionalIntegration evidence example