Skip to content

Output

Outputs at a glance

KGGSum operates using a task-by-task model. Each task generates efficient intermediate files (including binary variants*.gtb outputs) that store harmonized variants, statistics, and annotations. This design lets you resume interrupted workflows or re-run with new parameters without repeating completed steps (you can also branch workflows).

Option Description Default
--output Specify the output folder path. All data from each task will be stored under this folder, preserving intermediate files and helping you avoid duplicate work.
Format: --output <dir>
Example: --output ./out/test
./kggsum
--clean-intermediate-data Remove intermediate data to reduce disk usage and memory pressure.
Format: --clean-intermediate-data
[OFF]

While each analysis has its own tasks, some common task outputs are listed below.

File Description
ConvertVCF2GTBTask*.gtb The gtb format file converted from the input VCF file by --ref-gty-file
GenerateRootVariantSetTask/variants.annot.hg38.gtb The variants extracted from the reference genotype file specified by --ref-gty-file in gtb format will be used as the base for the following analysis.
AppendVariants2RootVariantSetTask/variants.annot.hg38.gtb The base variants appended with GWAS summary statistics specified by the --sum-file
GeneFeatureAnnotationTask/variants.annot.hg38.gtb The variants annotated with gene features subsequently
OutputVariants2TSVTask/variants.hg38.tsv.gz The GWAS summary and annotations of variants retained for analysis in TSV format.