dbNSFP

The GTB format dbNSFP database retains only the functional prediction scores ((including rank scores, totaling 86 fields) and conservation scores (including rank scores, totaling 19 fields) from dbNSFP4.4a. The Field descriptions are sourced from the dbNSFP4.4.a README file.

#Field	Description	Type
SIFT_score	SIFT score (SIFTori). Scores range from 0 to 1. The smaller the score the more likely the SNP has damaging effect.	Float
SIFT_converted_rankscore	SIFTori scores were first converted to SIFTnew=1-SIFTori, then ranked among all SIFTnew scores in dbNSFP. The rankscore is the ratio of the rank the SIFTnew score over the total number of SIFTnew scores in dbNSFP.	Float
SIFT4G_score	SIFT 4G score (SIFT4G). Scores range from 0 to 1. The smaller the score the more likely the SNP has damaging effect.	Float
SIFT4G_converted_rankscore	SIFT4G scores were first converted to SIFT4Gnew=1-SIFT4G, then ranked among all SIFT4Gnew scores in dbNSFP. The rankscore is the ratio of the rank the SIFT4Gnew score over the total number of SIFT4Gnew scores in dbNSFP.	Float
PROVEAN_score	PROVEAN score (PROVEANori). The smaller the score the more likely the SNP has damaging effect. Range: [-14, 14].	Float
PROVEAN_converted_rankscore	PROVEANori were first converted to PROVEANnew=1-(PROVEANori+14)/28, then ranked among all PROVEANnew scores in dbNSFP. The rankscore is the ratio of the rank the PROVEANnew score over the total number of PROVEANnew scores in dbNSFP.	Float
Polyphen2_HDIV_score	Polyphen2 score based on HumDiv, i.e. hdiv_prob. Range: [0, 1].	Float
Polyphen2_HDIV_rankscore	Polyphen2 HDIV scores were first ranked among all HDIV scores in dbNSFP. The rankscore is the ratio of the rank the score over the total number of the scores in dbNSFP.	Float
Polyphen2_HVAR_score	Polyphen2 score based on HumVar, i.e. hvar_prob. Range: [0, 1].	Float
Polyphen2_HVAR_rankscore	Polyphen2 HVAR scores were first ranked among all HVAR scores in dbNSFP. The rankscore is the ratio of the rank the score over the total number of the scores in dbNSFP.	Float
LRT_score	The original LRT two-sided p-value (LRTori). Range: [0, 1].	Float
LRT_converted_rankscore	LRTori scores were first converted as LRTnew=1-LRTori0.5 if Omega<1, or LRTnew=LRTori0.5 if Omega>=1. Then LRTnew scores were ranked among all LRTnew scores in dbNSFP. The rankscore is the ratio of the rank over the total number of the scores in dbNSFP.	Float
LRT_Omega	estimated nonsynonymous-to-synonymous-rate ratio (Omega, reported by LRT)	Float
MutationTaster_score	MutationTaster p-value (MTori). Range: [0, 1].	Float
MutationTaster_converted_rankscore	The MTori scores were first converted. If the prediction is "A" or "D" MTnew=MTori; if the prediction is "N" or "P", MTnew=1-MTori. Then MTnew scores were ranked among all MTnew scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of MTnew scores in dbNSFP.	Float
MutationAssessor_score	MutationAssessor functional impact combined score (MAori). Range: [-5.17, 6.49].	Float
MutationAssessor_rankscore	MAori scores were ranked among all MAori scores in dbNSFP.The rankscore is the ratio of the rank of the score over the total number of MAori scores in dbNSFP.	Float
FATHMM_score	FATHMM default score (weighted for human inherited-disease mutations with Disease Ontology) (FATHMMori). The smaller the score the more likely the SNP has damaging effect.Range: [-16.13, 10.64].	Float
FATHMM_converted_rankscore	FATHMMori scores were first converted to FATHMMnew=1-(FATHMMori+16.13)/26.77, then ranked among all FATHMMnew scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of FATHMMnew scores in dbNSFP.	Float
fathmm-MKL_coding_score	fathmm-MKL p-values. Range: [0, 1].	Float
fathmm-MKL_coding_rankscore	fathmm-MKL coding scores were ranked among all fathmm-MKL coding scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of fathmm-MKL coding scores in dbNSFP.	Float
fathmm-XF_coding_score	fathmm-XF p-values. Range: [0, 1].	Float
fathmm-XF_coding_rankscore	fathmm-XF coding scores were ranked among all fathmm-XF coding scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of fathmm-XF coding scores in dbNSFP.	Float
CADD_raw	CADD raw score for functional prediction of a SNP. The larger the score the more likely the SNP has damaging effect. Range:[-6.458163, 18.301497].	Float
CADD_raw_rankscore	CADD raw scores were ranked among all CADD raw scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of CADD raw scores in dbNSFP.	Float
CADD_phred	CADD phred-like score. This is phred-like rank score based on whole genome CADD raw scores. The larger the score the more likely the SNP has damaging effect.	Float
VEST4_score	VEST 4.0 score. The larger the score the more likely the mutation may cause functional change. Range: [0, 1].	Float
VEST4_rankscore	VEST4 scores were ranked among all VEST4 scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of VEST4 scores in dbNSFP.	Float
MetaSVM_score	Support vector machine (SVM) based ensemble prediction score, which incorporated 10 scores (SIFT, PolyPhen-2 HDIV, PolyPhen-2 HVAR, GERP++, MutationTaster, Mutation Assessor, FATHMM, LRT, SiPhy, PhyloP) and the maximum frequency observed in the 1000 genomes populations. Larger value means the SNV is more likely to be damaging. Range: [-2, 3].	Float
MetaSVM_rankscore	MetaSVM scores were ranked among all MetaSVM scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of MetaSVM scores in dbNSFP.	Float
MetaLR_score	Logistic regression (LR) based ensemble prediction score, which incorporated 10 scores (SIFT, PolyPhen-2 HDIV, PolyPhen-2 HVAR, GERP++, MutationTaster, Mutation Assessor, FATHMM, LRT, SiPhy, PhyloP) and the maximum frequency observed in the 1000 genomes populations. Larger value means the SNV is more likely to be damaging. Range: [0, 1].	Float
MetaLR_rankscore	MetaLR scores were ranked among all MetaLR scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of MetaLR scores in dbNSFP.	Float
MetaRNN_score	Recurrent neural network (RNN) based ensemble prediction score, which incorporated 16 scores (SIFT, Polyphen2_HDIV, Polyphen2_HVAR, MutationAssessor, PROVEAN, VEST4, M-CAP, REVEL, MutPred, MVP, PrimateAI, DEOGEN2, CADD, fathmm-XF, Eigen and GenoCanyon), 8 conservation scores (GERP, phyloP100way_vertebrate, phyloP30way_mammalian, phyloP17way_primate, phastCons100way_vertebrate, phastCons30way_mammalian, phastCons17way_primate and SiPhy), and allele frequency information from the 1000 Genomes Project (1000GP), ExAC, and gnomAD. Larger value means the SNV is more likely to be damaging. Range: [0, 1].	Float
MetaRNN_rankscore	MetaRNN scores were ranked among all MetaRNN scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of MetaRNN scores in dbNSFP.	Float
M-CAP_score	M-CAP is hybrid ensemble score. The larger the score the more likely the SNP has damaging effect. Range: [0, 1].	Float
M-CAP_rankscore	M-CAP scores were ranked among all M-CAP scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of M-CAP scores in dbNSFP.	Float
REVEL_score	REVEL is an ensemble score based on 13 individual scores for predicting the pathogenicity of missense variants. The larger the score the more likely the SNP has damaging effect. Range: [0, 1].	Float
REVEL_rankscore	REVEL scores were ranked among all REVEL scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of REVEL scores in dbNSFP.	Float
MutPred_score	General MutPred score. The larger the score the more likely the SNP has damaging effect. Range: [0, 1].	Float
MutPred_rankscore	MutPred scores were ranked among all MutPred scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of MutPred scores in dbNSFP.	Float
MVP_score	A pathogenicity prediction score for missense variants using deep learning approach. The larger the score, the more likely the variant is pathogenic. Range: [0, 1].	Float
MVP_rankscore	MVP scores were ranked among all MVP scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of MVP scores in dbNSFP.	Float
gMVP_score	A pathogenicity prediction score for missense variants using a graph attention neural network model. The larger the score, the more likely the variant is pathogenic. Range: [0, 1].	Float
gMVP_rankscore	gMVP scores were ranked among all gMVP scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of gMVP scores in dbNSFP.	Float
MPC_score	A deleteriousness prediction score for missense variants based on regional missense constraint. The larger the score, the more likely the variant is pathogenic. Range: [0, 5].	Float
MPC_rankscore	MPC scores were ranked among all MPC scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of MPC scores in dbNSFP.	Float
PrimateAI_score	A pathogenicity prediction score for missense variants based on common variants of non-human primate species using a deep neural network. The larger the score, the more likely the variant is pathogenic. Range: [0, 1].	Float
PrimateAI_rankscore	PrimateAI scores were ranked among all PrimateAI scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of PrimateAI scores in dbNSFP.	Float
DEOGEN2_score	A deleteriousness prediction score "which incorporates heterogeneous information about the molecular effects of the variants, the domains involved, the relevance of the gene and the interactions in which it participates". The larger the score, the more likely the variant is deleterious. Range: [0, 1].	Float
DEOGEN2_rankscore	DEOGEN2 scores were ranked among all DEOGEN2 scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of DEOGEN2 scores in dbNSFP.	Float
BayesDel_addAF_score	A deleteriousness preidction meta-score for SNVs and indels with inclusion of MaxAF. The higher the score, the more likely the variant is pathogenic. Range: [-1.11707, 0.750927].	Float
BayesDel_addAF_rankscore	BayesDel_addAF scores were ranked among all BayesDel_addAF scores in dbNSFP.The rankscore is the ratio of the rank of the score over the total number of BayesDel_addAF scores in dbNSFP.	Float
BayesDel_noAF_score	A deleteriousness preidction meta-score for SNVs and indels without inclusion of MaxAF. The higher the score, the more likely the variant is pathogenic. Range: [-1.31914, 0.840878].	Float
BayesDel_noAF_rankscore	BayesDel_noAF scores were ranked among all BayesDel_noAF scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of BayesDel_noAF scores in dbNSFP.	Float
ClinPred_score	A deleteriousness preidction meta-score for nonsynonymous SNVs. The higher the score, the more likely the variant is pathogenic. Range: [0, 1].	Float
ClinPred_rankscore	ClinPred scores were ranked among all ClinPred scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of ClinPred scores in dbNSFP.	Float
LIST-S2_score	A deleteriousness preidction score for nonsynonymous SNVs. The higher the score, the more likely the variant is pathogenic. Range: [0, 1].	Float
LIST-S2_rankscore	LIST-S2 scores were ranked among all LIST-S2 scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of LIST-S2 scores in dbNSFP.	Float
VARITY_R_score	VARITY_R scores are pathogenicity prediction scores for rare human missense variants. The larger the score, the more likely the variant is pathogenic. Range: [0, 1].	Float
VARITY_R_rankscore	VARITY_R scores were ranked among all VARITY_R scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of VARITY_R scores in dbNSFP.	Float
VARITY_ER_score	VARITY_ER scores are pathogenicity prediction scores for extreme rare human missense variants. The larger the score, the more likely the variant is pathogenic. Range: [0, 1].	Float
VARITY_ER_rankscore	VARITY_ER scores were ranked among all VARITY_ER scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of VARITY_ER scores in dbNSFP.	Float
VARITY_R_LOO_score	Same as VARITY_R except the prediction on the variants used for training was made using Leave-One-Variant out.	Float
VARITY_R_LOO_rankscore	VARITY_R_LOO scores were ranked among all VARITY_R_LOO scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of VARITY_R_LOO scores in dbNSFP.	Float
VARITY_ER_LOO_score	Same as VARITY_ER except the prediction on the variants used for training was made using Leave-One-Variant out.	Float
VARITY_ER_LOO_rankscore	VARITY_ER_LOO scores were ranked among all VARITY_ER_LOO scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of VARITY_ER_LOO scores in dbNSFP.	Float
DANN_score	DANN is a functional prediction score retrained based on the training data of CADD using deep neural network. A larger number indicate a higher probability to be damaging. Range: [0, 1].	Float
DANN_rankscore	DANN scores were ranked among all DANN scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of DANN scores in dbNSFP.	Float
Eigen-raw_coding	Eigen score for coding SNVs. A functional prediction score based on conservation, allele frequencies, and deleteriousness prediction using an unsupervised learning method.	Float
Eigen-raw_coding_rankscore	Eigen-raw scores were ranked among all Eigen-raw scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of Eigen-raw scores in dbNSFP.	Float
Eigen-phred_coding	Eigen score in phred scale.	Float
Eigen-PC-raw_coding	Eigen PC score for genome-wide SNVs. A functional prediction score based on conservation, allele frequencies, deleteriousness prediction (for missense SNVs) and epigenomic signals (for synonymous and non-coding SNVs) using an unsupervised learning method.	Float
Eigen-PC-raw_coding_rankscore	Eigen-PC-raw scores were ranked among all Eigen-PC-raw scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of Eigen-PC-raw scores in dbNSFP.	Float
Eigen-PC-phred_coding	Eigen PC score in phred scale.	Float
GenoCanyon_score	A functional prediction score based on conservation and biochemical annotations using an unsupervised statistical learning.	Float
GenoCanyon_rankscore	GenoCanyon_score scores were ranked among all integrated fitCons scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of GenoCanyon_score scores in dbNSFP.	Float
integrated_fitCons_score	fitCons score predicts the fraction of genomic positions belonging to a specific function class (defined by epigenomic "fingerprint") that are under selective pressure. A larger score indicating a higher proportion of nucleic sites of the functional class the genomic position belong to are under selective pressure, therefore more likely to be functional important. Integrated (i6) scores are integrated across three cell types (GM12878, H1-hESC and HUVEC). Range: [0, 1].	Float
integrated_fitCons_rankscore	integrated fitCons scores were ranked among all integrated fitCons scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of integrated fitCons scores in dbNSFP.	Float
GM12878_fitCons_score	GM12878 fitCons scores are based on cell type GM12878.	Float
GM12878_fitCons_rankscore	GM12878 fitCons scores were ranked among all GM12878 fitCons scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of GM12878 fitCons scores in dbNSFP.	Float
H1-hESC_fitCons_score	H1-hESC fitCons scores are based on cell type H1-hESC.	Float
H1-hESC_fitCons_rankscore	H1-hESC fitCons scores were ranked among all H1-hESC fitCons scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of H1-hESC fitCons scores in dbNSFP.	Float
HUVEC_fitCons_score	HUVEC fitCons scores are based on cell type HUVEC.	Float
HUVEC_fitCons_rankscore	HUVEC fitCons scores were ranked among all HUVEC fitCons scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of HUVEC fitCons scores in dbNSFP.	Float
LINSIGHT	The LINSIGHT score measures the probability of negative selection on noncoding sites.	Float
LINSIGHT_rankscore	LINSIGHT scores were ranked among all LINSIGHT scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of LINSIGHT scores in dbNSFP.	Float
GERP++_NR	GERP++ neutral rate.	Float
GERP++_RS	GERP++ RS score, the larger the score, the more conserved the site. Range: [-12.3, 6.17].	Float
GERP++_RS_rankscore	GERP++ RS scores were ranked among all GERP++ RS scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of GERP++ RS scores in dbNSFP.	Float
phyloP100way_vertebrate	phyloP (phylogenetic p-values) conservation score based on the multiple alignments of 100 vertebrate genomes (including human). The larger the score, the more conserved the site. Range: [-20.0, 10.003].	Float
phyloP100way_vertebrate_rankscore	phyloP100way_vertebrate scores were ranked among all phyloP100way_vertebrate scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of phyloP100way_vertebrate scores in dbNSFP.	Float
phyloP470way_mammalian	phyloP (phylogenetic p-values) conservation score based on the multiple alignments of 470 mammalian genomes (including human). The larger the score, the more conserved the site. Range: [-20, 11.936].	Float
phyloP470way_mammalian_rankscore	phyloP470way_mammalian scores were ranked among all phyloP470way_mammalian scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of phyloP470way_mammalian scores in dbNSFP.	Float
phyloP17way_primate	a conservation score based on 17way alignment primate set, the higher the more conservative. Range: [-13.362, 0.756].	Float
phyloP17way_primate_rankscore	the rank of the phyloP17way_primate score among all phyloP17way_primate scores in dbNSFP.	Float
phastCons100way_vertebrate	phastCons conservation score based on the multiple alignments of 100 vertebrate genomes (including human). The larger the score, the more conserved the site. Range: [0, 1].	Float
phastCons100way_vertebrate_rankscore	phastCons100way_vertebrate scores were ranked among all phastCons100way_vertebrate scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of phastCons100way_vertebrate scores in dbNSFP.	Float
phastCons470way_mammalian	phastCons conservation score based on the multiple alignments of 470 mammalian genomes (including human). The larger the score, the more conserved the site. Range: [0, 1].	Float
phastCons470way_mammalian_rankscore	phastCons470way_mammalian scores were ranked among all phastCons470way_mammalian scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of phastCons470way_mammalian scores in dbNSFP.	Float
phastCons17way_primate	a conservation score based on 17way alignment primate set. The larger the score, the more conserved the site. Range: [0, 1].	Float
phastCons17way_primate_rankscore	the rank of the phastCons17way_primate score among all phastCons17way_primate scores in dbNSFP.	Float
SiPhy_29way_logOdds	SiPhy score based on 29 mammals genomes. The larger the score, the more conserved the site. Range: [0, 37.9718].	Float
SiPhy_29way_logOdds_rankscore	SiPhy_29way_logOdds scores were ranked among all SiPhy_29way_logOdds scores in dbNSFP. The rankscore is the ratio of the rank of the score over the total number of SiPhy_29way_logOdds scores in dbNSFP.	Float
bStatistic	Background selection (B) value estimates the expected fraction (*1000) of neutral diversity present at a site. Values close to 0 represent near complete removal of diversity as a result of background selection and values near 1000 indicating absent of background selection. Range: [0, 1000].	Float
bStatistic_converted_rankscore	bStatistic scores were first converted to -bStatistic, then ranked among all -bStatistic scores in dbNSFP. The rankscore is the ratio of the rank of -bStatistic over the total number of -bStatistic scores in dbNSFP.	Float

c

dbNSFP

results matching ""

No results matching ""