GATK之VariantAnnotator

VariantAnnotator

简要说明

用途: 利用上下文信息注释识别的变异位点(variant calls)
分类: 变异位点操作工具
概要: 根据变异位点的背景信息(与功能注释相对)进行注释。目前有许多的注释模块(见注释模块一节)可供使用。

输入文件

用于注释的VCF文件和可选的BAM文件

输出文件

注释完毕的VCF文件

使用案例

HaplotypeCallerUnifiedGenotyper的结果中增加每个样本的深度和dbSNP ID信息。

java -jar GenomeAnalysisTK.jar \-R reference.fasta \-T VariantAnnotator \-I input.bam \-V input.vcf \-o output.vcf \-A Coverage \--dbsnp dbsnp.vcf

参数说明:

-R/--reference_sequence:参考基因组
-T/--analysis_type : 运行的工具
-I/--input_file: 和vcf相应的BAM文件
-o :输出文件
-V/--varaint: 输入的VCF文件
-A/--annotation: 要添加哪些注释项
--dbsnp: 已有的snp信息注释数据库

HaplotypeCaller和MuTect2也有-A选项,并且有些注释模块只能在HaplotypeCaller和MuTect2计算,例如StrandAlleleCountsBySample
如下是 -A可接的内容:

Standard annotations in the list below are marked with a '*'.
Available annotations for the VCF INFO field:AS_BaseQualityRankSumTestAS_FisherStrandAS_InbreedingCoeffAS_InsertSizeRankSumAS_MQMateRankSumTestAS_MappingQualityRankSumTestAS_QualByDepthAS_RMSMappingQualityAS_ReadPosRankSumTestAS_StrandOddsRatioAlleleBalanceBaseCounts*BaseQualityRankSumTest*ChromosomeCountsClippingRankSumTestClusteredReadPosition*Coverage*ExcessHet*FisherStrandFractionInformativeReadsGCContentGenotypeSummaries*HaplotypeScoreHardyWeinbergHomopolymerRun*InbreedingCoeffLikelihoodRankSumTestLowMQMVLikelihoodRatio*MappingQualityRankSumTestMappingQualityZeroNBaseCountPossibleDeNovo*QualByDepth*RMSMappingQuality*ReadPosRankSumTestSampleListSnpEffSpanningDeletions*StrandOddsRatioTandemRepeatAnnotatorTransmissionDisequilibriumTestVariantTypeAvailable annotations for the VCF FORMAT field:AlleleBalanceBySampleAlleleCountBySampleBaseCountsBySampleBaseQualitySumPerAlleleBySample*DepthPerAlleleBySampleDepthPerSampleHCMappingQualityZeroBySampleOxoGReadCountsStrandAlleleCountsBySampleStrandBiasBySampleAvailable classes/groups of annotations:AS_RMSAnnotationAS_RankSumTestAS_StandardAnnotationAS_StrandBiasTestActiveRegionBasedAnnotationBetaTestingAnnotationExperimentalAnnotationRMSAnnotationRankSumTestReducibleAnnotationRodRequiringAnnotationStandardAnnotationStandardHCAnnotationStandardSomaticAnnotationStandardUGAnnotationStrandBiasTestWorkInProgressAnnotation

注释模块

这是官方文档提供的注释模块:

NameSummary
AS_BaseQualityRankSumTestAllele-specific rank Sum Test of REF versus ALT base quality scores
AS_FisherStrandAllele-specific strand bias estimated using Fisher's Exact Test *
AS_InbreedingCoeffAllele-specific likelihood-based test for the inbreeding among samples
AS_InsertSizeRankSumAllele specific Rank Sum Test for insert sizes of REF versus ALT reads
AS_MQMateRankSumTestAllele specific Rank Sum Test for mate's mapping qualities of REF versus ALT reads
AS_MappingQualityRankSumTestAllele specific Rank Sum Test for mapping qualities of REF versus ALT reads
AS_QualByDepthAllele-specific call confidence normalized by depth of sample reads supporting the allele
AS_RMSMappingQualityAllele-specific Root Mean Square of the mapping quality of reads across all samples.
AS_ReadPosRankSumTestAllele-specific Rank Sum Test for relative positioning of REF versus ALT allele within reads
AS_StrandOddsRatioAllele-specific strand bias estimated by the Symmetric Odds Ratio test
AlleleBalanceAllele balance across all samples
AlleleBalanceBySampleAllele balance per sample
AlleleCountBySampleAllele count and frequency expectation per sample
BaseCountsCount of A, C, G, T bases across all samples
BaseCountsBySampleCount of A, C, G, T bases for each sample
BaseQualityRankSumTestRank Sum Test of REF versus ALT base quality scores
BaseQualitySumPerAlleleBySampleSum of evidence in reads supporting each allele for each sample
ChromosomeCountsCounts and frequency of alleles in called genotypes
ClippingRankSumTestRank Sum Test for hard-clipped bases on REF versus ALT reads
ClusteredReadPositionDetect clustering of variants near the ends of reads
CoverageTotal depth of coverage per sample and over all samples.
DepthPerAlleleBySampleDepth of coverage of each allele per sample
DepthPerSampleHCDepth of informative coverage for each sample.
ExcessHetPhred-scaled p-value for exact test of excess heterozygosity
FisherStrandStrand bias estimated using Fisher's Exact Test
FractionInformativeReadsThe fraction of reads deemed informative over the entire cohort
GCContentGC content of the reference around the given site
GenotypeSummariesSummarize genotype statistics from all samples at the site level
HaplotypeScoreConsistency of the site with strictly two segregating haplotypes
HardyWeinbergHardy-Weinberg test for transmission disequilibrium
HomopolymerRunLargest contiguous homopolymer run of the variant allele
InbreedingCoeffLikelihood-based test for the inbreeding among samples
LikelihoodRankSumTestRank Sum Test of per-read likelihoods of REF versus ALT reads
LowMQProportion of low quality reads
MVLikelihoodRatioLikelihood of being a Mendelian Violation
MappingQualityRankSumTestRank Sum Test for mapping qualities of REF versus ALT reads
MappingQualityZeroCount of all reads with MAPQ = 0 across all samples
MappingQualityZeroBySampleCount of reads with mapping quality zero for each sample
NBaseCountPercentage of N bases
OxoGReadCountsCount of read pairs in the F1R2 and F2R1 configurations supporting the reference and alternate alleles
PossibleDeNovoExistence of a de novo mutation in at least one of the given families
QualByDepthVariant call confidence normalized by depth of sample reads supporting a variant
RMSMappingQualityRoot Mean Square of the mapping quality of reads across all samples.
ReadPosRankSumTestRank Sum Test for relative positioning of REF versus ALT alleles within reads
SampleListList samples that are non-reference at a given site
SnpEffTop effect from SnpEff functional predictions
SpanningDeletionsFraction of reads containing spanning deletions
StrandAlleleCountsBySampleNumber of forward and reverse reads that support each allele
StrandBiasBySampleNumber of forward and reverse reads that support REF and ALT alleles
StrandOddsRatioStrand bias estimated by the Symmetric Odds Ratio test
TandemRepeatAnnotatorTandem repeat unit composition and counts per allele
TransmissionDisequilibriumTestWittkowski transmission disequilibrium test
VariantTypeGeneral category of variant

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/news/541505.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

pipedreader_Java PipedReader connect()方法与示例

pipedreaderPipedReader类的connect()方法 (PipedReader Class connect() method) connect() method is available in java.io package. connect()方法在java.io包中可用。 connect() method is used to cause this PipedReader to be connected to the given PipedWriter (sou…

《Java学习指南》—— 1.4 设计安全

本节书摘来异步社区《Java学习指南》一书中的第1章,第1.4节,作者:【美】Patrick Niemeyer , Daniel Leuck,更多章节内容可以访问云栖社区“异步社区”公众号查看。 1.4 设计安全 Java被设计为一种安全语言,对于这一事实…

ppython_Python pcom包_程序模块 - PyPI - Python中文网

PCOM在python中一个非常基本的unitronics pcom协议实现。如何使用from pcom import commandsfrom pcom.plc import EthernetPlcwith EthernetPlc(address(192.168.5.43, 1616)) as plc:# Read realtime clockc commands.ReadRtc()res plc.send(c)print(res)# Set realtime cl…

bitcount方法详解_Java Long类的bitCount()方法和示例

bitcount方法详解长类bitCount()方法 (Long class bitCount() method) bitCount() method is available in java.lang package. bitCount()方法在java.lang包中可用。 bitCount() method is used to find the number of 1s bits in the 2s complement binary denotation of the…

《软件定义数据中心:Windows Server SDDC技术与实践》——导读

前言 通过对自身的审视和对身边IT 技术专家的观察,我发现对于我们来说,掌握一项新的技术或熟悉一个新的产品,大都是闻而后知,知而后学,学以致用,用以知其然。然而Windows Server作为一个简单的、易上手的操…

python二维向量运算模拟_python二维向量运算_[VB.NET][C#]二维向量的基本运算

前言在数学中,几何向量是指具有大小和方向的几何对象。在编程中,向量有着广泛的应用,其作用在图形编程和游戏物理引擎方面尤为突出。第一节 构造函数通过创建一个二维向量的类(或结构体),实现向量的表示及其运算。1. 首先&#xf…

Java LinkedHashMap clear()方法与示例

LinkedHashMap类的clear()方法 (LinkedHashMap Class clear() method) clear() method is available in java.util package. clear()方法在java.util包中可用。 clear() method is used to remove all the existing elements from this LinkedHashMap. clear()方法用于从此Link…

《Imperfect C++中文版》——1.3 运行期契约:前置条件、后置条件和不变式

本节书摘来自异步社区出版社《Imperfect C中文版》一书中的第1章,第1.3节,作者: 【美】Matthew Wilson,更多章节内容可以访问云栖社区“异步社区”公众号查看。 1.3 运行期契约:前置条件、后置条件和不变式 Imperfect …

python名称空间与运用域_Python名称空间和作用域讲座,命名,Namespaces,Scopes

Python命名空间(Namespaces)和作用域(Scopes)讲座命名空间(Namespace)命名空间(Namespace),是名称到对象的映射。命名空间提供了在项目中避免名字冲突的一种方法。命名空间是独立的,没有任何关系的,所以一个命名空间中不能有重名,…

getminimum_Java Calendar getMinimum()方法与示例

getminimumCalendar类的getMinimum()方法 (Calendar Class getMinimum() method) getMinimum() method is available in java.util package. getMinimum()方法在java.util包中可用。 getMinimum() method is used to get the minimum value of the given field(fi) of this Cal…

《Spark核心技术与高级应用》——3.2节构建Spark的开发环境

本节书摘来自华章社区《Spark核心技术与高级应用》一书中的第3章,第3.2节构建Spark的开发环境,作者于俊 向海 代其锋 马海平,更多章节内容可以访问云栖社区“华章社区”公众号查看 3.2 构建Spark的开发环境无论Windows或Linux操作系统&am…

python闭包怎么理解_Python 闭包的理解

Last Updated on 2019年10月15日Python中的闭包是一个比较模糊的概念,不太好理解,我最近的面试中也被问及,在一个单例模式的实现上,我用装饰器实现单例,然后面试官就问到了我对闭包的理解,回答的不太清楚。…

Java BufferedReader mark()方法与示例

BufferedReader类mark()方法 (BufferedReader Class mark() method) mark() method is available in java.io package. mark()方法在java.io包中可用。 mark() method is used to mark the current position in this stream and whenever we call reset() method so it will re…

《全球互联网金融商业模式:格局与发展》——第3章,第3节互联网保险公司...

本节书摘来自华章出版社《全球互联网金融商业模式:格局与发展》一书中的第3章,第3.3节互联网保险公司,作者廖理,更多章节内容可以访问云栖社区“华章计算机”公众号查看 3.3 互联网保险公司互联网思维贯穿整个保险创新发展过程&a…

webapi随机调用_BeetleX之webapi验证插件JWT集成

对于webapi服务应用很多时候需要制订访问限制,在前面的章节也讲述了组件如何制订控制器访问控制;但到了实际应用要自己去编写还是比较麻烦。为了让访问控制更方便组件实现基于JWT的控制器访问控制组件BeetleX.FastHttpApi.Jwt;通过这个组件可…

java bitset_Java BitSet nextClearBit()方法与示例

java bitsetBitSet类nextClearBit()方法 (BitSet Class nextClearBit() method) nextClearBit() method is available in java.util package. nextClearBit()方法在java.util包中可用。 nextClearBit() method is used to retrieve the index of the first bit that is set to …

《驯狮记——Mac OS X 10.8 Mountain Lion使用手册》——2.3 Dock

本节书摘来自异步社区《驯狮记——Mac OS X 10.8 Mountain Lion使用手册》一书中的第2章,第2.3节,作者:陈明 , 张铮 , 马玉龙著,更多章节内容可以访问云栖社区“异步社区”公众号查看 2.3 Dock 驯狮记——Mac OS X 10.8 Mountain…

mysql 嵌套if标签_对比Excel、MySQL、Python,分别讲述 “if函数” 的使用原理!

作者:黄伟呢本文转自:数据分析与统计学之美其实,不管是Excel、MySQL,还是Python,“if”条件判断都起着很重要的作用。今天这篇文章,就带着大家盘点一下,这三种语言如何分别使用 “if函数” 。if…

Java BigDecimal intValue()方法与示例

BigDecimal类的intValue()方法 (BigDecimal Class intValue() method) intValue() method is available in java.math package. intValue()方法在java.math包中可用。 intValue() method is used to convert a BigDecimal to an integer and when the converted BigDecimal val…

R语言数据挖掘

数据分析与决策技术丛书 R语言数据挖掘 Learning Data Mining with R [哈萨克斯坦]贝特麦克哈贝尔(Bater Makhabel) 著 李洪成 许金炜 段力辉 译 图书在版编目(CIP)数据 R语言数据挖掘 / (哈…