`
wanguan2000
  • 浏览: 68556 次
  • 性别: Icon_minigender_1
  • 来自: 上海
社区版块
存档分类
最新评论

haploid genome

    博客分类:
  • seq
 
阅读更多

http://seqanswers.com/forums/archive/index.php/t-10182.html

To my knowledge, FreeBayes is significantly different than other variant detection systems in common use in that it is not limited to the analysis of haploid or diploid individuals

The GATK can be used to call the sex (X and Y) chromosomes, without explicit knowledge of the gender of the samples. In an ideal world, with perfect upfront data processing, we would get perfect genotypes on the sex chromosomes without knowledge of who is diploid on X and has no Y, and who is hemizygous on both. However, misalignment and mismapping contributes especially to these chromosomes, as their reference sequence is clearly of lower quality than the autosomal regions of the genome. Nevertheless, it is possible to get reasonably good SNP calls, even with simple data processing and basic filtering. Results with proper, full data processing as per the best practices in the GATK should lead to very good calls. You can view a presentation "The GATK Unified Genotyper on chrX and chrY" in GSA Public Drop Box

 

What I ended up doing was using GATK's UnifiedGenotyper, manually extracting the likelihoods for both of the homozygote genotypes, and calling a SNP if the likelihood of the alternative allele was above a certain amount higher than the likelihood of the reference allele (I believe I required the likelihood of the alt allele to be at least 3X greater than the ref allele, although I haven't tested extensively to find the best threshold).

I have used FreeBayes on haploid sequences with good results; it is recommended.

http://www.broadinstitute.org/gsa/wiki/index.php/Understanding_the_Unified_Genotyper%27s_VCF_files

 

git clone --recursive git://github.com/ekg/freebayes.git

 

 

 

Field Meaning GT GQ AD and DP PL
The genotype of this sample. For a diploid, the GT field indicates the two alleles carried by the sample, encoded by a 0 for the REF allele, 1 for the first ALT allele, 2 for the second ALT allele, etc. When there's a single ALT allele (the by far more common case), GT will be either:
  • 0/0 - the sample is homozygous reference
  • 0/1 - the sample is heterozygous, carrying 1 copy of each of the REF and ALT alleles
  • 1/1 - the sample is homozygous alternate

In the three examples above, NA12878 is T/G, G/G, and C/T.

The Genotype Quality, as a Phred-scaled confidence at the true genotype is the one provided in GT. In diploid case, if GT is 0/1, then GQ is really L(0/1) / (L(0/0) + L(0/1) + L(1/1)), where L is the likelihood of the NGS sequencing data under the model of that the sample is 0/0, 0/1/, or 1/1.
See the online documentation for AD and DP .
We provide the AD and DP fields since this is usually what downstream users want. However, the truly sophisticated users will want to directly use the likelihoods of the three genotypes 0/0, 0/1, and 1/1 provide in the PL field. These are normalized, Phred-scaled likelihoods for each of the 0/0, 0/1, and 1/1, without priors. To be concrete, for the het case, this is L(data given that the true genotype is 0/1). The most likely genotype (the one in GT) is scaled so that it's P = 1.0 (0 when Phred-scaled), and the other likelihoods reflect their Phred-scaled likelihoods relative to this most likely genotype. Currently only provided when the site is biallelic.
分享到:
评论

相关推荐

    三代基因组组装软件

    The latter assembly leads a great improvement of quality to the previous genome assembled from the 54x haploid SMRT data. MECAT performance were compared with PBcR-Mhap pipeline, FALCON and Canu(v1.3...

    文献-我国玉米单倍体育种技术研究进展

    This paper introduced the research progress of China’S maize haploid in producing approaches,the identification method and the doubling method,and made a summary for the application value of ...

    POPGENE中文使用教程参照.pdf

    Haploid Data Analysis中,用户可以进行基因频率、等位基因数量、有效等位基因数量、多态性loci、基因多样性、Shannon信息指数、同质性测试、F-统计量、基因流、遗传距离、树形图、Neutrality Test、二位点LD等分析...

    不同施氮期对小麦千粒重影响的QTL分析 (2015年)

    本研究以花培3号×豫麦57的168个双单倍体(doubled haploid,DH)群体为材料,根据2年12个环境下千粒重性状的表型数据和含有323个位点的分子遗传图谱,对千粒重性状进行了QTL分析。结果共检测到40个QTL,主要集中在染色体...

    bqaa210 (1).pdf

    【文章描述】:本文探讨了在精子发生(spermiogenesis)过程中,除了遗传物质形态变化形成紧密核外,发育中的单倍体精原细胞(haploid spermatids)内广泛发生的组蛋白修饰。通过单细胞RNA测序技术,研究者深入理解...

    [家畜繁殖学课件]第五章 受精、妊娠与分娩4.ppt

    二、单倍体(Haploid) 单倍体则指只有一个染色体组的细胞或个体。在常规的有性生殖中,单倍体状态通常是短暂的,存在于减数分裂后的配子中。如果卵子或精子未经减数分裂直接参与受精,或者两个单倍体配子结合,就...

    细胞分裂习题.doc

    减数分裂发生在生殖细胞中,目的是产生四个不同且完整的 haploid cells。 2. 细胞周期:细胞周期是指细胞从一个阶段到下一个阶段的过程。细胞周期包括间期(interphase)、分裂期(mitosis)和 cytokinesis。间期是...

    减数分裂学生活动资料.docx

    首先,减数分裂的主要目的是为了减半染色体的数量,使得每个生殖细胞只含有单倍体( haploid)的染色体数,这是维持物种染色体数目稳定的关键。在大多数多细胞生物中,体细胞是二倍体( diploid),含有两套染色体,...

    减数分裂过程中的数量变化.ppt

    形成四个最终的配子细胞,每个细胞只有N/2条染色体,即单倍体(haploid)状态。 在精子形成过程中,四个最终的子细胞将发育成为精子。在卵细胞形成过程中,通常只有一个子细胞能存活下来并发育成卵细胞,而其他三...

    生物科学生物技术专业英语词汇手册范本.doc

    例如,"haplo-" 表示“单”,在 "haploid" 中指的是只有一个染色体组的生物;"bi-" 意为“双”,如 "bicolor" 指具有两种颜色,"dual" 意为“双重”的。 2. **数量相关的词素**:在生物学中,数量常用来描述细胞...

    高中生物《染色体变异》教案13新人教版必修2.pdf

    - **单倍体(haploid)**:含有一个染色体组的个体,通常由未受精的配子发育而来,如雄性蜜蜂。单倍体可能只有一个染色体组,也可能在某些情况下有多个染色体组,比如非同源染色体的加倍。 - **二倍体(diploid)*...

    生物专业英语词汇必备.doc

    例如,haploid(单倍体)指的是只有一套染色体的生物体,monoxide(一氧化碳)是由一个氧原子和另一个元素组成的化合物,而monoatomic(单原子的)则指由单一原子构成的物质。 2. bi, di, dipl, twi, du:这些词素...

    高中生物《染色体变异》教案13新人教版必修2-5页.pdf

    - **单倍体**(haploid):体细胞中含有单个染色体组的生物。在减数分裂中形成的配子,如精子和卵细胞,就是单倍体。 - **二倍体**(diploid):体细胞中含有两个染色体组的生物,如人类和其他大多数动植物。正常...

    《染色体变异》教学设计案例.pdf

    - **单倍体**(Haploid):只含有一组染色体的生物或细胞,如花粉粒和卵细胞。在受精过程中,单倍体的配子结合形成二倍体的合子。 5. **教学设计**:在教授染色体变异时,教师应利用动画演示、实际操作和案例分析...

    【创新设计】2021届高考生物一轮复习 2-1-3 减数分裂和受精作用限时训练.doc

    减数分裂过程中,染色体数目减少了一半,导致细胞中的基因组成为haploid(单倍体)。在减数分裂过程中,染色体是如何变化的?在减数第一次分裂后期,染色体数目减少了一半,每个染色体都被分离成两个 sister ...

    食科专业英语PPT课件.pptx

    - **Haplo-**、**Mono-**、**Uni-** 表示“单”,如单倍体(haploid)、一氧化碳(monoxide)、单原子的(monoatomic)。 - **Bi-**、**Di-**、**Dipl-**、**Twi-**、**Du-** 表示“二”或“双”,如双色(bicolor)...

    遗传育种相关名词中英文对照.docx

    19. **异源单倍体/allo haploid**:一个单倍体个体,其染色体来自两个不同的物种。 20. **异源多倍体/allo polyploid**:含有两个或更多不同物种染色体的多倍体。 这些名词只是遗传育种领域的一小部分,深入理解...

    生物学专业英语词汇.doc

    12. **遗传与进化**:涉及**单倍体**(haploid)、**二倍体**(diploid)和**多倍体**(polyploid),以及**同源器官**和**同功器官**的概念,它们分别表示基因组数目和功能相似但起源不同的器官。 这些词汇和概念...

Global site tag (gtag.js) - Google Analytics