ShanghaiTech University Knowledge Management System
Chromosome-Level Haplotype Phasing by Integrating HiFi with Hi-C | |
2025 | |
会议录名称 | INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY |
发表状态 | 待投递 |
摘要 | Haplotype-resolved genomes play a pivotal role in understanding evolutionary histories, genetic variations, and complex traits. While trio binning is an effective technique for haplotype phasing when parental information is available, its practical application is often hindered by the absence of parental data. We propose a novel approach DipHiphaser that integrates PacBio HiFi reads with Hi-C chromatin interaction data to resolve chromosome-level haplotypes without parental data. The core idea of our method is to leverage SNP-mer pairs, defined as pairs of k-mers each with a single nucleotide polymorphism (SNP) located exactly at the middle point, to identify haplotypes. These SNP-mer pairs are initially derived from HiFi reads and largely represent haplotype information as they are mostly mapped to heterozygous regions of the genome. Subsequently, we connect these SNP-mer pairs based on overlaps in HiFi data and further phase and assemble them into chromosome-haplotype blocks using Hi-C data. When applied to human genomes and a bird, DipHiphaser achieves an impressive phasing accuracy of HiFi reads exceeding 97% at both the haplotype and chromosome levels, outperforming trio binning. These phased reads can be independently assembled to achieve high contig phasing accuracy, closely approaching the top-performing haplotype-resolved assembly pipelines. DipHiphaser provides an accurate way of phasing reads at chromosome-level which leads to haplotype-resolved diploid genome assembly without the requirement of parental data and has great potential for studying haplotype variation and inheritance for diploid species. |
关键词 | HiFi HI-C SNP-mer Haplotype phasing Genome assembly |
收录类别 | EI |
语种 | 英语 |
文献类型 | 会议论文 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/411575 |
专题 | 信息科学与技术学院_硕士生 信息科学与技术学院_PI研究组_郑杰组 免疫化学研究所_特聘教授组_Erez Lieberman Aiden组 |
共同第一作者 | Yang ZZ(杨珍珍) |
通讯作者 | Zheng J(郑杰); Lieberman-Aiden, Erez |
作者单位 | 1.上海科技大学 2.贝勒医学院 3.莱斯大学 |
第一作者单位 | 上海科技大学 |
通讯作者单位 | 上海科技大学 |
第一作者的第一单位 | 上海科技大学 |
推荐引用方式 GB/T 7714 | Sun ZG,Yang ZZ,Dudchenko, Olga,et al. Chromosome-Level Haplotype Phasing by Integrating HiFi with Hi-C[C],2025. |
条目包含的文件 | ||||||
条目无相关文件。 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。