Chromosome-Level Haplotype Phasing by Integrating HiFi with Hi-C
2025
会议录名称INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY
发表状态待投递
摘要

Haplotype-resolved genomes play a pivotal role in understanding evolutionary histories, genetic variations, and complex traits. While trio binning is an effective technique for haplotype phasing when parental information is available, its practical application is often hindered by the absence of parental data.

We propose a novel approach DipHiphaser that integrates PacBio HiFi reads with Hi-C chromatin interaction data to resolve chromosome-level haplotypes without parental data. The core idea of our method is to leverage SNP-mer pairs, defined as pairs of k-mers each with a single nucleotide polymorphism (SNP) located exactly at the middle point, to identify haplotypes. These SNP-mer pairs are initially derived from HiFi reads and largely represent haplotype information as they are mostly mapped to heterozygous regions of the genome. Subsequently, we connect these SNP-mer pairs based on overlaps in HiFi data and further phase and assemble them into chromosome-haplotype blocks using Hi-C data. When applied to human genomes and a bird, DipHiphaser achieves an impressive phasing accuracy of HiFi reads exceeding 97% at both the haplotype and chromosome levels, outperforming trio binning. These phased reads can be independently assembled to achieve high contig phasing accuracy, closely approaching the top-performing haplotype-resolved assembly pipelines. DipHiphaser provides an accurate way of phasing reads at chromosome-level which leads to haplotype-resolved diploid genome assembly without the requirement of parental data and has great potential for studying haplotype variation and inheritance for diploid species.

关键词HiFi HI-C SNP-mer Haplotype phasing Genome assembly
收录类别EI
语种英语
文献类型会议论文
条目标识符https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/411575
专题信息科学与技术学院_硕士生
信息科学与技术学院_PI研究组_郑杰组
免疫化学研究所_特聘教授组_Erez Lieberman Aiden组
共同第一作者Yang ZZ(杨珍珍)
通讯作者Zheng J(郑杰); Lieberman-Aiden, Erez
作者单位
1.上海科技大学
2.贝勒医学院
3.莱斯大学
第一作者单位上海科技大学
通讯作者单位上海科技大学
第一作者的第一单位上海科技大学
推荐引用方式
GB/T 7714
Sun ZG,Yang ZZ,Dudchenko, Olga,et al. Chromosome-Level Haplotype Phasing by Integrating HiFi with Hi-C[C],2025.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Sun ZG(孙志刚)]的文章
[Yang ZZ(杨珍珍)]的文章
[Dudchenko, Olga]的文章
百度学术
百度学术中相似的文章
[Sun ZG(孙志刚)]的文章
[Yang ZZ(杨珍珍)]的文章
[Dudchenko, Olga]的文章
必应学术
必应学术中相似的文章
[Sun ZG(孙志刚)]的文章
[Yang ZZ(杨珍珍)]的文章
[Dudchenko, Olga]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。