Nature Communications (Accepted) · 2025

CiFi: Accurate long-read chromosome conformation capture with low-input requirements

Sean P. McGinty* (University of California, Davis)
Gulhan Kaya* (University of California, Davis)
Sheina B. Sim (USDA Agricultural Research Service)
Alex Makunin (Wellcome Sanger Institute)
Renée L. Corpuz (USDA Agricultural Research Service)
Michael A. Quail (Wellcome Sanger Institute)
Mohamed Abuelanin (University of California, Davis)
Mara K. N. Lawniczak (Wellcome Sanger Institute)
Scott M. Geib (USDA Agricultural Research Service)
Jonas Korlach (Pacific Biosciences)
Megan Y. Dennis (University of California, Davis)

Abstract

Hi-C characterizes three-dimensional chromatin organization, facilitates haplotype phasing, and enables genome-assembly scaffolding, but encounters difficulties across complex regions. By coupling chromosome conformation capture (3C) with PacBio HiFi long-read sequencing, here we develop a method (CiFi) that enables analysis of genomic interactions across repetitive regions. Starting with as little as 60,000 cells (sub-microgram DNA), the method produces multi-kilobasepair HiFi reads that contain multiple interacting, concatenated segments (~350 bp to 2 kbp). This multiplicity and increase in segment length versus standard short-read-based Hi-C improves read-mapping efficiency and coverage in repetitive regions and enhances haplotype phasing. CiFi pairwise interactions are largely concordant with Hi-C from a human lymphoblastoid cell line, with gains in assigning topologically associating domains across centromeres, segmental duplications, and human disease-associated genomic hotspots. As CiFi requires less input versus established methods, we apply the approach to characterize single small insects: assaying chromatin interactions across the genome from an *Anopheles coluzzii* mosquito and producing a chromosome-scale scaffolded assembly from a *Ceratitis capitata* Mediterranean fruit fly. Together, CiFi enables assessment of chromosome-scale interactions of previously recalcitrant low-complexity loci, low-input samples and small organisms.

Key Contributions

Low-Input Requirements

Produces high-quality chromatin interaction data starting with as little as 60,000 cells (sub-microgram DNA), enabling single-organism analysis.

Long-Read Technology

Combines 3C with PacBio HiFi sequencing to generate multi-kilobasepair reads with multiple interacting segments, improving mapping across repetitive regions.

Enhanced Resolution

Improves assessment of topologically associating domains across centromeres, segmental duplications, and disease-associated genomic hotspots.

Visuals & Media

PacBio Webinar: Small Samples, Big Insights - Capturing Chromatin Architecture with CiFi