Hi-C chromatin 3D organization analysis in pure NumPy/SciPy — TAD detection, A/B compartments, loop calling, and differential analysis. No cooler, no Juicer, no HiCExplorer.
Five analysis modules, each implemented from first principles in pure NumPy/SciPy.
Iterative Correction and Eigenvector decomposition — removes GC content, mappability, and restriction site biases. Converges in ~20 iterations.
Insulation score (Crane et al. 2015) + Directionality Index (Dixon et al. 2012). TAD boundaries = local minima of insulation score.
PC1 of O/E contact matrix via sklearn PCA. GC content-guided eigenvector orientation. Saddle plot for compartment interactions.
HICCUPS-inspired: distance-decay background, donut neighborhood enrichment, Poisson p-value peak calling. Detects focal contact enrichments.
Compare two Hi-C matrices (WT vs. KO, cell type A vs. B). Gained/lost boundaries, permutation significance testing, differential contact scores.
6-panel Plotly visualization: Hi-C heatmap with TAD/loop overlays, insulation track, compartment track, distance-decay, saddle plot, summary table.
Demo run on 200 synthetic bins with 6 TADs, 4 loops, and A/B compartment structure. All algorithms from first principles — no external Hi-C libraries.
From raw Hi-C matrix to 3D genome organization in five stages.
Python 3.9+. NumPy + SciPy core. Optional: cooler for .cool file I/O.