Entropic Isolation · Open Research Compression Research · AIM Framework
Entisol
Tools
Interactive Tool · Browser-Based · No Install
AIM Web
Analyzer
Structural Analysis & Decay Profiling — v12

Run AIM's entropy analysis directly in the browser. Drop any file to generate a full decay profile, bit-plane distribution fingerprint, and codec recommendation report. The same structural logic that powers the C implementation — no server, no upload.

Launch Analyzer →
Decay Profile
Recursive bit-plane decomposition depth chart. Visualizes how structure collapses across layers — the core AIM fingerprint.
Bit Distribution
Per-bit-position flag density at L0. Distinguishes natural language from random noise even when decay depth is identical.
Codec Race
All seven flag codecs compete on your data. Reports winning codec, encoded size, and entropy delta per depth.
Data Classification
Automatic classification: PCM audio, YUV video, text, binary, or random. Reports estimated AIM benefit vs gzip.
01 Entropic Isolation
"The goal is not to beat gzip.
The goal is to know what you are compressing."

Byte-level sliding window compressors treat every byte the same. But the high byte and low byte of a PCM sample have completely different entropy profiles. Consecutive YUV pixels repeat with period 4. Consecutive struct fields align at their width. These structures are invisible to LZ77.

AIM decomposes data losslessly into structured and entropic components — isolating the compressible from the incompressible. The result is a decay profile: a mathematical fingerprint of the data's internal structure. It correctly identifies when not to apply advanced techniques — returning near-parity on random data and accepting that gzip wins on natural language text.

02 Algorithm
01
Bit Sweep

Sweep all 8 bit positions across the full stream. Find the sparsest bit plane.

02
Flag Separation

Extract the plane as a flag set. Seven codecs race — shortest encoding wins.

03
Remap + Recurse

Clear the bit, remap to a halved symbol alphabet, recurse on the aligned stream.

04
Optimal Halt

Halt conditions fire when continuing costs more than stopping. At depth 8, halt is proven — not parameterized.

GAP
First pos + varbyte gaps. Sparse sets (<5%).
BITSET
Dense bit vector. Always applicable.
ELIAS-FANO
Monotone sequences. Clustered gaps.
RLE
Run-length on bitset. Alternating runs.
HUFFMAN
Canonical Huffman on flag bytes.
LZ77
Sliding window. Positional repetition.
DEFLATE
LZ77 + Huffman. Large sets, mixed.
03 Key Findings
F1
Structural Collapse Under Recursion

Mathematical sequences with strong arithmetic regularity halt in 2–4 layers. Prime Gaps halt at depth 2 with a flag ratio of 0.001. Fibonacci mod 256 halts at depth 4 with a flag ratio of exactly 2/3 — a deterministic arithmetic property, not an approximation.

Prime Gaps: depth 2 · 0.1% flags
F2
Two-Dimensional Fingerprinting

Decay depth alone is insufficient to distinguish data types. Natural Language and Random Noise both run 13 layers with nearly identical flag ratios — but their bit distributions at L0 are completely different. The fingerprint requires both decay profile and bit distribution.

Text b5 = 1.0 · Noise b5 ≈ 0.5
F4
Fewest Flags ≠ Lowest Entropy

The sparsest bit plane and the optimal entropy target can differ. For Prime Gaps: the sweep picks bit 6 (zero flags, zero gain). The entropy winner is bit 4 — 190 flags, but a genuine −6.53% net entropy reduction because clearing it concentrates the value distribution.

Bit 6: 0 flags, 0% gain · Bit 4: 190 flags, −6.53%
F5
Genuine Total Entropy Reduction

First confirmed case of genuine entropy reduction — not relocation. The condition: the target bit must be set for a structurally distinct, sparse minority whose positions are cheap to encode and whose removal genuinely narrows the value distribution.

Prime Gaps bit 4: −6.53% total entropy
04 Results
FileRawOutputvs gzip
PCM Audio (38.7 MB WAV) 38.7 MB28.7 MB −25.90%
Uncompressed Video (527 MB YUV) 527 MB194.7 MB −63.07%
Synthetic YUV420 (5 MB) 5.0 MB2.73 MB −41.27%
Synthetic PCM (5 MB) 5.0 MB3.63 MB −22.70%
Source Code 48 KB +19% — gzip wins on text
Random Data 100 KB +0.14% — near-parity

AIM is not a universal improvement over gzip. It targets data with bit-plane regularity or inter-symbol periodicity: PCM audio, YUV video, scientific measurements, packed binary formats. For text, gzip wins and AIM correctly identifies this.

05 Development Arc
Sessions 1–5
Foundation

Initial Python implementation. REAL transform. Bit-plane sweep. First flag codec race.

Sessions 6–9
Recursive Architecture

Full recursive decomposition. Multiple codec backends. Wire format v4–v8. Roundtrip verification.

Sessions 10–13
Entropy Analysis

Structural fingerprinting. Decay profiles. Confirmed findings F1–F7. AIM Web Analyzer tool.

Sessions 14–16
C Port

Full C implementation (aim3). 29× faster than Python. Memory optimization. Large file support.

Sessions 17–21
Optimization

rANS backends. ANS order-0/1/2d. Stride-k selection. Memory-bounded streaming. Chunk analyzer.

Sessions 22–24
HALT_ANS_STRIDE

v15. Optimal early cutoff. −25.90% vs gzip on PCM audio. −63.07% on uncompressed video.