Music – Hodge Irreversibility Depth

Byte-level Hodge decomposition of MIDI files treated as raw byte streams.

Data Sources

Real MIDI files from Mutopia Project (public domain):

Corpus Files Total bytes Unique bytes
Bach BWV 117a, Sheep May Safely Graze, BWV 230, B Minor Mass, CPE Bach Rondo 86,800 170
Debussy Arabesque 1 & 2, Clair de Lune, Prelude L117-4 53,014 167
Joplin (ragtime) Elite Syncopations, Pineapple Rag, Something Doing, The Entertainer, Bethena, Magnetic Rag 123,309 140

Results

Stream D=1 D=2 D=3 D=4 D=5
English (5MB enwik8) 0.957 0.761 0.588 0.524 0.492
Bach (87KB MIDI) 0.965 0.795 0.758 0.723 0.701
Debussy (53KB MIDI) 0.970 0.740 0.714 0.580 0.570
Joplin ragtime (123KB MIDI) 0.966 0.809 0.767 0.676 0.659
Random bytes (5KB) 1.000 0.984 0.984

Data sizes capped per depth for tractability: D=1: 100KB, D=2: 80KB, D=3: 40KB, D=4: 25KB, D=5: 18KB.

Observations

MIDI is more irreversible than natural language at every depth.

All three composers show f(D) curves that sit above English across D=2..5. The harmonic (irreversible) fraction decays more slowly in MIDI than in text:

Stream f(1)-f(3) f(3)-f(5) f(5) floor
English 0.369 0.096 0.492
Bach 0.207 0.057 0.701
Debussy 0.256 0.144 0.570
Joplin 0.199 0.108 0.659

Key findings:

  1. Bach has the highest asymptotic irreversibility (f(5) = 0.70). The byte stream retains the most harmonic energy at depth 5. This likely reflects the rigid contrapuntal structure of the MIDI encoding – Bach’s music has highly constrained voice leading that creates strong non-gradient cycles in the Markov field even at long context.

  2. Debussy has the steepest decay and lowest floor (f(5) = 0.57). Among the three composers, Debussy is closest to natural language in its depth profile. The impressionistic style with whole-tone scales and less rigid structure may create more “reversible” (gradient-like) byte patterns.

  3. Joplin sits between Bach and Debussy (f(5) = 0.66). Ragtime’s syncopated but repetitive structure produces moderate irreversibility.

  4. All music >> English at depth. English drops to f(5) = 0.49 while all MIDI streams stay above 0.57. This gap persists even though MIDI files use fewer unique byte values (140-170) than English text (which uses most of the ASCII range).

  5. MIDI format contributes structure. The MIDI binary format itself (headers, delta-time encoding, status bytes) imposes byte-level patterns that are inherently non-reversible. The difference between composers (0.57-0.70) measures musical structure on top of the format baseline.

Interpretation

Natural language becomes increasingly “gradient” (reversible) at depth because long-range semantic context makes predictions symmetric – knowing the future helps predict the past about as much as vice versa. Music as MIDI byte streams resists this: the temporal arrow of musical structure (voice leading, harmonic progression, rhythmic patterns) creates persistent cycles in the Markov transition field that cannot be decomposed into a potential function, even at depth 5.

The ordering Bach > Joplin > Debussy for irreversibility depth correlates inversely with harmonic ambiguity – Bach’s strict counterpoint is the most directional, Debussy’s impressionism the least.