Best Plain Text Compression Tool
Up to 0% smaller than LZMA. 0 wins, 0 losses across real-world Plain Text files. Lossless, verified, free to try.
How PZIP Compresses Plain Text
PZIP uses a high-order character-level prediction model that learns the statistical patterns of natural language as it reads. Per-character probability estimation outperforms LZMA's sliding window on prose, achieving up to 27.6% smaller output on Project Gutenberg texts.
Codec: Predictive modelingHistory of Plain Text
Plain text is the oldest digital data format, dating to ASCII (1963) and predating it in telegraph codes. Despite decades of compression research, new techniques continue to find savings. Predictive models have been the gold standard for text compression since the 1980s.
Compression Timeline
ASCII standard published
Lempel-Ziv (LZ77) algorithm — basis of gzip
PPM (Prediction by Partial Matching) algorithm
bzip2 released (BWT-based)
LZMA algorithm by Igor Pavlov
PZIP's predictive model beats LZMA by up to 27.6% on natural text
Real-World Benchmark Results
Every file tested with LZMA-9 (maximum compression) as baseline. Round-trip correctness verified on every file.
| File | Size | PZIP vs LZMA | Result | Download |
|---|---|---|---|---|
| .renames.txt | 65.5 KB | TIE | ERROR | Source |
| README_P9.txt | 21.9 KB | TIE | ERROR | Source |
| adventures_huck_finn_alt.txt | 1.1 MB | TIE | ERROR | Source |
| alice_in_wonderland.txt | 170.3 KB | TIE | ERROR | Source |
| all_requirements.txt | 42.1 KB | TIE | ERROR | Source |
| anna_karenina.txt | 2.0 MB | TIE | ERROR | Source |
| anthem.txt | 127.6 KB | TIE | ERROR | Source |
| avx512fp16.txt | 91.6 KB | TIE | ERROR | Source |
| bible_kjv.txt | 4.4 MB | TIE | ERROR | Source |
| brothers_karamazov.txt | 1.9 MB | TIE | ERROR | Source |
| call_of_the_wild.txt | 195.1 KB | TIE | ERROR | Source |
| christmas_carol.txt | 178.0 KB | TIE | ERROR | Source |
| communist_manifesto.txt | 2.7 MB | TIE | ERROR | Source |
| confessions_augustine.txt | 617.4 KB | TIE | ERROR | Source |
| count_of_monte_cristo.txt | 2.7 MB | TIE | ERROR | Source |
| crime_and_punishment.txt | 1.1 MB | TIE | ERROR | Source |
| critique_pure_reason.txt | 104.3 KB | TIE | ERROR | Source |
| democracy_in_america_v1.txt | 1.1 MB | TIE | ERROR | Source |
| democracy_in_america_v2.txt | 846.0 KB | TIE | ERROR | Source |
| divine_comedy.txt | 143.9 KB | TIE | ERROR | Source |
| don_quixote.txt | 2.3 MB | TIE | ERROR | Source |
| dorian_gray.txt | 454.9 KB | TIE | ERROR | Source |
| dracula.txt | 869.5 KB | TIE | ERROR | Source |
| emma.txt | 911.9 KB | TIE | ERROR | Source |
| frankenstein.txt | 438.4 KB | TIE | ERROR | Source |
| gfx10_mimg.txt | 52.8 KB | TIE | ERROR | Source |
| gfx10_vop2.txt | 85.2 KB | TIE | ERROR | Source |
| gfx10_vopcx.txt | 156.0 KB | TIE | ERROR | Source |
| gfx12_dasm_vop3_from_vop2_dpp8.txt | 33.7 KB | TIE | ERROR | Source |
| gfx12_dasm_vopcx_dpp8.txt | 45.4 KB | TIE | ERROR | Source |
| gfx12_dasm_vopd.txt | 686.2 KB | TIE | ERROR | Source |
| gfx8_vop2_dpp.txt | 222.8 KB | TIE | ERROR | Source |
| gfx8_vop3cx.txt | 628.4 KB | TIE | ERROR | Source |
| gfx9_vop2_sdwa.txt | 373.1 KB | TIE | ERROR | Source |
| great_expectations.txt | 1.0 MB | TIE | ERROR | Source |
| great_gatsby_precursor.txt | 631.2 KB | TIE | ERROR | Source |
| grimms_fairy_tales.txt | 547.1 KB | TIE | ERROR | Source |
| gullivers_travels.txt | 399.4 KB | TIE | ERROR | Source |
| heart_of_darkness.txt | 231.6 KB | TIE | ERROR | Source |
| huckleberry_finn.txt | 607.9 KB | TIE | ERROR | Source |
| idiot_dostoevsky.txt | 1.2 MB | TIE | ERROR | Source |
| importance_of_being_earnest.txt | 138.6 KB | TIE | ERROR | Source |
| jane_eyre.txt | 1.0 MB | TIE | ERROR | Source |
| jekyll_and_hyde.txt | 159.7 KB | TIE | ERROR | Source |
| jungle_book.txt | 296.6 KB | TIE | ERROR | Source |
| legal_contract.txt | 71.5 KB | TIE | ERROR | Source |
| les_miserables.txt | 165.2 KB | TIE | ERROR | Source |
| leviathan_hobbes.txt | 3.0 MB | TIE | ERROR | Source |
| little_women.txt | 1.0 MB | TIE | ERROR | Source |
| lorem_ipsum.txt | 87.3 KB | TIE | ERROR | Source |
| meditations_aurelius.txt | 415.4 KB | TIE | ERROR | Source |
| metamorphosis.txt | 138.7 KB | TIE | ERROR | Source |
| moby_dick.txt | 1.2 MB | TIE | ERROR | Source |
| modest_proposal.txt | 38.8 KB | TIE | ERROR | Source |
| my_antonia.txt | 1.2 MB | TIE | ERROR | Source |
| neon-instructions.txt | 85.8 KB | TIE | ERROR | Source |
| notes_from_underground.txt | 260.8 KB | TIE | ERROR | Source |
| odyssey_butler.txt | 728.9 KB | TIE | ERROR | Source |
| odyssey_homer.txt | 701.0 KB | TIE | ERROR | Source |
| oliver_twist.txt | 933.5 KB | TIE | ERROR | Source |
| origin_of_species.txt | 1.0 MB | TIE | ERROR | Source |
| paradise_lost.txt | 971.2 KB | TIE | ERROR | Source |
| peter_pan.txt | 282.0 KB | TIE | ERROR | Source |
| portrait_of_the_artist.txt | 484.7 KB | TIE | ERROR | Source |
| pride_and_prejudice.txt | 754.3 KB | TIE | ERROR | Source |
| psychology_william_james.txt | 210.1 KB | TIE | ERROR | Source |
| rfc1035.txt | 119.7 KB | TIE | ERROR | Source |
| rfc1951.txt | 36.1 KB | TIE | ERROR | Source |
| rfc2045.txt | 71.2 KB | TIE | ERROR | Source |
| rfc2046.txt | 103.4 KB | TIE | ERROR | Source |
| rfc2047.txt | 32.5 KB | TIE | ERROR | Source |
| rfc2048.txt | 44.0 KB | TIE | ERROR | Source |
| rfc2049.txt | 50.0 KB | TIE | ERROR | Source |
| rfc2068.txt | 369.3 KB | TIE | ERROR | Source |
| rfc2131.txt | 111.1 KB | TIE | ERROR | Source |
| rfc2396.txt | 82.5 KB | TIE | ERROR | Source |
| rfc2616.txt | 412.4 KB | TIE | ERROR | Source |
| rfc2822.txt | 108.1 KB | TIE | ERROR | Source |
| rfc3023.txt | 84.0 KB | TIE | ERROR | Source |
| rfc3261.txt | 632.8 KB | TIE | ERROR | Source |
| rfc3501.txt | 222.3 KB | TIE | ERROR | Source |
| rfc3550.txt | 253.9 KB | TIE | ERROR | Source |
| rfc3629.txt | 33.1 KB | TIE | ERROR | Source |
| rfc3748.txt | 154.3 KB | TIE | ERROR | Source |
| rfc3986.txt | 138.5 KB | TIE | ERROR | Source |
| rfc4251.txt | 70.1 KB | TIE | ERROR | Source |
| rfc4252.txt | 33.5 KB | TIE | ERROR | Source |
| rfc4253.txt | 66.7 KB | TIE | ERROR | Source |
| rfc4254.txt | 49.2 KB | TIE | ERROR | Source |
| rfc4271.txt | 217.5 KB | TIE | ERROR | Source |
| rfc4566.txt | 106.3 KB | TIE | ERROR | Source |
| rfc4648.txt | 34.7 KB | TIE | ERROR | Source |
| rfc4880.txt | 198.9 KB | TIE | ERROR | Source |
| rfc4918.txt | 269.9 KB | TIE | ERROR | Source |
| rfc5246.txt | 217.2 KB | TIE | ERROR | Source |
| rfc5280.txt | 344.3 KB | TIE | ERROR | Source |
| rfc5321.txt | 220.6 KB | TIE | ERROR | Source |
| rfc5322.txt | 119.5 KB | TIE | ERROR | Source |
| rfc5545.txt | 337.4 KB | TIE | ERROR | Source |
| rfc5789.txt | 21.2 KB | TIE | ERROR | Source |
| rfc5849.txt | 78.9 KB | TIE | ERROR | Source |
| rfc6125.txt | 133.3 KB | TIE | ERROR | Source |
| rfc6202.txt | 43.7 KB | TIE | ERROR | Source |
| rfc6241.txt | 204.6 KB | TIE | ERROR | Source |
| rfc6455.txt | 158.3 KB | TIE | ERROR | Source |
| rfc6570.txt | 77.9 KB | TIE | ERROR | Source |
| rfc6749.txt | 159.7 KB | TIE | ERROR | Source |
| rfc6838.txt | 71.2 KB | TIE | ERROR | Source |
| rfc7049.txt | 130.9 KB | TIE | ERROR | Source |
| rfc7159.txt | 26.8 KB | TIE | ERROR | Source |
| rfc7230.txt | 201.1 KB | TIE | ERROR | Source |
| rfc7231.txt | 229.5 KB | TIE | ERROR | Source |
| rfc7232.txt | 55.4 KB | TIE | ERROR | Source |
| rfc7233.txt | 45.8 KB | TIE | ERROR | Source |
| rfc7234.txt | 88.5 KB | TIE | ERROR | Source |
| rfc7235.txt | 37.2 KB | TIE | ERROR | Source |
| rfc7489.txt | 158.9 KB | TIE | ERROR | Source |
| rfc7519.txt | 61.6 KB | TIE | ERROR | Source |
| rfc7540.txt | 204.7 KB | TIE | ERROR | Source |
| rfc7578.txt | 29.5 KB | TIE | ERROR | Source |
| rfc791.txt | 92.7 KB | TIE | ERROR | Source |
| rfc793.txt | 168.7 KB | TIE | ERROR | Source |
| rfc7950.txt | 383.9 KB | TIE | ERROR | Source |
| rfc8032.txt | 100.8 KB | TIE | ERROR | Source |
| rfc8259.txt | 27.7 KB | TIE | ERROR | Source |
| rfc8439.txt | 86.8 KB | TIE | ERROR | Source |
| rfc8446.txt | 329.8 KB | TIE | ERROR | Source |
| rfc8555.txt | 192.3 KB | TIE | ERROR | Source |
| rfc8615.txt | 26.2 KB | TIE | ERROR | Source |
| rfc9000.txt | 394.0 KB | TIE | ERROR | Source |
| rfc9110.txt | 491.2 KB | TIE | ERROR | Source |
| rfc9113.txt | 187.3 KB | TIE | ERROR | Source |
| rfc9114.txt | 151.6 KB | TIE | ERROR | Source |
| robinson_crusoe.txt | 638.7 KB | TIE | ERROR | Source |
| scarlet_letter.txt | 514.0 KB | TIE | ERROR | Source |
| sense_and_sensibility.txt | 696.2 KB | TIE | ERROR | Source |
| shakespeare_complete.txt | 5.4 MB | TIE | ERROR | Source |
| sherlock_holmes.txt | 593.4 KB | TIE | ERROR | Source |
| siddhartha.txt | 331.1 KB | TIE | ERROR | Source |
| sincosf.txt | 415.5 KB | TIE | ERROR | Source |
| souls_of_black_folk.txt | 418.7 KB | TIE | ERROR | Source |
| study_in_scarlet.txt | 262.5 KB | TIE | ERROR | Source |
| tale_of_two_cities.txt | 788.3 KB | TIE | ERROR | Source |
| the_aeneid.txt | 641.4 KB | TIE | ERROR | Source |
| the_iliad.txt | 1.1 MB | TIE | ERROR | Source |
| the_prince.txt | 300.8 KB | TIE | ERROR | Source |
| the_republic.txt | 1.2 MB | TIE | ERROR | Source |
| thus_spake_zarathustra.txt | 667.0 KB | TIE | ERROR | Source |
| time_machine.txt | 199.6 KB | TIE | ERROR | Source |
| tom_sawyer.txt | 424.2 KB | TIE | ERROR | Source |
| treasure_island.txt | 390.5 KB | TIE | ERROR | Source |
| turn_of_the_screw.txt | 253.2 KB | TIE | ERROR | Source |
| ulysses.txt | 1.5 MB | TIE | ERROR | Source |
| uncle_toms_cabin.txt | 1.0 MB | TIE | ERROR | Source |
| walden.txt | 213.4 KB | TIE | ERROR | Source |
| war_and_peace.txt | 3.2 MB | TIE | ERROR | Source |
| war_of_the_worlds.txt | 358.1 KB | TIE | ERROR | Source |
| white_fang.txt | 418.9 KB | TIE | ERROR | Source |
| wizard_of_oz.txt | 231.7 KB | TIE | ERROR | Source |
| wuthering_heights.txt | 677.6 KB | TIE | ERROR | Source |
| yellow_wallpaper.txt | 51.4 KB | TIE | ERROR | Source |
Frequently Asked Questions
Why does PZIP beat LZMA on text?
+
LZMA uses a sliding window to find repeated byte sequences. PZIP builds a character-level probability model — essentially learning the language as it compresses. For natural language prose, this statistical approach is more powerful.
What text files work best with PZIP?
+
Longer prose works see the best savings (14-28%). Short texts (<10KB) may not benefit as the model needs data to learn from.
PZIP vs Other Compressors for Plain Text
| Feature | PZIP | LZMA / xz | gzip | zstd |
|---|---|---|---|---|
| Type-Aware | Yes | No | No | No |
| Lossless | Yes | Yes | Yes | Yes |
| Never-Worse Guarantee | Yes | N/A | N/A | N/A |
| Best Plain Text Saving | 0% | Baseline | Worse | ~Similar |
| Round-Trip Verified | Every file | Manual | Manual | Manual |
Try PZIP on Your Plain Text Files
Upload any Plain Text file up to 30 MB. Free during beta — no signup required. See how much smaller PZIP makes it.
Baseline: LZMA-9 (maximum compression)
Competitors: gzip-9, bz2-9, brotli-11, zstd-19, PPMd 2-24
Verification: Byte-exact round-trip on every file
Guarantee: Never-worse (PZIP <= LZMA, always)
Test files: 161 real-world Plain Text files
Updated: 2026-02-15