Could you compress text files by mapping a word to how commonly it is used and translating it with an application?

Corroded · 1 year ago

Could you compress text files by mapping a word to how commonly it is used and translating it with an application?

nychtelios@rlyeh.icu · 1 year ago

You cannot represent everything using english text, and text in known languages can be extremely compressed just because we know details about its structure. (And anyway, it cannot be compressed that extremely, information theory explains this very well, computational power isn’t the only limit here).

If you cannot represent everything using valid english text, you cannot compress at high rates without losing information. A big part of digital data is actually noise, and noise cannot be compressed by definition.