Page 1 of 1
[Solved] Software recommendation: 7-zip with CJK support
Posted: Thu May 13, 2021 3:10 am
by pvc
Does anybody know about any 7-zip extractor capable of dealing with CJK character set for Linux? Preferably without adding extra locale. p7zip can theoretically do that but needs to be recompiled with extra configuration flags, which I don't want to deal with. Maybe there is some easier solution.
Re: Software recommendation: 7-zip decompressor with CJK sup
Posted: Thu May 13, 2021 7:03 am
by bzt
Nope. 7-zip SDK is Windows-only, it's port to other OS is mostly garbage (no wonder most of them are rewrites, and it took YEARS to compile a cli 7-zip under Linux). Also it's compression, lzma2 is badly implemented, not transmission-safe and
inefficient (that page is about xz, however 7z uses exactly the same lzma2 encoding). If I were you, I'd choose another archive format instead of 7z in the first place.
If the archive is from a 3rd party and you cannot download the material in non 7z form, then give a try to
command line 7zip under Linux, but it is known to be buggy (see sourceforge forums and tickets, there are more than 500 open and unresolved issues).
Not sure, but maybe
XZ Utils can extract 7z files, worth a try.
Cheers,
bzt
Re: Software recommendation: 7-zip decompressor with CJK sup
Posted: Thu May 13, 2021 11:05 am
by Octocontrabass
The 7z format always uses UTF-16 for file names, so anything not based on the original 7-zip Windows code should at least handle characters in the BMP.
A quick search turns up
py7zr. Does that one work for you?
Re: Software recommendation: 7-zip decompressor with CJK sup
Posted: Thu May 13, 2021 11:31 am
by pvc
@Octocontrabass
Thanks. py7zr works like a charm. p7zip that comes from Arch repos says `locale=C,Utf16=off` in its help message, so that was probably the problem.