Skip to content

zip file encoding from SOEPindex shared via Owncloud not properly read by default on linux machine

Issue:

image

  • I am trying to unzip the shared SOEPindex zip-file found on owncloud and, in my linux machine, the non-ASCII characters are probably encoded in a different encoding. This has the effect of breaking the HTML links within the page.
  • I am using Ubuntu-based Pop!OS distribution.

Fix:

After trying a few usual encodings, IBM437 seems to work out. (The following command inflates the zipped file in the same directory that it is located.)

unzip -O IBM437 ./soepindex.zip -d ./

I hope this helps should anyone face the same issue. Since the issue is already solved on my part, feel free to close it, should it not affect any other users.

Edited by Marcelo Avila