InterPals Parsed?
by Sneed - Wednesday September 20, 2023 at 08:23 PM
#1
Hello, I'm wondering if anyone has tried parsing this absolute wreck of a file? Whoever dumped this really had no idea what they were doing; everything is misaligned and the text encoding seems to be a tug-of-war between ANSI and UTF-8. Normally it is quite straightforward but I have given up on this one for now, too lazy to write regex for it.

Example of what I mean:
[Image: v9egn7.png]
Formerly @‍God, but that username was stolen from me.
Reply
#2
normally i can do it, but let me take a look on the data and if i have free time.
i'll post it when i done
Reply
#3
(09-24-2023, 08:14 AM)All3in Wrote: normally i can do it, but let me take a look on the data and if i have free time.
i'll post it when i done

Thank you, much appreciated!
Formerly @‍God, but that username was stolen from me.
Reply
#4
I'm using the latest version of EmEditor and have absolutely no problems with the encoding,
it is detected completely correctly as UTF-8 without a signature, why can't you do the same?

https://breachforums.is/Thread-InterPals...d-Download
Reply
#5
(09-24-2023, 08:24 PM)Blastoise Wrote: I'm using the latest version of EmEditor and have absolutely no problems with the encoding,
it is detected completely correctly as UTF-8 without a signature, why can't you do the same?

https://breachforums.is/Thread-InterPals...d-Download

Thanks but unfortunately the official version is the messy version which I have.

Some of the names are correctly UTF-8, others are UTF-8 encoded as CP-1252 and then decoded incorrectly as UTF-8. The inconsistency causes some names to show up as mojibake, while others are fine. Normally this wouldn't matter too much but for a site based around an international audience it's very messy.

Besides this, the columns are not at synced as shown in my screenshot, I am trying to load large databases into ElasticSearch automatically so this is important.
Formerly @‍God, but that username was stolen from me.
Reply




 Users browsing this thread: 1 Guest(s)