I think the title should explain it all.
(Like a gigabyte size, a file size.)
(Like a gigabyte size, a file size.)
Last edited by choolisfound,
I updated the description to clarify what I meant.Well it has 622586 members (although I think only 1/10 of them is active nowadays)


Why am I on the main page?View attachment 476369
This should be most of the site. (taken from the main page.)
It's reasonably small, though this is all compiled. I'd imagine the source HTML/CSS/JS files are WAY larger.
Why am I on the main page?
^!!(taken from the main page.)



You do realize that webcrawlers can only access what is publicly facing in a website, and that usually only constitutes about 0.00001% of a large website like this one, right? I can name one particular type of file that is almost never publicly visible to webcrawlers: SQL databases, like the ones that forums use to store every user's entire profiles (including post history, times and dates online (basically an entire activity log) and every single status on the profile as well as all edits done to said profile since it was created), all forum posts in every thread (including non-public threads and the server chat log backups), as well as extremely sensitive information like usernames and passwords, auth info for any connected accounts (like Facebook or Google), and user last used IP addresses. These files make up essentially 99.9% of a website like this one's bulk.^!!
Your profile picture must've been visible somewhere on the site when I collected the files.

Which is why I am confused my pfp is in the dlYou do realize that webcrawlers can only access what is publicly facing in a website, and that usually only constitutes about 0.00001% of a large website like this one, right? I can name one particular type of file that is almost never publicly visible to webcrawlers: SQL databases, like the ones that forums use to store every user's entire profiles (including post history, times and dates online (basically an entire activity log) and every single status on the profile as well as all edits done to said profile since it was created), all forum posts in every thread (including non-public threads and the server chat log backups), as well as extremely sensitive information like usernames and passwords, auth info for any connected accounts (like Facebook or Google), and user last used IP addresses. These files make up essentially 99.9% of a website like this one's bulk.
tl;dr
The actual size is tens of millions of times bigger than what you 'extracted'.
The SQL database is only a small part of it (it's just text, after all)You do realize that webcrawlers can only access what is publicly facing in a website, and that usually only constitutes about 0.00001% of a large website like this one, right? I can name one particular type of file that is almost never publicly visible to webcrawlers: SQL databases, like the ones that forums use to store every user's entire profiles (including post history, times and dates online (basically an entire activity log) and every single status on the profile as well as all edits done to said profile since it was created), all forum posts in every thread (including non-public threads and the server chat log backups), as well as extremely sensitive information like usernames and passwords, auth info for any connected accounts (like Facebook or Google), and user last used IP addresses. These files make up essentially 99.9% of a website like this one's bulk.
tl;dr
The actual size is tens of millions of times bigger than what you 'extracted'.
You do realize that webcrawlers can only access what is publicly facing in a website, and that usually only constitutes about 0.00001% of a large website like this one, right? I can name one particular type of file that is almost never publicly visible to webcrawlers: SQL databases, like the ones that forums use to store every user's entire profiles (including post history, times and dates online (basically an entire activity log) and every single status on the profile as well as all edits done to said profile since it was created), all forum posts in every thread (including non-public threads and the server chat log backups), as well as extremely sensitive information like usernames and passwords, auth info for any connected accounts (like Facebook or Google), and user last used IP addresses. These files make up essentially 99.9% of a website like this one's bulk.
tl;dr
The actual size is tens of millions of times bigger than what you 'extracted'.
I'd imagine the source HTML/CSS/JS files are WAY larger.

Actually, when GBAtemp was hacked a bit of ways back (the hacker didn't make any info public; they actually DESTROYED data), it was revealed that the Download Center and attachments, in addition to all downloads in filetrip,net, were stored in those very same SQL databases, which were quite literally zeroed out. At the time, the SQL encryption keys were stored, very clearly and legibly, in an Admin-only visible document that was in plaintext, so the hacker at that point basically was able to effectively zero out 90% of the databases before he was locked out. filetrip.net being completely brought down was a terrible blow to GBAtemp as a whole, as so many files were utterly lost to the internet forever. It did not have a backup, although the server disk had recovery software installed on it after the fact and almost everything else was intact for the most part and thus recoverable.The SQL database is only a small part of it (it's just text, after all)
Most of it is going to be attachments. I don't know if there's a file size limit for attachments, but I've been able to upload large images no problems. Every thread posted by a noob asking for help has images, blog posts are full of images, the meme box has LOTS of them.
The entire site is probably between 1-10TB. Just for reference, 1 TB is ~500k images at 2 MB each. There's probably at least that many images scattered across the site, of course they aren't all going to be 2 MB, but that's a reasonable average given that modern smartphone photos can easily be 5 MB while screenshots are usually less than 2 MB, and older photos would be lower resolution.
People don't really attach videos (if that's even possible), which could inflate the size a lot, and there are some non-image attachments, but they are infrequent and relatively small in size.
There's also the download center, but people don't use it that much, and those files are also relatively small.
html, css, and js are not compiled languages.It's reasonably small, though this is all compiled. I'd imagine the source HTML/CSS/JS files are WAY larger.
I mean yeah, you can store binary data, in them but it just doesn't make sense to do so... Even SQlite supports that, with the BLOB datatype.Actually, when GBAtemp was hacked a bit of ways back (the hacker didn't make any info public; they actually DESTROYED data), it was revealed that the Download Center and attachments, in addition to all downloads in filetrip,net, were stored in those very same SQL databases, which were quite literally zeroed out. At the time, the SQL encryption keys were stored, very clearly and legibly, in an Admin-only visible document that was in plaintext, so the hacker at that point basically was able to effectively zero out 90% of the databases before he was locked out. filetrip.net being completely brought down was a terrible blow to GBAtemp as a whole, as so many files were utterly lost to the internet forever. It did not have a backup, although the server disk had recovery software installed on it after the fact and almost everything else was intact for the most part and thus recoverable.
EDit: Also, SQLite databases are the ones that are just text; technically, enterprise SQL server databases can store up to 4TB of raw data each.