How large is GBAtemp?

1734482270901.png

This should be most of the site. (taken from the main page.)
It's reasonably small, though this is all compiled. I'd imagine the source HTML/CSS/JS files are WAY larger.
 
  • Like
Reactions: SylverReZ
GBATemp is a secret country, that defends itself with dildos.

OOOOOOH you meant GB wise. Nevermind carry on, ignore the first sentence.
 
When entering the front page, around 2.5mb (cache disabled)
1734483091535.png


If you're asking about how much backend data is stored in the database, then that's something only the admin's would probably know about.
 
  • Like
Reactions: cearp
^!!
Your profile picture must've been visible somewhere on the site when I collected the files.
You do realize that webcrawlers can only access what is publicly facing in a website, and that usually only constitutes about 0.00001% of a large website like this one, right? I can name one particular type of file that is almost never publicly visible to webcrawlers: SQL databases, like the ones that forums use to store every user's entire profiles (including post history, times and dates online (basically an entire activity log) and every single status on the profile as well as all edits done to said profile since it was created), all forum posts in every thread (including non-public threads and the server chat log backups), as well as extremely sensitive information like usernames and passwords, auth info for any connected accounts (like Facebook or Google), and user last used IP addresses. These files make up essentially 99.9% of a website like this one's bulk.

tl;dr
The actual size is tens of millions of times bigger than what you 'extracted'.
 
You do realize that webcrawlers can only access what is publicly facing in a website, and that usually only constitutes about 0.00001% of a large website like this one, right? I can name one particular type of file that is almost never publicly visible to webcrawlers: SQL databases, like the ones that forums use to store every user's entire profiles (including post history, times and dates online (basically an entire activity log) and every single status on the profile as well as all edits done to said profile since it was created), all forum posts in every thread (including non-public threads and the server chat log backups), as well as extremely sensitive information like usernames and passwords, auth info for any connected accounts (like Facebook or Google), and user last used IP addresses. These files make up essentially 99.9% of a website like this one's bulk.

tl;dr
The actual size is tens of millions of times bigger than what you 'extracted'.
Which is why I am confused my pfp is in the dl
 
You do realize that webcrawlers can only access what is publicly facing in a website, and that usually only constitutes about 0.00001% of a large website like this one, right? I can name one particular type of file that is almost never publicly visible to webcrawlers: SQL databases, like the ones that forums use to store every user's entire profiles (including post history, times and dates online (basically an entire activity log) and every single status on the profile as well as all edits done to said profile since it was created), all forum posts in every thread (including non-public threads and the server chat log backups), as well as extremely sensitive information like usernames and passwords, auth info for any connected accounts (like Facebook or Google), and user last used IP addresses. These files make up essentially 99.9% of a website like this one's bulk.

tl;dr
The actual size is tens of millions of times bigger than what you 'extracted'.
The SQL database is only a small part of it (it's just text, after all)
Most of it is going to be attachments. I don't know if there's a file size limit for attachments, but I've been able to upload large images no problems. Every thread posted by a noob asking for help has images, blog posts are full of images, the meme box has LOTS of them.
The entire site is probably between 1-10TB. Just for reference, 1 TB is ~500k images at 2 MB each. There's probably at least that many images scattered across the site, of course they aren't all going to be 2 MB, but that's a reasonable average given that modern smartphone photos can easily be 5 MB while screenshots are usually less than 2 MB, and older photos would be lower resolution.

People don't really attach videos (if that's even possible), which could inflate the size a lot, and there are some non-image attachments, but they are infrequent and relatively small in size.
There's also the download center, but people don't use it that much, and those files are also relatively small.
 
Last edited by The Real Jdbye,
You do realize that webcrawlers can only access what is publicly facing in a website, and that usually only constitutes about 0.00001% of a large website like this one, right? I can name one particular type of file that is almost never publicly visible to webcrawlers: SQL databases, like the ones that forums use to store every user's entire profiles (including post history, times and dates online (basically an entire activity log) and every single status on the profile as well as all edits done to said profile since it was created), all forum posts in every thread (including non-public threads and the server chat log backups), as well as extremely sensitive information like usernames and passwords, auth info for any connected accounts (like Facebook or Google), and user last used IP addresses. These files make up essentially 99.9% of a website like this one's bulk.

tl;dr
The actual size is tens of millions of times bigger than what you 'extracted'.

I'd imagine the source HTML/CSS/JS files are WAY larger.
 
The SQL database is only a small part of it (it's just text, after all)
Most of it is going to be attachments. I don't know if there's a file size limit for attachments, but I've been able to upload large images no problems. Every thread posted by a noob asking for help has images, blog posts are full of images, the meme box has LOTS of them.
The entire site is probably between 1-10TB. Just for reference, 1 TB is ~500k images at 2 MB each. There's probably at least that many images scattered across the site, of course they aren't all going to be 2 MB, but that's a reasonable average given that modern smartphone photos can easily be 5 MB while screenshots are usually less than 2 MB, and older photos would be lower resolution.

People don't really attach videos (if that's even possible), which could inflate the size a lot, and there are some non-image attachments, but they are infrequent and relatively small in size.
There's also the download center, but people don't use it that much, and those files are also relatively small.
Actually, when GBAtemp was hacked a bit of ways back (the hacker didn't make any info public; they actually DESTROYED data), it was revealed that the Download Center and attachments, in addition to all downloads in filetrip,net, were stored in those very same SQL databases, which were quite literally zeroed out. At the time, the SQL encryption keys were stored, very clearly and legibly, in an Admin-only visible document that was in plaintext, so the hacker at that point basically was able to effectively zero out 90% of the databases before he was locked out. filetrip.net being completely brought down was a terrible blow to GBAtemp as a whole, as so many files were utterly lost to the internet forever. It did not have a backup, although the server disk had recovery software installed on it after the fact and almost everything else was intact for the most part and thus recoverable.

EDit: Also, SQLite databases are the ones that are just text; technically, enterprise SQL server databases can store up to 4TB of raw data each.
 
It's reasonably small, though this is all compiled. I'd imagine the source HTML/CSS/JS files are WAY larger.
html, css, and js are not compiled languages.
(And it's not necessarily true that source code is bigger than the compiled output)

Like someone said above, the posts (and of course attachments) will take up much more data than the files your web browser displays.
Read about "frontend" vs "backend" :)
 
Actually, when GBAtemp was hacked a bit of ways back (the hacker didn't make any info public; they actually DESTROYED data), it was revealed that the Download Center and attachments, in addition to all downloads in filetrip,net, were stored in those very same SQL databases, which were quite literally zeroed out. At the time, the SQL encryption keys were stored, very clearly and legibly, in an Admin-only visible document that was in plaintext, so the hacker at that point basically was able to effectively zero out 90% of the databases before he was locked out. filetrip.net being completely brought down was a terrible blow to GBAtemp as a whole, as so many files were utterly lost to the internet forever. It did not have a backup, although the server disk had recovery software installed on it after the fact and almost everything else was intact for the most part and thus recoverable.

EDit: Also, SQLite databases are the ones that are just text; technically, enterprise SQL server databases can store up to 4TB of raw data each.
I mean yeah, you can store binary data, in them but it just doesn't make sense to do so... Even SQlite supports that, with the BLOB datatype.
FileTrip was not brought down by hackers. It was shut down due to a couple issues. https://gbatemp.net/threads/filetrip-is-now-permanently-closed.534841/
I don't remember any of that other stuff being revealed either.
 

Site & Scene News

Popular threads in this forum