287
you are viewing a single comment's thread
view the rest of the comments
[-] umbrella@lemmy.ml 30 points 5 months ago

how big is the database?

books can't be that big, but i'm guessing the selection is simply huge?

[-] xrtxn@lemmy.sdf.org 50 points 5 months ago

The selection is literally all books that can be found on the internet.

[-] tsonfeir@lemm.ee 13 points 5 months ago
[-] Index@feddit.nl 33 points 5 months ago

According to their total dataset size excluding duplicates, over 900 TB

[-] rufus@discuss.tchncs.de 17 points 5 months ago

Sure, that's a bit more than $65.000 per year with Backblaze.

[-] tsonfeir@lemm.ee 12 points 5 months ago

Shit, my synology has more than that… alas, it is full of movie “archives”

[-] state_electrician@discuss.tchncs.de 18 points 5 months ago

You run a petabyte Synology at home?

[-] tsonfeir@lemm.ee 7 points 5 months ago

Well, it’s not just a single synology, it’s got a bunch of expansion units, and there are multiple host machines.

[-] dutchkimble@lemy.lol 6 points 5 months ago

I'm guessing you're talking GBs?

[-] tsonfeir@lemm.ee 8 points 5 months ago
[-] dutchkimble@lemy.lol 2 points 5 months ago

That's awesome - how many drives and of what sizes do you have? Also why synology instead of higher enterprise grade solution at this point?

[-] tsonfeir@lemm.ee 4 points 5 months ago

Right now most of them are 20T each. I started smaller at first, but they’ve dropped so much in price. I usually wait until a sale and grab a bunch. There are… math… 62 drives?

When I first started, I only had the 6 bay… I chose synology because I wanted something that was managed for me. I don’t want to have to focus on setting things up and possibly doing things wrong. It comes with amazing tools. Also, the server buy-in was a lot less than the other “professional” rack mounted solutions.

I had such a great experience that I just kept with them. It is a pretty expensive hobby though, but so is buying physical movies. And, some things never get a physical release, so having it digitally protects me from when Netflix, or whomever, decides to drop something.

[-] FigMcLargeHuge@sh.itjust.works 3 points 5 months ago* (last edited 5 months ago)

They put a link in with the total...

Total Excluding duplicates 133,708,037 files 913.1 TB

[-] umbrella@lemmy.ml 1 points 5 months ago* (last edited 5 months ago)

wait what? how expensive is it to buy and run? is it practical at all, what are the common snags? always wanted to get into doing some archiving.

[-] tsonfeir@lemm.ee 7 points 5 months ago

It’s an investment. It’s like the price of a small car. But it was built over time, so not like one lump sum.

Originally, it was to have easier access to my already insane Blu-ray collection. But I started getting discs from Redbox, rental stores, libraries, etc. they are full rip, not that compressed PB stuff. Now there are like 3000 movies and fuck knows how many tv shows.

A lot of my effort was to have the best release available. Or, have things that got canceled. Like the Simpsons episode with MJ, which is unavailable to stream.

Snags… well, synology is sooo easy. Once you figure out how you want you drives set up, there’s nothing to it.

Whatever you do, always have redundant drives. Yes, you lose space, but eventually one of them is gonna die and you don’t want to lose data.

[-] redcalcium@lemmy.institute 11 points 5 months ago

You should write a will instructing your family to send those disks to the internet archive for preservation if something happened to you.

[-] AmbiguousProps@lemmy.today 5 points 5 months ago

Correct me if I'm wrong, but they only index shadow libraries and do not host any files themselves (unless you count the torrents). So, you don't need 900+ TB of storage to create a mirror.

[-] FreudianCafe@lemmy.ml 5 points 5 months ago

I guess more than 5?

[-] Cuntessera@sh.itjust.works 4 points 5 months ago

I imagine a couple of terabytes at the very least, though, I could be underestimating how many books have got deDRMed so far.

[-] tsonfeir@lemm.ee 4 points 5 months ago
[-] Cuntessera@sh.itjust.works 9 points 5 months ago

Girl, what? No wonder they’re having trouble hosting their archive. Does Anna’s Archive host copyrighted content as well or is all that copyleft?

[-] redcalcium@lemmy.institute 13 points 5 months ago

They host academic papers and books, most of them are copyrighted contents. They recently got in trouble for scraping a book metadata service to generate a list of books that hasn't been archived yet: https://torrentfreak.com/lawsuit-accuses-annas-archive-of-hacking-worldcat-stealing-2-2-tb-data-240207/

[-] AmbiguousProps@lemmy.today 2 points 5 months ago* (last edited 5 months ago)

They index, not host, no? (Unless you count the torrents, which are distributed)

[-] Cuntessera@sh.itjust.works 2 points 5 months ago

Is hosting all that stuff even legal? I mean, they’re not making any money off of it, but they’re still a “piracy” hub. How have they survived this long?

[-] catloaf@lemm.ee 4 points 5 months ago

No, it's not.

They've survived by making themselves hard to identify and shut down. And as we can see here, by creating redundancies.

[-] redcalcium@lemmy.institute 4 points 5 months ago

It's very illegal. iirc it was created by a group called "Pirate Library Mirror" after the guy that runs z-library got arrested, so I assume they're taking anonymity seriously to avoid arrest.

[-] smnwcj@fedia.io 4 points 5 months ago

The archive includes copyrighted works. Often multiple copies of each work, across different formats.

[-] spiderman@ani.social 2 points 5 months ago* (last edited 5 months ago)

bigger than zlib or project Gutenberg?

[-] redcalcium@lemmy.institute 15 points 5 months ago

It is huge! They claimed to have preserved about 5% of the world’s books.

[-] umbrella@lemmy.ml 2 points 5 months ago

oh i actually tought it was way more! there wasnt a single book i wanted (or even tought to look up) that i didnt actually find in there.

this post was submitted on 31 Mar 2024
287 points (99.3% liked)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

53958 readers
747 users here now

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder


💰 Please help cover server costs.

Ko-FiLiberapay


founded 1 year ago
MODERATORS