this post was submitted on 03 Oct 2023
1831 points (97.7% liked)

Technology

59656 readers
2691 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

We are contacting you regarding a past Prime Video purchase(s). The below content is no longer playable on Prime Video.

In an effort to compensate you for the inconvenience, we have applied a £5.99 Amazon Gift Card to your account. The Gift Card amount is equal to the amount you paid for the Prime Video purchase(s). To apologize for the inconvenience, we've also added an Amazon Gift Certificate of £5 to your account. Your Gift Card balance will be automatically applied to your next eligible order. You can view your balance and usage history in Your Account here:

you are viewing a single comment's thread
view the rest of the comments
[–] archomrade@midwest.social 1 points 1 year ago (1 children)

I mean, I'd really have to disagree, but that's fine.

The effort involved with deconstructing a book, batching it through a document scanner, and compiling it with OCR in a EBOOK-compatible format is not trivial. Most consumer-quality OCR software isn't even that great at recognizing words, new lines, symbols, and hyphenated and line-broken words, let alone recognizing chapters, indexes, footnotes, ect. It's just not something that would be worthwhile for what it produces in the end, and there are millions more print titles than there are movie and show titles.

On the other hand, with A/V there's almost always a way to pass playback through a virtual media capture device. Worst-case you have to wait the real run-time in order to capture it, but at the end you at least have a near-original quality file.

If tomorrow all EBOOKs got locked down without a means to strip DRM, I don't think anyone outside of historical archivists would start spending their time manually cataloguing copyrighted hard copy books to distribute freely. Best-case, only the highest-demanded books would justify that amount of effort, and certainly not enough books to sustain a digital library worth frequenting.

[–] nybble41@programming.dev 1 points 1 year ago

Historically speaking, people have gone to the trouble of manually digitizing hard copy books to distribute freely. There were digital copies of print books available online (if you knew where to look) before e-books were officially available for sale in any form. That includes mass-market novels as well as items of interest to historians. Ergo, your scepticism seems entirely unjustified.

OCR is far from perfect (though editing OCR output is generally faster than retyping), but even without it we have the storage and bandwidth these days to distribute full books as stacks of images if needed, without converting them to text. The same way people distribute scans of comics/manga.