12
submitted 9 months ago* (last edited 9 months ago) by zabadoh@lemmy.ml to c/foss@beehaw.org

I have a book scan that came back from a book scanning company.

The scan images were fine, but the OCR text in the PDF are whacky, due to eccentric fonts, dirt, etc.

So I'm going to have to go through this by hand and tidy up.

I have tried a lot of FOSS PDF editors on this particular PDF, but none of them work as well as an old copy of Foxit PhantomPDF (an old version of the product currently named Foxit PDF Editor) that I have on a dying laptop.

I've tried the following commonly recommended FOSS PDF editors without much success:

LibreOffice Draw - Many text fields in wrong layer order. Page images not visible.

PDFEdit - Loads the file as blank

Scribus - Won't load the file

FireFox - only allows annotation changes

Inkscape - It sort of works, but it's not oriented towards text editing, so looking and editing text is cumbersome.

you are viewing a single comment's thread
view the rest of the comments
[-] sibloure@beehaw.org 2 points 8 months ago

This is amazing. Did not realize it existed. Thank you for sharing

this post was submitted on 21 Dec 2023
12 points (100.0% liked)

Free and Open Source Software

17718 readers
11 users here now

If it's free and open source and it's also software, it can be discussed here. Subcommunity of Technology.


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS