Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
Converting a pdf or changing its fonts
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Multimedia
View previous topic :: View next topic  
Author Message
Lockup
Guru
Guru


Joined: 25 Jul 2002
Posts: 430

PostPosted: Sat Mar 22, 2008 8:01 pm    Post subject: Converting a pdf or changing its fonts Reply with quote

I was recently given a pdf file with terrible fonts - everything is blurry and badly aligned (it was most likely scanned). However, KPDF still allows me to select text and copies it properly to the clipboard. This has me wondering: are there any tools that would allow me to either convert this pdf to another format or at least select which font is used for rendering?

Small catch: the pdf contains a lot of mathematical notation, and the conversion has to take care of them properly (converting them to an image would be ok).

I tried pdftohtml, but the html document that it generated contained nothing but underscores.

Thanks.
Back to top
View user's profile Send private message
barbar
Guru
Guru


Joined: 16 Apr 2003
Posts: 397
Location: Austria

PostPosted: Sun Mar 23, 2008 6:11 am    Post subject: Reply with quote

You can try
Code:
pdftotext
or try to convert the pdf into ps with
Code:
pdf2ps
and view with ghostview, gv or other postscript viewers.Not sure if it works with blurred documents.

Otherwise convert your document to ppm format with
Code:
pdftoppm
and let an ocr software (like ocrad or gocr) do the dirty work. I have not tried one of these ocr programs.
Back to top
View user's profile Send private message
i92guboj
Bodhisattva
Bodhisattva


Joined: 30 Nov 2004
Posts: 10315
Location: Córdoba (Spain)

PostPosted: Sun Mar 23, 2008 1:21 pm    Post subject: Reply with quote

barbar wrote:
You can try
Code:
pdftotext
or try to convert the pdf into ps with
Code:
pdf2ps
and view with ghostview, gv or other postscript viewers.Not sure if it works with blurred documents.


I suppose that your best bet is to try to convert them to ps. But I am no expert in the matter.

Quote:

Otherwise convert your document to ppm format with
Code:
pdftoppm
and let an ocr software (like ocrad or gocr) do the dirty work. I have not tried one of these ocr programs.


I never got any useful result from any orc program under linux, as much as I love open source, ocr still sucks around here. I would be most happy to be wrong.
Back to top
View user's profile Send private message
StarDragon
Guru
Guru


Joined: 19 Jun 2005
Posts: 390
Location: tEXas

PostPosted: Sun Mar 23, 2008 9:17 pm    Post subject: Reply with quote

Have you tried opening it on acroread?
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Multimedia All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum