Jump to content

mbdick

Members
  • Content count

    6
  • Joined

  • Last visited

Community Reputation

0 Poker-Face

About mbdick

  • Rank
    Lurker
  1. mbdick

    OCR on a UNICODE PDF document

    I think I worded my inquiry poorly. Of course I do not want an OCR for a language dead for 2000 years! My problem I guess is that I have problems getting PDFelement to recognize UNICODE. As my example shows ša i-na bala-e Iiri-ba damar.utu ṭūḫḫeṣi; when PDFelement encounters a line like that in an article and creates a PDF, it seems to garble this line, even though it should be easily recognizable as UNICODE (Adobe PDF did), additionally it should be able to recognize superscripts and subscripts. I don't expect it to know what the line says (OCR), but it should be able to reproduce it in a PDF. Is there a tweak that takes PDFelement beyond ASCII with accents?
  2. mbdick

    OCR on a UNICODE PDF document

    I occasionally have to OCR a scanned PDF document which is in Unicode but not in one of the OCR languages. It's in 2500-years-old Akkadian. ša i-na bala-e (17) Iiri-ba damar.utu ṭūḫḫeṣi (BTW the last word does not exist but I made it up to indicate the type of letters I use) How do I tweak PDFElement 6 professional to just slavishly render the text as Unicode so I can search it?
Digitize paperwork and accelerate the way you create, prepare and sign documents.

Available for Windows, Mac, iOS, & Android.

Try Free Buy Now
Start your free trial!

Skip and Download

×
Start your free trial!

Skip and Download

×
×