[INDOLOGY] OCR with diacritics

Paras Mehta psmehta.in at gmail.com
Wed Mar 26 05:07:20 UTC 2025


Respected scholars,

An Indology book publisher whom I know has acquired the copyright to an
old book on Indology and wants to republish it. The book is in English and
has many Sanskrit terms in Roman script (i.e. with diacritics). Because a
print-ready soft copy of the book is no longer available, the publisher
wishes to scan its pages and run OCR on the scans. The text obtained by
OCR will then be laid out in a file and made ready for reprint.

I would like to know if there is a good OCR tool that can take the
scans and accurately extract the English text along with the Romanized
Sanskrit words.

Thank you.

Best wishes,
Paras Mehta
Researcher at École française d'Extrême-Orient (Pondicherry)