OCR of (transcribed) indic texts

E. Bashir ebashir at UMICH.EDU
Tue Nov 24 19:50:29 UTC 1998

I am also interested in the answers to these questions.  If anyone has
helpful information on them, please reply to the list.


E. Bashir, Visiting Lecturer
Dept. of Asian Languages and Cultures
3070 Frieze Bldg.
The University of Michigan
Ann Arbor, MI 48109

Office Phone:  734-764-0214
Dept. Phone:  734-764-8286 (messages only)
Fax:  734-647-0157

On Tue, 24 Nov 1998, Juergen Neuss wrote:

> Dear list-members,
> maybe some of you are aware of the difficulties which arise if one wants
> to scan transcribed (not to speak of original) indic texts. Of course
> the scanning itself is not the problem but the subsequent transformation
> of the image-file into a text-file by means of OCR (Optical character
> recognition) programs. These programs often do recognise only the usual
> set of ASCII-characters. Some of them include extended features ehich
> means that in certain cases the user may direct the program to read a
> certain difficult character in a certain way. As far as I know
> diacritical signs are a problem for at least most of these programs. If
> anyone of you has experience with OCR programs in this respect I would
> be grateful for your recommendations. Moreover I would like to know
> whether there are any OCR programs available which recognise Indian
> characters of any kind. I hope this message does not provoke any
> response which violates the non-commercial spirit of this list.
> Thanks for reading.
> jn
> --
> jneuss at zedat.fu-berlin.de
> Juergen Neuss
> Freie Universitaet Berlin
> Institut f�r Indische Philologie und Kunstgeschichte
> K�nigin-Luise-Str. 34a
> 14195 Berlin

More information about the INDOLOGY mailing list