[INDOLOGY] Diacriticals in unicode, single or multiple glyphs

Harry Spier hspier.muktabodha at gmail.com
Fri Nov 18 12:58:58 UTC 2016

Dear list members,

In unicode you can write characters with diacriticals with either a single
glyph or you can combine the character with the diacritical writing it in
two glyphs.

This is a problem when one searchs sanskrit etexts.

For example, the letters with diacriticals in the Muktabodha digital
library are written with one glyph and as far as I can see GRETIL does the
same thing.  But the transcoding utility at  "The Sanskrit Library"
combines letters with their diacriticals in two glyphs.
 So if you used the Sanskrit Library utility to create a transliterated
word such as for example: *śākti* and then searched texts from either
GRETIL or Muktabodha for that word your search wouldn't find anything.

Harry Spier

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://list.indology.info/pipermail/indology/attachments/20161118/76de491d/attachment.htm>

More information about the INDOLOGY mailing list