Announcement of a Sanskrit Search Engine
ambapradeep at GMAIL.COM
Thu Mar 22 16:19:12 UTC 2012
Greetings on the Nandana nāma samvatsaraḥ.
The Department of Sanskrit Studies, University of Hyderabad announces
release of the FIRST Search Engine for Sanskrit -- गवेषिका available at
This search engine integrates a Sanskrit morphological analyser with the
basic search engine resulting in better search results.
The Sanskrit Search Engine गवेषिका has following search options
a) Basic Search:This is a basic search feature where you can search
a bare string "as it as" .
b) Search with regular expressions:This allows one to search a string
with various patterns as shown below.
i) str1 AND str2: searches for the instances of both str1 and
str2 in the same text.
ii) str1 OR str2: searches for the instances of either or both
iii) str1 +str2: occurences where str2 IS present and optionally
str1 may be.
iv) "str1 str2": literal string "str1 str2"
In addition you may use the wild characters '?' and '*'.
c) Search with morphology enabled: This allows one to search for
strings with variations in their forms due to inflectional suffixes.
E.g. given राम as a प्रतिपदिकम् (nominal stem) with gender
पुंलिङ्गम् the machine searches occurences of various nominal declensions
of राम such as रामेण , रामस्य etc. allowing the spelling variations (such
as अंकः / अङ्कः) as well.
We acknowledge with thanks the following software tools and corpora used
for the development of Search Engine.
a) Lucene a free Search Engine available under GPL at
b) A morphological analyser and a generator developed by the consortium
of 7 institutes led by the Department under the project
"Development of Sanskrit computational toolkit and Sanskrit-Hindi
Machine Translation system" funded by TDIL programme of DIT(2008-2012).
c) Gretil Corpus available at
d) Critical edition of Mahabharat available at Bhandarkar Oriental
Research Institute, Pune http://bori.ac.in/
e) Corpus developed by Peter Scharf and available at Sanskrit Library
f) Corpus developed by Oliver Hellwig and available at DCS
g) Corpus developed by the Department of Sanskrit Studies, University of
h) Corpus developed by the Sanskrit Consortium, funded by TDIL
Amba Kulkarni, Head, Department of Sanskrit Studies, University of
amrutham.gowri at gmail.com
kannaiah.chinni at gmail.com
ambapradeep at gmail.com
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the INDOLOGY