Announcement of a Sanskrit Search Engine

amba kulkarni ambapradeep at GMAIL.COM
Thu Mar 22 16:19:12 UTC 2012

Dear list,

Greetings on the Nandana nāma samvatsaraḥ.

The Department of Sanskrit Studies, University of Hyderabad, announces
the release of the FIRST Search Engine for Sanskrit -- गवेषिका, available at

This search engine integrates a Sanskrit morphological analyser with a
basic search engine, which yields better search results.


    The Sanskrit Search Engine गवेषिका has the following search options:

     a)   Basic Search: This is a basic search feature where you can search
for a bare string "as it is".
     b)  Search with regular expressions: This allows one to search for a
string with various patterns, as shown below.
          i)  str1 AND str2: searches for instances of both str1 and
str2 in the same text.
         ii) str1 OR str2: searches for instances of either string, or both.
        iii) str1 +str2: occurrences where str2 must be present and str1
is optional.
         iv) "str1 str2": the literal string "str1 str2".

          In addition, you may use the wildcard characters '?' (any single
character) and '*' (any sequence of characters).

      c)  Search with morphology enabled: This allows one to search for
strings with variations in their forms due to inflectional suffixes.
            E.g., given राम as a प्रातिपदिकम् (nominal stem) with gender
पुंलिङ्गम् (masculine), the machine searches for occurrences of the various
nominal declensions of राम, such as रामेण, रामस्य, etc., while also allowing
for spelling variations (such as अंकः / अङ्कः).
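
The idea behind option (c) can be illustrated with a small sketch in
Python. This is not the code of गवेषिका, only a toy under stated
assumptions: TOY_PARADIGMS is a hand-written table standing in for the
output of a real morphological generator (singular forms of राम only),
and normalize_nasals, expand_query and search are hypothetical names
chosen for this example. The spelling variation is handled by rewriting
a class nasal before a stop of the same class as anusvāra, so that
अङ्कः and अंकः compare equal.

```python
import re

VIRAMA = "\u094d"    # Devanagari sign virama
ANUSVARA = "\u0902"  # Devanagari sign anusvara

# Class nasal -> the stops of the same class that may follow it.
NASAL_CLASSES = {
    "ङ": "कखगघ",
    "ञ": "चछजझ",
    "ण": "टठडढ",
    "न": "तथदध",
    "म": "पफबभ",
}

# Assumption: a hand-written stand-in for a morphological generator.
# It lists only the singular forms of the masculine stem राम.
TOY_PARADIGMS = {
    ("राम", "m"): ["रामः", "रामम्", "रामेण", "रामाय", "रामात्", "रामस्य", "रामे"],
}


def normalize_nasals(text):
    """Rewrite nasal + virama before a same-class stop as anusvara."""
    for nasal, stops in NASAL_CLASSES.items():
        pattern = nasal + VIRAMA + "(?=[" + stops + "])"
        text = re.sub(pattern, ANUSVARA, text)
    return text


def expand_query(stem, gender):
    """Return the normalized inflected forms known for (stem, gender)."""
    forms = TOY_PARADIGMS.get((stem, gender), [stem])
    return {normalize_nasals(form) for form in forms}


def search(docs, stem, gender):
    """Return indices of documents containing any inflected form."""
    forms = expand_query(stem, gender)
    hits = []
    for index, doc in enumerate(docs):
        tokens = {normalize_nasals(token) for token in doc.split()}
        if tokens & forms:
            hits.append(index)
    return hits


if __name__ == "__main__":
    docs = ["रामेण रावणः हतः", "सीता वनं गच्छति", "रामस्य पुस्तकम्"]
    print(search(docs, "राम", "m"))
```

Both the query forms and the document tokens pass through the same
normalization, so a match does not depend on which spelling convention
either side happens to use.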

We acknowledge with thanks the following software tools and corpora used
for the development of the search engine.

   a) Lucene, a free search engine library released under the Apache License, available at
   b) A morphological analyser and a generator developed by the consortium
of 7 institutes led by the Department under the project
       "Development of Sanskrit computational toolkit and Sanskrit-Hindi
Machine Translation system" funded by the TDIL programme of DIT (2008-2012).
   c) GRETIL corpus, available at
   d) Critical edition of the Mahabharata, available at the Bhandarkar
Oriental Research Institute, Pune
   e) Corpus developed by Peter Scharf and available at the Sanskrit Library project.
   f) Corpus developed by Oliver Hellwig and available at DCS
   g) Corpus developed by the Department of Sanskrit Studies, University of
   h) Corpus developed by the Sanskrit Consortium, funded by the TDIL
programme, DIT (2008-2012).

    Sri Gowri

Amba Kulkarni, Head, Department of Sanskrit Studies, University of

amrutham.gowri at
kannaiah.chinni at
ambapradeep at


More information about the INDOLOGY mailing list