Announcement of a Sanskrit Search Engine

amba kulkarni ambapradeep at GMAIL.COM
Thu Mar 22 16:19:12 UTC 2012


Dear list,

Greetings on the Nandana nāma samvatsaraḥ.

The Department of Sanskrit Studies, University of Hyderabad, announces the
release of the FIRST Search Engine for Sanskrit -- गवेषिका, available at
http://sanskrit.uohyd.ernet.in:8080/searchengine

This search engine integrates a Sanskrit morphological analyser with the
basic search engine, so that a query can also match inflected forms of the
search term.

*Features:*

    The Sanskrit Search Engine गवेषिका has the following search options:

     a)   Basic Search: This is a basic search feature where you can search
for a bare string "as it is".
     b)  Search with regular expressions: This allows one to search for a
string with various patterns, as shown below.
          i)  str1 AND str2: searches for texts in which both str1 and
str2 occur.
         ii) str1 OR str2: searches for texts in which either str1 or str2,
or both, occur.
        iii) str1 +str2: occurrences where str2 IS present and str1 is
optional.
        iv) "str1 str2": literal string "str1 str2"

          In addition, you may use the wildcard characters '?' and '*'.

      c)  Search with morphology enabled: This allows one to search for
strings with variations in their forms due to inflectional suffixes.
            E.g. given राम as a प्रातिपदिकम् (nominal stem) with gender
पुंलिङ्गम् (masculine), the machine searches for occurrences of the various
nominal declensions of राम, such as रामेण, रामस्य etc., while also allowing
for spelling variations (such as अंकः / अङ्कः).
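
The search options above can be illustrated with a small Python sketch. It is
not the code behind गवेषिका, which builds on Lucene and the consortium's
morphological analyser and generator; everything named here (DOCS, FORMS,
normalize, matches and the search_* helpers) is hypothetical. The paradigm
table is a hand-written stand-in for the generator, and the spelling fold is
a crude approximation that only maps a nasal consonant plus virama before
another consonant to an anusvara.

```python
import re
import unicodedata

# Hypothetical toy corpus; the real engine indexes much larger corpora.
DOCS = {
    "d1": "रामेण रावणः हतः",
    "d2": "रामस्य अङ्कः",
    "d3": "सीता वनं गच्छति",
}

# Hypothetical paradigm fragment standing in for a morphological generator.
FORMS = {
    ("राम", "m"): ["रामः", "रामम्", "रामेण", "रामाय", "रामात्", "रामस्य", "रामे"],
}


def normalize(text):
    """Crudely fold spellings such as अङ्कः to the anusvara spelling अंकः."""
    text = unicodedata.normalize("NFC", text)
    # nasal consonant + virama, followed by a consonant in क..म -> anusvara
    return re.sub("[\u0919\u091e\u0923\u0928\u092e]\u094d(?=[\u0915-\u092e])",
                  "\u0902", text)


def wildcard_to_regex(pattern):
    """'?' matches one code point, '*' any run; everything else is literal."""
    parts = []
    for ch in pattern:
        if ch == "?":
            parts.append(".")
        elif ch == "*":
            parts.append(".*")
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts))


def matches(term, text):
    """True if any whitespace-separated token of text matches term."""
    rx = wildcard_to_regex(normalize(term))
    return any(rx.fullmatch(tok) for tok in normalize(text).split())


def search_and(a, b):
    return sorted(d for d, t in DOCS.items() if matches(a, t) and matches(b, t))


def search_or(a, b):
    return sorted(d for d, t in DOCS.items() if matches(a, t) or matches(b, t))


def search_required(optional, required):
    # "optional +required": only the required term decides membership;
    # the optional term could at most influence ranking, not modelled here.
    return sorted(d for d, t in DOCS.items() if matches(required, t))


def search_morph(stem, gender):
    """Expand the stem into its inflected forms and match any of them."""
    forms = FORMS[(stem, gender)]
    return sorted(d for d, t in DOCS.items() if any(matches(f, t) for f in forms))


if __name__ == "__main__":
    print(search_morph("राम", "m"))
    print(search_or("राम*", "सीता"))
```

In this toy setting, searching for अंकः also finds the text spelled अङ्कः,
because both the query and the texts pass through the same fold before
comparison. Note that '?' here matches a single Unicode code point, so a
vowel sign counts as its own character.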

*Acknowledgment:*
We acknowledge with thanks the following software tools and corpora used
for the development of the Search Engine.

   a) Lucene, a free search engine library available under the Apache
License at http://lucene.apache.org/core/
   b) A morphological analyser and a generator developed by the consortium
of 7 institutes led by the Department under the project
       "Development of Sanskrit computational toolkit and Sanskrit-Hindi
Machine Translation system" funded by the TDIL programme of DIT (2008-2012).
   c) Gretil Corpus available at
http://fiindolo.sub.uni-goettingen.de/gretil.htm
   d) Critical edition of Mahabharat available at Bhandarkar Oriental
Research Institute, Pune  http://bori.ac.in/
   e) Corpus developed by Peter Scharf and available at the Sanskrit Library
project http://sanskritlibrary.org/
   f) Corpus developed by Oliver Hellwig and available at DCS
http://kjc-fs-cluster.kjc.uni-heidelberg.de/dcs/
   g) Corpus developed by the Department of Sanskrit Studies, University of
Hyderabad.
   h) Corpus developed by the Sanskrit Consortium, funded by the TDIL
programme, DIT (2008-2012).

*Contributors:*
    Sri Gowri
    Karunakar

*Guidance:*
Amba Kulkarni, Head, Department of Sanskrit Studies, University of
Hyderabad.

*Contact:*
amrutham.gowri at gmail.com
kannaiah.chinni at gmail.com
ambapradeep at gmail.com



