Dear all,
Dear list,
Greetings on the Nandana nāma samvatsaraḥ.
The Department of Sanskrit Studies, University of Hyderabad announces release of the FIRST Search Engine for Sanskrit -- गवेषिका available at
http://sanskrit.uohyd.ernet.in:8080/searchengine
This search engine integrates a Sanskrit morphological analyser with the basic search engine resulting in better search results.
Features:
The Sanskrit Search Engine गवेषिका has following search options
a) Basic Search:This is a basic search feature where you can search a bare string "as it as" .
b) Search with regular expressions:This allows one to search a string with various patterns as shown below.
i) str1 AND str2: searches for the instances of both str1 and str2 in the same text.
ii) str1 OR str2: searches for the instances of either or both occurences.
iii) str1 +str2: occurences where str2 IS present and optionally str1 may be.
iv) "str1 str2": literal string "str1 str2"
In addition you may use the wild characters '?' and '*'.
c) Search with morphology enabled: This allows one to search for strings with variations in their forms due to inflectional suffixes.
E.g. given राम as a प्रतिपदिकम् (nominal stem) with gender पुंलिङ्गम् the machine searches occurences of various nominal declensions of राम such as रामेण , रामस्य etc. allowing the spelling variations (such as अंकः / अङ्कः) as well.
Acknowledgment:
We acknowledge with thanks the following software tools and corpora used for the development of Search Engine.
a) Lucene a free Search Engine available under GPL at http://lucene.apache.org/core/
b) A morphological analyser and a generator developed by the consortium of 7 institutes led by the Department under the project
"Development of Sanskrit computational toolkit and Sanskrit-Hindi Machine Translation system" funded by TDIL programme of DIT(2008-2012).
c) Gretil Corpus available at http://fiindolo.sub.uni-goettingen.de/gretil.htm
d) Critical edition of Mahabharat available at Bhandarkar Oriental Research Institute, Pune http://bori.ac.in/
e) Corpus developed by Peter Scharf and available at Sanskrit Library http://sanskritlibrary.org/ project.
f) Corpus developed by Oliver Hellwig and available at DCS http://kjc-fs-cluster.kjc.uni-heidelberg.de/dcs/
g) Corpus developed by the Department of Sanskrit Studies, University of Hyderabad.
h) Corpus developed by the Sanskrit Consortium, funded by TDIL programme, DIT(2008-2012).
Contributors:
Sri Gowri
Karunakar
Guidance:
Amba Kulkarni, Head, Department of Sanskrit Studies, University of Hyderabad.
Contact:
amrutham.gowri@gmail.com
kannaiah.chinni@gmail.com
ambapradeep@gmail.com