[INDOLOGY] Search interface for the GRETIL Corpus

Dominik Wujastyk wujastyk at gmail.com
Tue Apr 22 16:50:16 UTC 2025


Thank you, Claudius, this is a very helpful new search interface.

I've been thinking recently about the long-term lifetimes of various
digital projects.

The way you have implemented this as a static site hosted at Gitlab means
that it is not dependent on anything except the persistence of gitlab
itself.  There are no fees to be paid for a DNS or webspace, no software to
be personally maintained.  In twenty, thirty years, this search interface
is likely to still be available, even if your personal interests have moved
on.

Thank you!
Best,
Dominik


--
Dominik Wujastyk, Professor Emeritus, Classical Indian History
University of Alberta

"The University of Alberta is committed to the pursuit of truth,
the advancement of learning, and the dissemination of knowledge
through teaching, research and other scholarly and creative activities and
service."
-- Collective Agreement
<https://www.ualberta.ca/human-resources-health-safety-environment/media-library/my-employment/agreements/2020-2024-collective-agreement---working-version.pdf>
3.01



On Tue, 22 Apr 2025 at 03:02, Claudius Teodorescu via INDOLOGY <
indology at list.indology.info> wrote:

> Dear all,
>
> During the last months, I managed to set a search interface for the texts
> of the GRETIL Corpus, located at [1]. The interface is published as a
> static website, with a static full-text index and a static search engine,
> which execute the search in the browser, without the need for a server.
>
> In order to convert the files to HTML format, which is used to display
> them in the search interface, I had to make some small updates to the XML
> files of the corpus. These changes are documented in [2]. As one expects,
> there is still work to be done with the XML files of the corpus.
>
> Please let me know if you find any bugs with the search interface.
>
> Best regards,
> Claudius Teodorescu
>
> [1] https://claudius-teodorescu.gitlab.io/gretil-corpus-site/
> [2] https://gitlab.com/claudius-teodorescu/gretil-corpus-data
>
> _______________________________________________
> INDOLOGY mailing list
> INDOLOGY at list.indology.info
> https://list.indology.info/mailman/listinfo/indology
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://list.indology.info/pipermail/indology/attachments/20250422/04819e6d/attachment.htm>


More information about the INDOLOGY mailing list