[INDOLOGY] Search interface for the GRETIL Corpus
Arlo Griffiths
arlogriffiths at hotmail.com
Wed May 7 14:52:12 UTC 2025
Dear Claudius,
Thanks a lot for this initiative. Allow me to ask if it is also possible to resume absorbing texts into the same corpus?
Now that its Göttingen host no longer seems to be interested in curating it, why not store all files on GitHub or GitLab and initiate a collective INDOLOGY endeavor toward curating (txt > xml conversion) and expanding the corpus?
I write these words without having a full understanding of everything that would be required, but I'd certainly be interested in contributing.
Best wishes,
Arlo Griffiths
EFEO
________________________________
From: INDOLOGY <indology-bounces at list.indology.info> on behalf of Claudius Teodorescu via INDOLOGY <indology at list.indology.info>
Sent: Tuesday, April 22, 2025 9:00 AM
To: Indology <indology at list.indology.info>
Subject: [INDOLOGY] Search interface for the GRETIL Corpus
Dear all,
During the last few months, I managed to set up a search interface for the texts of the GRETIL Corpus, available at [1]. The interface is published as a static website, with a static full-text index and a static search engine that executes the search in the browser, without the need for a server.
In order to convert the files to HTML format, which is used to display them in the search interface, I had to make some small updates to the XML files of the corpus. These changes are documented in [2]. As one would expect, there is still work to be done on the XML files of the corpus.
Please let me know if you find any bugs with the search interface.
Best regards,
Claudius Teodorescu
[1] https://claudius-teodorescu.gitlab.io/gretil-corpus-site/
[2] https://gitlab.com/claudius-teodorescu/gretil-corpus-data