[INDOLOGY] Workshop "OCR AND BEYOND: Doing Philology, Building Structured Corpora, and Computer-aided Annotation Workflows in the Age of Generative AI"
Sebastian Nehrdich
nehrdbsd at gmail.com
Thu Mar 26 14:57:24 UTC 2026
Dear List members,
This is to draw your attention to a small event we are hosting here at
Tohoku University in Sendai this coming weekend. Online participation
is possible as well!
OCR AND BEYOND
Doing Philology, Building Structured Corpora, and Computer-aided
Annotation Workflows in the Age of Generative AI
Date:
March 28-29
Location:
Tohoku University
Katahira Campus
WPI-AIMR Main Building (B01)
Seminar Room, 2nd floor
Time zone:
JST
Registration for online participation:
https://docs.google.com/forms/d/e/1FAIpQLSc9NCF13PFBXLxN56b_KrhUzc7_xjvyQosNoUmfnjN0jUErNQ/viewform
Program
Saturday, March 28
10:00-10:05
Opening: Welcome
10:05-10:35
On Reading Śāstra with Claude: Practical Reflections on AI in Sanskrit Studies
Kei Kataoka
10:35-11:05
Indological Research in 2026: How AI is Helping, with Various Examples
Kengo Harimoto
11:05-11:35
“They build a hut for the sacrificer”:
Towards a Word-aligned Corpus of Vedic Prose
Oliver Hellwig
11:35-12:20
Discussion on Session 1 Themes
Lead: Kyoko Amano
12:20-13:20
Lunch
13:20-13:50
The Vedic Prose Corpus: Progress, Challenges, Visions
Sven Sellmer
13:50-14:20
eText Collections, Repositories, and Distribution:
Observations towards Quality Control
Julian Schott
14:20-14:50
Towards a Sustainable Dataset of Colophons, Paratexts, and Inscriptions
Ryan Conlon
14:50-15:35
Discussion on Session 2 Themes
Lead: Kengo Harimoto
15:35
Closing
Sunday, March 29
10:30-11:00
HTR for Nepalese Manuscripts in Pracalit Script
Alexander J. O’Neill
11:00-11:30
Who Is Speaking? Encoding and Detecting Quotation Layers in Japanese
Esoteric Buddhist Texts
Gaetan Rappo
11:30-12:00
Discussion on Session 3 Themes
Lead: Oliver Hellwig, Sven Sellmer
12:00-13:00
Lunch
13:00-13:30
A Preliminary Investigation of OCR Techniques for Tibetan Materials
Preserved at Tohoku University
Ryuta Kikuya
13:30-14:00
Seeing the Full Elephant: Vision Language Model and LLM Use at the
Dharmamitra Project
Sebastian Nehrdich
14:00-14:15
The Significance of Buddhist Studies in East Asian/Japanese Text Encoding
Kiyonori Nagasaki
14:15-15:00
Discussion on Session 4 Themes
Lead: Oliver Hellwig, Sven Sellmer
15:00
Closing
--
Sebastian Nehrdich
Distinguished Assistant Professor
Tohoku University
Center for Integrated Japanese Studies
東北大学 統合日本学センター
ディスティングイッシュトアシスタントプロフェッサー(助教)
〒980-8576
https://cijs.oii.tohoku.ac.jp/
sebastian-nehrdich.com
✉️ nehrdich at tohoku.ac.jp
☎️ 022-795-3822