Abhidharmakosha in e-format
Paul G. Hackett
ph2046 at COLUMBIA.EDU
Thu Feb 4 00:53:33 UTC 2010
> even that has to be
> changed to utf-8 for the Chinese to appear properly.
That's true, Dan, if you want to cut-and-paste from the webpage, but if
you simply download the html "source" files for each chapter from the
index page, you can get the data without having to modify your browser
settings.
If you open the resulting file(s) in a plain text editor (one that can
read UTF-8, of course), you can see that the individual lines are
consistently tagged. It should be a simple matter to write a short
macro or Perl program to extract just the Sanskrit, if desired.
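
As a rough illustration of the kind of short program meant here, below is a minimal sketch in Python rather than Perl. The tag format is an assumption: the post does not say how the lines are tagged, so the pattern `<p class="skt">` and the function names `extract_sanskrit` and `extract_from_file` are hypothetical placeholders. Check the downloaded source and adjust the pattern to whatever tag actually marks the Sanskrit lines.

```python
import html
import re

# ASSUMPTION: Sanskrit lines look like <p class="skt">...</p>.
# This tag is hypothetical; inspect the downloaded file and change the
# pattern to match the tagging it really uses.
SANSKRIT_LINE = re.compile(r'<p class="skt">(.*?)</p>')


def extract_sanskrit(text):
    """Return the contents of every line carrying the assumed Sanskrit tag."""
    results = []
    for line in text.splitlines():
        match = SANSKRIT_LINE.search(line)
        if match:
            # Drop any nested markup, then decode entities such as &amp;.
            inner = re.sub(r"<[^>]+>", "", match.group(1))
            results.append(html.unescape(inner).strip())
    return results


def extract_from_file(path):
    """Read a saved chapter file as UTF-8 and extract the tagged lines."""
    with open(path, encoding="utf-8") as handle:
        return extract_sanskrit(handle.read())
```

Calling `extract_from_file` on each saved chapter returns a list of strings that can be written out one per line. Reading the file explicitly as UTF-8 matters here, since that is what keeps the non-ASCII text intact.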