Thank you Jan for identifying the problem.
Harry Spier

On Wed, Nov 27, 2024, 4:36 AM Jan Kučera <jan.kucera@ujca.cz> wrote:

And the reason for that is because the files contain invalid UTF-8 characters. You might be good without BOM if you fix the broken files.

 

As an example, in advayatArakopaniSat-M00102-IAST.txt, line 31 has “pūramada iti śāntiḥ” where the middle character is supposed to be dash I assume, but it is not UTF-8 encoded. I attach list of all the files with invalid characters.

 

Also note that the following links are broken on the individual files page:

 

kedārakalpa

tatvaśodhanam

bhattantrasāra

śrīcakrasādhanam

śrīvidyānityapaddhati

 

Hope this helps.

 

Best regards,

Jan Kučera

Institute of South and Central Asia Students, Prague

 

 

 

From: INDOLOGY <indology-bounces@list.indology.info> On Behalf Of Jan Kucera
Sent: středa 27. listopadu 2024 9:51
To: Harry Spier <vasishtha.spier@gmail.com>; indology@list.indology.info
Subject: Re: [INDOLOGY] Help needed from Indologist with a Mac.

 

Dear Harry,

 

I confirm that some of the files are opened with wrong encoding in TextEdit. Unfortunately, TextEdit refuses to open it with UTF-8 when you pick it manually.

 

You could fix this by including a BOM mark at the beginning of the file. Let me know if you need any assistance.

 

Thanks,

Jan

 

From: INDOLOGY <indology-bounces@list.indology.info> On Behalf Of Harry Spier via INDOLOGY
Sent: středa 27. listopadu 2024 2:39
To: indology@list.indology.info
Subject: [INDOLOGY] Help needed from Indologist with a Mac.

 

Dear list members,

I've just added a feature to the Muktabodha digital library where you can download individual files.  These are plain text files encoded in utf-8, but I'm getting some feedback that when some users download and open these text files, then some files display correctly but in other files the characters with diacriticals are not being displayed correctly.

 

 I think this is just a Mac problem and (I don't have a Mac and my windows computer displays correctly) and I've seen some postings about UTF-8 not working in TEXTEDIT. 

 

Could a Mac user download a few of the etexts from the individual file download

https://muktalib7.com/DL_CATALOG_ROOT/MUKTABODHA-LIBRARY-IAST/UTF8-TITLE-LINK-LIST.html

 

see if some display incorrectly and if they do then in TextEdit change the encoding from automatic to UTF-8 and re-open the file.

The instructions I've seen for doing this in  TextEdit are: 

In the TextEdit app  on your Mac, choose File > Open, then select the file (don’t open it).

Click Options in the lower-left corner of the window.
Click the Plain Text Encoding pop-up menu and choose an encoding.

 

Thanks,

Harry Spier