A misconception regarding the PDF format (Re: Text processing in Unicode
wujastyk at GMAIL.COM
Sun Mar 28 08:42:46 EDT 2010
Very impressionistically - I haven't done any real test testing - my
experience is that if I use Unicode for my source file, then I get a Unicode
PDF. So I can cut-and-paste and get all the diacritics. And if I do "save
as" plain text from PDF, I get a plain text file that's correctly Unicode
I'm using XeTeX.
PS Zdenek Wagner has done successful but still experimental work on getting
TeX + Velthuis Devnag => searchable Devanagari PDFs.
More information about the INDOLOGY