Return-Path: Received: from relay2.vsu.ru ([62.76.169.17] verified) by vsu.ru (CommuniGate Pro SMTP 3.3.1) with ESMTP id 1818184 for CyrTeX-en@vsu.ru; Sat, 19 Aug 2000 11:41:45 +0400 Received: by relay2.vsu.ru (Postfix, from userid 5) id 2D174185E; Sat, 19 Aug 2000 11:41:40 +0400 (MSD) Received: (from vvv@localhost) by vvv.vsu.ru (8.9.3/8.9.3) id LAA24006; Sat, 19 Aug 2000 11:38:30 +0400 X-Authentication-Warning: vvv.vsu.ru: vvv set sender to vvv@vvv.vsu.ru using -f To: CyrTeX-en@vsu.ru Subject: Re: ISO 8859-5 References: From: Vladimir Volovich Date: 19 Aug 2000 11:38:29 +0400 In-Reply-To: Laurent Siebenmann's message of "Sat, 19 Aug 2000 07:22:06 +0200" Message-ID: User-Agent: Gnus/5.0803 (Gnus v5.8.3) Emacs/20.4 MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii "LS" == Laurent Siebenmann writes: LS> Bad news, the troika 866, 1251, KOI8 had an air of tolerable LS> complexity. complexity in the number of encodings (3)? then, latin encodings have a non-smaller complexity: latin1, latin2, latin3, latin4, latin5, etc encodings... after all, one can use on universal encoding utf-8 these days for texts in any language. i've just put an experimental utf-8 input encoding support to CTAN:macros/latex/contrib/supported/t2/etc/utf-8/ it should work with "ordinary" TeX (Omega not required). currently, latin and cyrillic scripts are supported, i.e. if a text file contains accented latin + cyrillic at once, then it could be processed directly by TeX. it is not hard to add support for other scripts (e.g. greek). LS> Malyshev has listed DEC as using ISO 8859-5. VMS as well as UNIX LS> I presume. Do you know another major unix flavor that uses ISO LS> 8859-5? SUN (Solaris) and IBM (AIX) unixes support cyrillic in ISO 8859-5 (now it is changing, -- other encoding options are being added). LS> If anyone listening could spot the codes for any of << >> < > ,, LS> `` '' and the number sign for DEC and followers, I would be LS> grateful. The standard itself refers only to letters, I have LS> added it to yesterday's ftp posting. ISO 8859-5 does not seem to contain any guillemets (either single or double) and quotes listed above. numero sign has decimal code 240 in ISO 8859-5. >> cyrillic text exchanges abroad russia LS> Could you give an example? I am not sure I understand. i meant primarily software (mail readers, browsers, etc) which supported only standard ISO encodings but ignored encodings such as koi8-r or ms codepages. it was mainly developped not in russia. Best regards, -- Vladimir.