From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Peter Dyballa Newsgroups: gmane.emacs.help Subject: Re: Single unrecognized character wrecks entire display Date: Fri, 24 Aug 2012 17:01:49 +0200 Message-ID: References: NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 (Apple Message framework v1084) Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1345820534 23666 80.91.229.3 (24 Aug 2012 15:02:14 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 24 Aug 2012 15:02:14 +0000 (UTC) Cc: help-gnu-emacs@gnu.org To: Alexandre Oberlin Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Fri Aug 24 17:02:15 2012 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1T4vOp-00052F-7k for geh-help-gnu-emacs@m.gmane.org; Fri, 24 Aug 2012 17:02:15 +0200 Original-Received: from localhost ([::1]:44693 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1T4vOn-0005py-Jc for geh-help-gnu-emacs@m.gmane.org; Fri, 24 Aug 2012 11:02:13 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:45310) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1T4vOd-0005Xc-KN for help-gnu-emacs@gnu.org; Fri, 24 Aug 2012 11:02:09 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1T4vOS-0000AF-Aa for help-gnu-emacs@gnu.org; Fri, 24 Aug 2012 11:02:00 -0400 Original-Received: from mout.web.de ([212.227.17.11]:60280) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1T4vOS-00009w-0p for help-gnu-emacs@gnu.org; Fri, 24 Aug 2012 11:01:52 -0400 Original-Received: from [192.168.178.8] ([95.222.201.211]) by smtp.web.de (mrweb103) with ESMTPA (Nemesis) id 0Martm-1TOFSZ3cJm-00JoFo; Fri, 24 Aug 2012 17:01:49 +0200 In-Reply-To: X-Mailer: Apple Mail (2.1084) X-Provags-ID: V02:K0:vmDAcGuLa8WKb4dfDQK8iQWqRTelNCb9JnrGqv9a2DD mIK8WRMf0SqaR5gZ75GTq/Jz8qv2qsU9uYR/ag970AMXjBMd7I 0YT6uvHsHeDvGfoiLQ6q30/D/M+YZPTjNRmyEAcejiU/GvSxDz 6MWaA3HPTaP2axv2fXYoSxxE1j9CJSu+31kLX+TfBKkTm1QHIw BBiuI2grLqUXtf+/L2swg== X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 212.227.17.11 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:86495 Archived-At: Am 24.08.2012 um 15:46 schrieb Alexandre Oberlin: > iconv acts just the same. It tells me the 13th character is faulty = (\351), while only the 40th is (\234) Alexandre, I think you're making here the same mistake as I did before! \351 is not = the number of the character in the Unicode encoding but an UTF-8 byte. = The UTF encodings are multi-byte encodings and therefore there cannot be = that byte \351 stands for character \351 (or 233 decimal or E9 = hexadecimal). Iconv and GNU Emacs obviously find some single isolated = bytes are spread into the text. This could also explain the different = counting: characters vs. bytes (13th vs. 40th). Could you try a native MS Losedos GNU Emacs? Could you send me privately such a translation output before GNU Emacs = or iconv have changed anything? Can it be that this output is not plain = text but some structured format containing these odd bytes you mentioned = initially which might switch font or emphasising or tell where a = paragraph ends or a footnote starts? -- Greetings Pete "By filing this bug report you have challenged the honor of my family. = Prepare to die!"