From: Peter Dyballa
Newsgroups: gmane.emacs.help
Subject: Re: Encoding help
Date: Wed, 3 Jun 2009 19:58:39 +0200
Message-ID: <2C10CD43-EC63-43A2-B5B6-B5BE08AC6C78@Web.DE>
To: "B. T. Raven"
Cc: help-gnu-emacs@gnu.org

On 03.06.2009 at 19:35, B. T. Raven wrote:

> In the meanwhile I made a similar pdf with auctex and the .txt file
> produced by Adobe Reader is even more fragmented than the first
> one. I guess this is not surprising after the original .tex file
> goes through \usepackage[utf8x]{inputenc} and \usepackage{babel}.

As long as you're using pdfTeX you can be sure that the PDF file has
composed characters (input encoding plays no role, because it's just
an *input* encoding). With a CMAP (character mapping, see 'texdoc -s
cmap') and an 8-bit (or 7-bit) font encoding (T1, T2A, T2B, T2C, T5,
OT1, OT1tt, OT6, LGR, LAE, LFE) the composed characters can be mapped
to the ready-to-use (pre-composed) Unicode characters.

The use of XeTeX might be another option (its xdvipdfmx output driver
inserts CMAPs into the PDF file). Or another PDF viewer, one that
automatically reloads the updated PDF output file.
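To illustrate the CMAP route, a minimal preamble might look roughly like
the sketch below. It assumes a pdfLaTeX run and the T1 font encoding; the
plain utf8 input encoding (instead of utf8x), the babel option and the
sample text are only illustrative choices, not taken from your file.

```latex
\documentclass{article}
\usepackage{cmap}            % load early, before fontenc, so the CMAP tables get embedded
\usepackage[T1]{fontenc}     % assumption: T1, one of the font encodings cmap supports
\usepackage[utf8]{inputenc}  % assumption: plain utf8; input encoding does not affect the PDF
\usepackage[ngerman]{babel}  % assumption: illustrative language option
\begin{document}
Gr\"o\ss e, \"Ubung, fa\c{c}ade % accented sample text to test copy and paste from the PDF
\end{document}
```

Whether text copied from the resulting PDF comes out unfragmented still
depends on the fonts in use and on the viewer, so it is worth testing
with a short file like this one first.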
--
Greetings

  Pete

A common mistake that people make when trying to design something
completely foolproof is to underestimate the ingenuity of complete
fools.