From: Jason Rumney
Newsgroups: gmane.emacs.devel
Subject: Re: I'm is really I'm
Date: Wed, 07 Jul 2010 12:19:38 +0800
Message-ID: <4C34005A.1010300@gnu.org>
References: <20100706.203803.12862906.hanche@math.ntnu.no>
To: emacs-devel@gnu.org

On 07/07/2010 09:57, Lennart Borgman wrote:
> I hoped there were some easy cases where some characters commonly used
> for typographic reasons could be replaced by more "wellknown"
> characters.
>

There are some filters in Gnus to handle this type of problem, but the problem you saw is different.

PDF allows fonts to be embedded in the document, and when this happens the mapping from character codes to glyphs gets optimised for that document, so there is no common standard to decode against. I've seen this before when attempting to copy and paste from a Japanese PDF document - there was no way of getting useful information out short of using OCR on a bitmap of the PDF reader's display.
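
As a rough illustration of why the stored codes are useless on their own, here is a toy model in Python. It is only a sketch, not how any real PDF writer or reader works: the function names (subset_encode, render, naive_copy) and the "number each glyph in order of first use" scheme are assumptions invented for the example.

```python
def subset_encode(text):
    """Toy model of an embedded, document-specific font.

    Each distinct character gets a small code in order of first use
    (an assumed scheme, chosen only to keep the example short).
    Returns the stored codes and the private code-to-glyph table.
    """
    code_for = {}
    codes = []
    for ch in text:
        if ch not in code_for:
            code_for[ch] = len(code_for) + 1  # codes 1, 2, 3, ...
        codes.append(code_for[ch])
    glyph_for = {code: ch for ch, code in code_for.items()}
    return codes, glyph_for


def render(codes, glyph_for):
    """What the reader draws: it has the embedded font, so glyphs are right."""
    return "".join(glyph_for[c] for c in codes)


def naive_copy(codes):
    """What copy and paste yields if the codes are treated as character codes."""
    return "".join(chr(c) for c in codes)


english_codes, english_font = subset_encode("I'm")
japanese_codes, japanese_font = subset_encode("日本語")

# Both documents store the same codes, yet display different text.
print(english_codes, japanese_codes)
print(render(english_codes, english_font), render(japanese_codes, japanese_font))
print(repr(naive_copy(english_codes)))
```

In this model the two documents contain identical code sequences, so the codes alone cannot tell you which text was meant. Only the private table travelling with the embedded font can, and if a document does not expose an equivalent reverse mapping, what is left is the rendered bitmap.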