From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: trentbuck@gmail.com (Trent W. Buck) Newsgroups: gmane.emacs.bugs Subject: bug#22222: 24.4; eww ignores charset=utf-8 in text/plain Date: Tue, 22 Dec 2015 15:11:56 +1100 Message-ID: <87d1tzw29f.fsf@gmail.com> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: ger.gmane.org 1450757604 21260 80.91.229.3 (22 Dec 2015 04:13:24 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Tue, 22 Dec 2015 04:13:24 +0000 (UTC) To: 22222@debbugs.gnu.org Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Dec 22 05:13:13 2015 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1aBEJz-00037i-NU for geb-bug-gnu-emacs@m.gmane.org; Tue, 22 Dec 2015 05:13:11 +0100 Original-Received: from localhost ([::1]:48634 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aBEJz-0004Y4-5L for geb-bug-gnu-emacs@m.gmane.org; Mon, 21 Dec 2015 23:13:11 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:34823) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aBEJv-0004Xt-5b for bug-gnu-emacs@gnu.org; Mon, 21 Dec 2015 23:13:08 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1aBEJq-0001Ta-FY for bug-gnu-emacs@gnu.org; Mon, 21 Dec 2015 23:13:07 -0500 Original-Received: from debbugs.gnu.org ([208.118.235.43]:51718) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aBEJq-0001TW-C4 for bug-gnu-emacs@gnu.org; Mon, 21 Dec 2015 23:13:02 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84) (envelope-from ) id 1aBEJq-0002tS-8z for bug-gnu-emacs@gnu.org; Mon, 21 Dec 2015 23:13:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: trentbuck@gmail.com (Trent W. Buck) Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 22 Dec 2015 04:13:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 22222 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: X-Debbugs-Original-To: bug-gnu-emacs@gnu.org Original-Received: via spool by submit@debbugs.gnu.org id=B.145075754711072 (code B ref -1); Tue, 22 Dec 2015 04:13:02 +0000 Original-Received: (at submit) by debbugs.gnu.org; 22 Dec 2015 04:12:27 +0000 Original-Received: from localhost ([127.0.0.1]:59320 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aBEJG-0002sW-Mu for submit@debbugs.gnu.org; Mon, 21 Dec 2015 23:12:27 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:43728) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aBEJF-0002sJ-59 for submit@debbugs.gnu.org; Mon, 21 Dec 2015 23:12:25 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1aBEJ5-0001Oi-9c for submit@debbugs.gnu.org; Mon, 21 Dec 2015 23:12:19 -0500 Original-Received: from lists.gnu.org ([2001:4830:134:3::11]:44346) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aBEJ5-0001Oe-6O for submit@debbugs.gnu.org; Mon, 21 Dec 2015 23:12:15 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:34651) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aBEIx-0004Rg-36 for bug-gnu-emacs@gnu.org; Mon, 21 Dec 2015 23:12:15 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1aBEIs-0001Kp-9C for bug-gnu-emacs@gnu.org; Mon, 21 Dec 2015 23:12:06 -0500 Original-Received: from mail-pf0-x229.google.com ([2607:f8b0:400e:c00::229]:33054) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aBEIs-0001KD-0k for bug-gnu-emacs@gnu.org; Mon, 21 Dec 2015 23:12:02 -0500 Original-Received: by mail-pf0-x229.google.com with SMTP id n128so82708545pfn.0 for ; Mon, 21 Dec 2015 20:12:01 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:to:subject:date:message-id:mime-version:content-type; bh=BaG2F2eGN8ReilkdnYjcG8TF/2WMDwNVJNFvFcpUWWM=; b=c7Yf6WUvmBGcw4/9VNy8Lts3jb/3CbvvXpm04Z49pl7tiRHjxasW+CG8ZSqT4fhlA3 L3LDko4kTkrTAMNohBmas7fYkXnMhXLzURvNLEOPRWCUnJqWUGXrgfbDPxKrjGka0kz/ UlUnB0nKLFZJ50TQa5LKTjANztZWMJ4AiFpKGP5sfrNHz/R66j7yrtv3DxEuj+XMoZS4 XLoQ05YszcbLEJyv/6nOTHcMLQINK4qiLDO0BFSrnFFWiL5GmpEg3RiFOef0zW5ddtxB M1g+/9f454HQnYqCnHbOkwYzYs8Sr5eu5NxSivK5bOGmAYrYYd7+ek2pjxtdi+fjEsnH ZuYg== X-Received: by 10.98.19.9 with SMTP id b9mr31968315pfj.28.1450757521285; Mon, 21 Dec 2015 20:12:01 -0800 (PST) Original-Received: from localhost ([203.7.155.120]) by smtp.gmail.com with ESMTPSA id g68sm37613536pfg.9.2015.12.21.20.11.58 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 21 Dec 2015 20:12:00 -0800 (PST) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:110269 Archived-At: In #emacs on Freenode, "snotglob" reported that When I open this doc with eww https://raw.githubusercontent.com/howardabrams/pdx-emacs-hackers/master/workshops/keyboard-macros.org it appears that some apostrophes are shown as \342\200\231 I can reproduce his fault. I'm using UTF-8 in all the places it can be used. That URL has this header field: Content-Type: text/plain; charset=utf-8 While eww-render correctly extracts the charset, it doesn't use it: (eww-display-raw) compare: (eww-display-html charset url nil point) I have not actually tested HEAD, but the issue appears to be present from RTFS: http://git.savannah.gnu.org/cgit/emacs.git/tree/lisp/net/eww.el#n347 In GNU Emacs 24.4.1 (x86_64-pc-linux-gnu) of 2014-10-26 on trouble, modified by Debian System Description: Debian GNU/Linux 8.0 (jessie) Configured using: `configure --build x86_64-linux-gnu --prefix=/usr --sharedstatedir=/var/lib --libexecdir=/usr/lib --localstatedir=/var/lib --infodir=/usr/share/info --mandir=/usr/share/man --with-pop=yes --enable-locallisppath=/etc/emacs24:/etc/emacs:/usr/local/share/emacs/24.4/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/24.4/site-lisp:/usr/share/emacs/site-lisp --build x86_64-linux-gnu --prefix=/usr --sharedstatedir=/var/lib --libexecdir=/usr/lib --localstatedir=/var/lib --infodir=/usr/share/info --mandir=/usr/share/man --with-pop=yes --enable-locallisppath=/etc/emacs24:/etc/emacs:/usr/local/share/emacs/24.4/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/24.4/site-lisp:/usr/share/emacs/site-lisp --with-x=no --without-gconf --without-gsettings 'CFLAGS=-g -O2 -fstack-protector-strong -Wformat -Werror=format-security -Wall' CPPFLAGS=-D_FORTIFY_SOURCE=2 LDFLAGS=-Wl,-z,relro' Important settings: value of $LC_COLLATE: C value of $LANG: en_AU.utf8 locale-coding-system: utf-8-unix Major mode: Fundamental Minor modes in effect: xterm-mouse-mode: t ido-everywhere: t savehist-mode: t icomplete-mode: t show-paren-mode: t delete-selection-mode: t tooltip-mode: t file-name-shadow-mode: t global-font-lock-mode: t font-lock-mode: t auto-composition-mode: t auto-encryption-mode: t auto-compression-mode: t column-number-mode: t line-number-mode: t transient-mark-mode: t Recent input: Recent messages: Loading /usr/share/emacs/site-lisp/magit/magit-install.el (source)...done Loading /etc/emacs/site-start.d/50magit.el (source)...done Loading /etc/emacs/site-start.d/50pylint.el (source)... Loading pylint...done Loading /etc/emacs/site-start.d/50pylint.el (source)...done Loading /etc/emacs/site-start.d/50python-docutils.el (source)...done Loading /etc/emacs/site-start.d/50w3m-el-snapshot.el (source)...done Loading /etc/emacs/site-start.d/51debian-el.el (source)...done Loading term/xterm...done For information about GNU Emacs and the GNU system, type C-h C-a. Load-path shadows: /usr/share/emacs/24.4/site-lisp/debian-startup hides /usr/share/emacs/site-lisp/debian-startup /usr/share/emacs/site-lisp/rst hides /usr/share/emacs/24.4/lisp/textmodes/rst /usr/share/emacs24/site-lisp/dictionaries-common/ispell hides /usr/share/emacs/24.4/lisp/textmodes/ispell /usr/share/emacs24/site-lisp/dictionaries-common/flyspell hides /usr/share/emacs/24.4/lisp/textmodes/flyspell Features: (shadow sort gnus-util mail-extr emacsbug message format-spec rfc822 mml easymenu mml-sec mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader xterm advice sendmail rfc2047 rfc2045 ietf-drums mm-util help-fns mail-prsvr mail-utils jka-compr edmacro kmacro cl-loaddefs cl-lib disp-table xt-mouse ido savehist icomplete time-date paren delsel saveplace debian-el debian-el-loaddefs w3m-load pylint compile comint regexp-opt ansi-color ring tool-bar 50magit emacs-goodies-el emacs-goodies-custom emacs-goodies-loaddefs easy-mmode dpkg-dev-el dpkg-dev-el-loaddefs tooltip electric uniquify ediff-hook vc-hooks lisp-float-type tabulated-list newcomment lisp-mode prog-mode register page menu-bar rfn-eshadow timer select mouse jit-lock font-lock syntax facemenu font-core frame cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean japanese hebrew greek romanian slovak czech european ethiopic indian cyrillic chinese case-table epa-hook jka-cmpr-hook help simple abbrev minibuffer nadvice loaddefs button faces cus-face macroexp files text-properties overlay sha1 md5 base64 format env code-pages mule custom widget hashtable-print-readable backquote make-network-process dbusbind gfilenotify multi-tty emacs) Memory information: ((conses 16 102548 4949) (symbols 48 19418 0) (miscs 40 41 71) (strings 32 25459 5386) (string-bytes 1 684956) (vectors 16 9215) (vector-slots 8 355333 14722) (floats 8 69 208) (intervals 56 261 0) (buffers 960 11) (heap 1024 8764 721))