From: Eli Zaretskii
Newsgroups: gmane.emacs.devel
Subject: Re: Use the Unicode replacement character for replacing unencodable characters into UTF-16
Date: Tue, 18 Aug 2020 19:19:52 +0300
Message-ID: <83y2mcdizb.fsf@gnu.org>
References: <7399CD85-E381-4BE6-81D6-10AA9FC56685@acm.org>
To: Mattias Engdegård
Cc: emacs-devel@gnu.org
In-Reply-To:
 <7399CD85-E381-4BE6-81D6-10AA9FC56685@acm.org> (message from Mattias Engdegård on Tue, 18 Aug 2020 17:36:10 +0200)

> Feedback-ID: mattiase@acm.or
> From: Mattias Engdegård
> Date: Tue, 18 Aug 2020 17:36:10 +0200
>
> The attached patch makes sure that non-Unicode characters are replaced with U+FFFD REPLACEMENT CHARACTER instead of a space when converting to UTF-16. (The space is from all evidence a historical accident.)

Can you describe under which circumstances this default-character will be used? The issue that bothers me is whether U+FFFD can appear in situations where it cannot be displayed by Emacs, because then the result will be more confusing than helpful.