From mboxrd@z Thu Jan  1 00:00:00 1970
Path: news.gmane.org!not-for-mail
From: Ted Zlatanov <tzz@lifelogs.com>
Newsgroups: gmane.emacs.devel
Subject: Re: smtpmail and ~/.authinfo
Date: Tue, 27 Sep 2011 15:15:24 -0500
Organization: =?utf-8?B?0KLQtdC+0LTQvtGAINCX0LvQsNGC0LDQvdC+0LI=?= @ Cienfuegos
Message-ID: <87bou5vj1f.fsf@lifelogs.com>
References: <83ippsqsz8.fsf@gnu.org> <m3mxf3nztb.fsf@stories.gnus.org>
	<E1Qv1Gb-000872-0E@fencepost.gnu.org>
	<m3obzik1mk.fsf@stories.gnus.org> <83hb5ay1rs.fsf@gnu.org>
	<87litc7qen.fsf@lifelogs.com> <E1R7o7r-0007ZY-Uv@fencepost.gnu.org>
	<87pqio69lr.fsf@lifelogs.com> <m3vcsf18p1.fsf@stories.gnus.org>
	<87vcsf1537.fsf@lifelogs.com> <m3hb3zw187.fsf@stories.gnus.org>
	<8739fj13vb.fsf@lifelogs.com> <jwv7h4vdmbt.fsf-monnier+emacs@gnu.org>
	<m3sjnjkmj7.fsf@stories.gnus.org>
	<87bou639wx.fsf@uwakimon.sk.tsukuba.ac.jp>
	<878vpauvm9.fsf@lifelogs.com>
	<8762ke2mhq.fsf@uwakimon.sk.tsukuba.ac.jp>
Reply-To: emacs-devel@gnu.org
NNTP-Posting-Host: lo.gmane.org
Mime-Version: 1.0
Content-Type: text/plain
X-Trace: dough.gmane.org 1317154559 2189 80.91.229.12 (27 Sep 2011 20:15:59 GMT)
X-Complaints-To: usenet@dough.gmane.org
NNTP-Posting-Date: Tue, 27 Sep 2011 20:15:59 +0000 (UTC)
To: emacs-devel@gnu.org
Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Tue Sep 27 22:15:55 2011
Return-path: <emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org>
Envelope-to: ged-emacs-devel@m.gmane.org
Original-Received: from lists.gnu.org ([140.186.70.17])
	by lo.gmane.org with esmtp (Exim 4.69)
	(envelope-from <emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org>)
	id 1R8e4H-0001Wn-EX
	for ged-emacs-devel@m.gmane.org; Tue, 27 Sep 2011 22:15:53 +0200
Original-Received: from localhost ([::1]:56933 helo=lists.gnu.org)
	by lists.gnu.org with esmtp (Exim 4.71)
	(envelope-from <emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org>)
	id 1R8e4G-00012P-Vr
	for ged-emacs-devel@m.gmane.org; Tue, 27 Sep 2011 16:15:52 -0400
Original-Received: from eggs.gnu.org ([140.186.70.92]:37399)
	by lists.gnu.org with esmtp (Exim 4.71)
	(envelope-from <ged-emacs-devel@m.gmane.org>) id 1R8e4D-000128-PH
	for emacs-devel@gnu.org; Tue, 27 Sep 2011 16:15:51 -0400
Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
	(envelope-from <ged-emacs-devel@m.gmane.org>) id 1R8e4B-0005GD-Bm
	for emacs-devel@gnu.org; Tue, 27 Sep 2011 16:15:49 -0400
Original-Received: from lo.gmane.org ([80.91.229.12]:41603)
	by eggs.gnu.org with esmtp (Exim 4.71)
	(envelope-from <ged-emacs-devel@m.gmane.org>) id 1R8e4B-0005Fz-32
	for emacs-devel@gnu.org; Tue, 27 Sep 2011 16:15:47 -0400
Original-Received: from list by lo.gmane.org with local (Exim 4.69)
	(envelope-from <ged-emacs-devel@m.gmane.org>) id 1R8e4A-0001TS-58
	for emacs-devel@gnu.org; Tue, 27 Sep 2011 22:15:46 +0200
Original-Received: from 38.98.147.133 ([38.98.147.133])
	by main.gmane.org with esmtp (Gmexim 0.1 (Debian))
	id 1AlnuQ-0007hv-00
	for <emacs-devel@gnu.org>; Tue, 27 Sep 2011 22:15:46 +0200
Original-Received: from tzz by 38.98.147.133 with local (Gmexim 0.1 (Debian))
	id 1AlnuQ-0007hv-00
	for <emacs-devel@gnu.org>; Tue, 27 Sep 2011 22:15:46 +0200
X-Injected-Via-Gmane: http://gmane.org/
Mail-Followup-To: emacs-devel@gnu.org
Original-Lines: 113
Original-X-Complaints-To: usenet@dough.gmane.org
X-Gmane-NNTP-Posting-Host: 38.98.147.133
X-Face: bd.DQ~'29fIs`T_%O%C\g%6jW)yi[zuz6;
	d4V0`@y-~$#3P_Ng{@m+e4o<4P'#(_GJQ%TT=
	D}[Ep*b!\e,fBZ'j_+#"Ps?s2!4H2-Y"sx"
Mail-Copies-To: never
User-Agent: Gnus/5.110018 (No Gnus v0.18) Emacs/24.0.90 (gnu/linux)
Cancel-Lock: sha1:5EoHIJIihITrcJACvNVH7+WMGXI=
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3)
X-Received-From: 80.91.229.12
X-BeenThere: emacs-devel@gnu.org
X-Mailman-Version: 2.1.14
Precedence: list
List-Id: "Emacs development discussions." <emacs-devel.gnu.org>
List-Unsubscribe: <https://lists.gnu.org/mailman/options/emacs-devel>,
	<mailto:emacs-devel-request@gnu.org?subject=unsubscribe>
List-Archive: </archive/html/emacs-devel>
List-Post: <mailto:emacs-devel@gnu.org>
List-Help: <mailto:emacs-devel-request@gnu.org?subject=help>
List-Subscribe: <https://lists.gnu.org/mailman/listinfo/emacs-devel>,
	<mailto:emacs-devel-request@gnu.org?subject=subscribe>
Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org
Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org
Xref: news.gmane.org gmane.emacs.devel:144422
Archived-At: <http://permalink.gmane.org/gmane.emacs.devel/144422>

On Tue, 27 Sep 2011 21:33:37 +0900 "Stephen J. Turnbull" <stephen@xemacs.org> wrote: 

SJT> Ted Zlatanov writes:
>> UTF-8 is an encoding; you're talking about charsets.

SJT> No, I'm talking about encodings.  I'm not entirely sure about GB 2312,
SJT> but I believe it has a defined preferred encoding (the one registered
SJT> as the MIME charset GB2312 -- MIME charsets are all encodings, they
SJT> specify what *bytes* will appear in the stream, not just an abstract
SJT> character to abstract integer mapping).  Shift JIS is most definitely
SJT> an encoding for the JIS character set (although which JIS character
SJT> set is poorly defined).

Thanks for correcting my misunderstanding.

SJT> If you already have a password, it should be read verbatim (binary, or
SJT> raw-text should do given the line-oriented nature of these
SJT> configuration files) and treated as a binary blob.
>> 
>> That's not helpful when you need to encode it for IMAP, for instance.
>> You have to know the actual characters that make up the binary blob.

SJT> Since when?  I haven't paid much attention to IMAP since RFC 3501 was
SJT> an internet-draft, but in that document there are a few commands that
SJT> accept a CHARSET parameter.  LOGIN and AUTHENTICATE aren't among them.
SJT> So you're just passing along binary blobs, which in the case of LOGIN
SJT> will often look like somebody's birthday or a child's name, but that's
SJT> just an unfortunate accident.

Ditto.  I thought the CHARSET was used for passwords.

On Tue, 27 Sep 2011 07:31:23 -0400 Eli Zaretskii <eliz@gnu.org> wrote: 

>> From: Ted Zlatanov <tzz@lifelogs.com>
>> Date: Tue, 27 Sep 2011 05:38:28 -0500
>> Reply-To: emacs-devel@gnu.org
>> 
>> On Tue, 27 Sep 2011 05:57:28 +0300 Eli Zaretskii <eliz@gnu.org> wrote: 
>> 
>> >> From: Stefan Monnier <monnier@IRO.UMontreal.CA>
>> >> Date: Mon, 26 Sep 2011 17:31:52 -0400
>> >> 
>> >> I think raw-text is more likely to work, based on what Lars says.
>> 
EZ> That was also my conclusion.
>> 
>> I think we should make an effort to make the netrc/authinfo file
>> shareable with other programs

EZ> I agree.  But to do that, it sounds like we are lacking some knowledge
EZ> about the intended use of these files, especially when they are used
EZ> in conjunction with external services.  If someone can prepare an
EZ> exhaustive list of such uses, or at least those we want to support,
EZ> and tell what encodings can be used with each of them, we can take it
EZ> from there the way you want it.  But if such details are not known at
EZ> the moment, we may actually break some legitimate uses, which would be
EZ> a pity.

I know for sure only ASCII (up to 0xff) is supported by libcurl and
older FTP clients.  I thought UTF-8 would be a good compatibility path
but apparently I'm wrong.

EZ> So I think you are being overly optimistic in asserting that UTF-8 is
EZ> "the safest choice".

OK.

EZ> You read "binary" incorrectly.  For the purposes of this discussion,
EZ> "binary" == "arbitrary byte values".  Not every 8-bit byte is valid as
EZ> part of a UTF-8 sequence.  If the authinfo file includes such bytes,
EZ> it cannot be encoded in UTF-8, except if we use the Emacs extensions,
EZ> which will be only useful for Emacs.  Such bytes can easily come from
EZ> some single-byte encoding, for example.  To DTRT with such bytes, we
EZ> _must_ know its precise encoding; then we could _recode_ it in UTF-8,
EZ> and encode back when we send the string to external services.

Got it.

On Tue, 27 Sep 2011 08:55:45 -0400 Stefan Monnier <monnier@iro.umontreal.ca> wrote: 

SM> Here's my take on it:
SM> .authinfo contains various things and is used in different ways, and
SM> there isn't a single answer that covers all cases:
SM> - each kind of field (hostname, username, password) may require
SM>   a different encoding/decoding.
SM> - when reading a password from the file, it should be read using
SM>   raw-text (i.e. as a "unibyte string").
SM>   In other words, the password should not be decoded into chars but left
SM>   as a sequence of bytes that will be sent as-is to whoever needs it.
SM> - when a password is typed by the user it'll be a sequence of chars, so
SM>   we'll have to convert it into a sequence of bytes.  The best coding
SM>   system to use for that purpose is probably going to be
SM>   locale-coding-system.  That sequence of bytes is then send to whoever
SM>   needs it and saved as-is (using raw-text) into the .authinfo file.
SM> - i.e. authinfo should be read as a unibyte file.
SM> - i.e. when reading other fields than passwords, we'll have to
SM>   explicitly decode them using the coding system we want to use for
SM>   those fields.
SM> - similarly, we'll have to encode those other fields manually when
SM>   writing them into .authinfo.

SM> Of course, another option is to just read&write authinfo without
SM> thinking about it, so Emacs will usually pick locale-coding-system for
SM> it and it'll work just fine in 99.9% of the cases.

It sounds like the latter option is the least work and most reliable.
Users should be able to override the coding system as with any other
file, and we'll just keep the status quo.  I appreciate all the details
and corrections; I thought UTF-8 was better and more widely useful than
it really is.

Thanks
Ted