* Re: Invalid read syntax for compiled bool vector [not found] ` <857jw35hcq.fsf@junk.nocrew.org> @ 2004-04-26 14:10 ` Richard Stallman 2004-04-26 16:08 ` Andreas Schwab 0 siblings, 1 reply; 12+ messages in thread From: Richard Stallman @ 2004-04-26 14:10 UTC (permalink / raw) Cc: emacs-devel Apparently, you have to bind coding-system-for-write before writing a source file with a literal bool-vector constant in it, or else Emacs will either ask the user for the coding system, or write the file using some default coding system which may not do the right thing. I guess we should change the syntax for bool-vectors so as to put just 4 bits into each character. The question is how to do that in a somewhat compatible way. ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-26 14:10 ` Invalid read syntax for compiled bool vector Richard Stallman @ 2004-04-26 16:08 ` Andreas Schwab 2004-04-26 17:47 ` Lars Brinkhoff 2004-04-27 16:28 ` Richard Stallman 0 siblings, 2 replies; 12+ messages in thread From: Andreas Schwab @ 2004-04-26 16:08 UTC (permalink / raw) Cc: Lars Brinkhoff, emacs-devel Richard Stallman <rms@gnu.org> writes: > Apparently, you have to bind coding-system-for-write before writing a > source file with a literal bool-vector constant in it, or else Emacs > will either ask the user for the coding system, or write the file > using some default coding system which may not do the right thing. > > I guess we should change the syntax for bool-vectors > so as to put just 4 bits into each character. > The question is how to do that in a somewhat compatible way. The print syntax could use octal or hexadecimal escapes in the bit string. Andreas. -- Andreas Schwab, SuSE Labs, schwab@suse.de SuSE Linux AG, Maxfeldstraße 5, 90409 Nürnberg, Germany Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different." ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-26 16:08 ` Andreas Schwab @ 2004-04-26 17:47 ` Lars Brinkhoff 2004-04-26 22:01 ` Andreas Schwab 2004-04-27 16:28 ` Richard Stallman 1 sibling, 1 reply; 12+ messages in thread From: Lars Brinkhoff @ 2004-04-26 17:47 UTC (permalink / raw) Cc: rms, emacs-devel Andreas Schwab <schwab@suse.de> writes: > Richard Stallman <rms@gnu.org> writes: > > Apparently, you have to bind coding-system-for-write before > > writing a source file with a literal bool-vector constant in > > it, or else Emacs will either ask the user for the coding > > system, or write the file using some default coding system > > which may not do the right thing. > > I guess we should change the syntax for bool-vectors so as to put > > just 4 bits into each character. The question is how to do that > > in a somewhat compatible way. > The print syntax could use octal or hexadecimal escapes in the bit string. Yes. Since the print syntax for bool-vectors looks like strings, I would sugggest doing whatever the print syntax for strings does. -- Lars Brinkhoff, Services for Unix, Linux, GCC, HTTP Brinkhoff Consulting http://www.brinkhoff.se/ ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-26 17:47 ` Lars Brinkhoff @ 2004-04-26 22:01 ` Andreas Schwab 2004-04-26 22:53 ` David Kastrup 2004-04-27 16:29 ` Richard Stallman 0 siblings, 2 replies; 12+ messages in thread From: Andreas Schwab @ 2004-04-26 22:01 UTC (permalink / raw) Cc: rms, emacs-devel Lars Brinkhoff <lars@nocrew.org> writes: > Andreas Schwab <schwab@suse.de> writes: >> Richard Stallman <rms@gnu.org> writes: >> > Apparently, you have to bind coding-system-for-write before >> > writing a source file with a literal bool-vector constant in >> > it, or else Emacs will either ask the user for the coding >> > system, or write the file using some default coding system >> > which may not do the right thing. >> > I guess we should change the syntax for bool-vectors so as to put >> > just 4 bits into each character. The question is how to do that >> > in a somewhat compatible way. >> The print syntax could use octal or hexadecimal escapes in the bit string. > > Yes. Since the print syntax for bool-vectors looks like strings, I > would sugggest doing whatever the print syntax for strings does. I have now changed the print syntax to always use octal escapes for non-ascii characters in the bool-vector string. This way the string will always be read as unibyte string, avoiding all coding issues. Andreas. -- Andreas Schwab, SuSE Labs, schwab@suse.de SuSE Linux AG, Maxfeldstraße 5, 90409 Nürnberg, Germany Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different." ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-26 22:01 ` Andreas Schwab @ 2004-04-26 22:53 ` David Kastrup 2004-04-27 12:42 ` Andreas Schwab 2004-04-27 16:29 ` Richard Stallman 1 sibling, 1 reply; 12+ messages in thread From: David Kastrup @ 2004-04-26 22:53 UTC (permalink / raw) Cc: Lars Brinkhoff, rms, emacs-devel Andreas Schwab <schwab@suse.de> writes: > Lars Brinkhoff <lars@nocrew.org> writes: > > > Yes. Since the print syntax for bool-vectors looks like strings, > > I would sugggest doing whatever the print syntax for strings does. > > I have now changed the print syntax to always use octal escapes for > non-ascii characters in the bool-vector string. This way the string > will always be read as unibyte string, avoiding all coding issues. I think that hexadecimal notation would be quite more compact, without a loss of generality and (probably unimportant) readability. -- David Kastrup, Kriemhildstr. 15, 44793 Bochum ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-26 22:53 ` David Kastrup @ 2004-04-27 12:42 ` Andreas Schwab 2004-04-27 12:52 ` David Kastrup 0 siblings, 1 reply; 12+ messages in thread From: Andreas Schwab @ 2004-04-27 12:42 UTC (permalink / raw) Cc: Lars Brinkhoff, rms, emacs-devel David Kastrup <dak@gnu.org> writes: > I think that hexadecimal notation would be quite more compact, without > a loss of generality and (probably unimportant) readability. If you use hexadecimal notation then the reader will force the string to multibyte, with octal notation it is forced to unibyte. The process of converting a unibyte string to multibyte will change characters in the range 0x80..0x9f. Maybe the bool vector reader should just force the string back to unibyte. Andreas. -- Andreas Schwab, SuSE Labs, schwab@suse.de SuSE Linux AG, Maxfeldstraße 5, 90409 Nürnberg, Germany Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different." ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-27 12:42 ` Andreas Schwab @ 2004-04-27 12:52 ` David Kastrup 0 siblings, 0 replies; 12+ messages in thread From: David Kastrup @ 2004-04-27 12:52 UTC (permalink / raw) Cc: Lars Brinkhoff, rms, emacs-devel Andreas Schwab <schwab@suse.de> writes: > David Kastrup <dak@gnu.org> writes: > > > I think that hexadecimal notation would be quite more compact, without > > a loss of generality and (probably unimportant) readability. > > If you use hexadecimal notation then the reader will force the string to > multibyte, with octal notation it is forced to unibyte. The process of > converting a unibyte string to multibyte will change characters in the > range 0x80..0x9f. Maybe the bool vector reader should just force the > string back to unibyte. Ah, uh, ok. Just forget it, then. Looks like I did not know what I was talking about. Not that this happens too rarely... -- David Kastrup, Kriemhildstr. 15, 44793 Bochum ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-26 22:01 ` Andreas Schwab 2004-04-26 22:53 ` David Kastrup @ 2004-04-27 16:29 ` Richard Stallman 2004-04-27 17:54 ` Andreas Schwab 1 sibling, 1 reply; 12+ messages in thread From: Richard Stallman @ 2004-04-27 16:29 UTC (permalink / raw) Cc: lars, emacs-devel I have now changed the print syntax to always use octal escapes for non-ascii characters in the bool-vector string. This way the string will always be read as unibyte string, avoiding all coding issues. That was too hasty. As I mentioned earlier, this solution would cause some previously-written bool-vectors to be read wrong. If nobody has any previously-written bool-vectors, that incompatibility does not matter, but are we confident of that? And is there a better way? ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-27 16:29 ` Richard Stallman @ 2004-04-27 17:54 ` Andreas Schwab 0 siblings, 0 replies; 12+ messages in thread From: Andreas Schwab @ 2004-04-27 17:54 UTC (permalink / raw) Cc: lars, emacs-devel Richard Stallman <rms@gnu.org> writes: > I have now changed the print syntax to always use octal escapes for > non-ascii characters in the bool-vector string. This way the string will > always be read as unibyte string, avoiding all coding issues. > > That was too hasty. As I mentioned earlier, this solution would cause > some previously-written bool-vectors to be read wrong. The new syntax variant is completely backward and forward compatible. Andreas. -- Andreas Schwab, SuSE Labs, schwab@suse.de SuSE Linux AG, Maxfeldstraße 5, 90409 Nürnberg, Germany Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different." ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-26 16:08 ` Andreas Schwab 2004-04-26 17:47 ` Lars Brinkhoff @ 2004-04-27 16:28 ` Richard Stallman 2004-04-27 17:47 ` Andreas Schwab 1 sibling, 1 reply; 12+ messages in thread From: Richard Stallman @ 2004-04-27 16:28 UTC (permalink / raw) Cc: lars, emacs-devel The print syntax could use octal or hexadecimal escapes in the bit string. That would be incompatible for some bool-vector values, wouldn't it? The \ character could appear in a bool-vector with the current syntax. To avoid misinterpreting some constants, I think we need to change the syntax in a bigger way, to use a new syntax that would not be valid at all under the old rules. ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-27 16:28 ` Richard Stallman @ 2004-04-27 17:47 ` Andreas Schwab 2004-04-29 10:43 ` Richard Stallman 0 siblings, 1 reply; 12+ messages in thread From: Andreas Schwab @ 2004-04-27 17:47 UTC (permalink / raw) Cc: lars, emacs-devel Richard Stallman <rms@gnu.org> writes: > The print syntax could use octal or hexadecimal escapes in the bit string. > > That would be incompatible for some bool-vector values, wouldn't it? > The \ character could appear in a bool-vector with the current syntax. No, the print syntax already used backslash as escape within the string, and the reader just uses the normal string parser. Andreas. -- Andreas Schwab, SuSE Labs, schwab@suse.de SuSE Linux AG, Maxfeldstraße 5, 90409 Nürnberg, Germany Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different." ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: Invalid read syntax for compiled bool vector 2004-04-27 17:47 ` Andreas Schwab @ 2004-04-29 10:43 ` Richard Stallman 0 siblings, 0 replies; 12+ messages in thread From: Richard Stallman @ 2004-04-29 10:43 UTC (permalink / raw) Cc: lars, emacs-devel No, the print syntax already used backslash as escape within the string, and the reader just uses the normal string parser. Ok, in that case I see no problem in your solution. Thanks for taking care of the problem. ^ permalink raw reply [flat|nested] 12+ messages in thread
end of thread, other threads:[~2004-04-29 10:43 UTC | newest] Thread overview: 12+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- [not found] <85k70cbeil.fsf@junk.nocrew.org> [not found] ` <E1BFdNk-0003Ia-Oq@fencepost.gnu.org> [not found] ` <857jw35hcq.fsf@junk.nocrew.org> 2004-04-26 14:10 ` Invalid read syntax for compiled bool vector Richard Stallman 2004-04-26 16:08 ` Andreas Schwab 2004-04-26 17:47 ` Lars Brinkhoff 2004-04-26 22:01 ` Andreas Schwab 2004-04-26 22:53 ` David Kastrup 2004-04-27 12:42 ` Andreas Schwab 2004-04-27 12:52 ` David Kastrup 2004-04-27 16:29 ` Richard Stallman 2004-04-27 17:54 ` Andreas Schwab 2004-04-27 16:28 ` Richard Stallman 2004-04-27 17:47 ` Andreas Schwab 2004-04-29 10:43 ` Richard Stallman
Code repositories for project(s) associated with this public inbox https://git.savannah.gnu.org/cgit/emacs.git This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).