unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed
From: Robert Pluim <rpluim@gmail.com>
To: Po Lu <luangruo@yahoo.com>
Cc: emacs-devel@gnu.org
Subject: Re: default charset for text/html selection in X11
Date: Thu, 22 Jun 2023 11:07:45 +0200	[thread overview]
Message-ID: <87edm3g90e.fsf@gmail.com> (raw)
In-Reply-To: <878rcc0vzs.fsf@yahoo.com> (Po Lu's message of "Thu, 22 Jun 2023 15:57:59 +0800")

>>>>> On Thu, 22 Jun 2023 15:57:59 +0800, Po Lu <luangruo@yahoo.com> said:

    Po Lu> Robert Pluim <rpluim@gmail.com> writes:
    >>>>>>> On Thu, 22 Jun 2023 11:37:14 +0800, Po Lu <luangruo@yahoo.com> said:
    >> 
    >> Po Lu> Po Lu <luangruo@yahoo.com> writes:
    >> >> What is the type of the string?  IOW, what's
    >> >> 
    >> >> (get-text-property html 'foreign-selection)
    >> 
    >> Po Lu> (get-text-property 0 html 'foreign-selection), of course.  Sorry about
    >> Po Lu> the confusion.
    >> 
    >> (get-text-property 0 'foreign-selection html) => STRING
    >> 
    >> but itʼs definitely a utf-8 string, not iso-latin-1.

    Po Lu> Would you please report this as a bug, to the Chromium developers?
    Po Lu> That is, if:

    Po Lu>   (x-get-selection-internal 'CLIPBOARD 'text/html)

    Po Lu> returns a string of the same type.

It does.

    Po Lu> The ICCCM clearly states that:

    Po Lu>   STRING as a type or a target specifies the ISO Latin-1 character set
    Po Lu>   plus the control characters TAB (octal 11) and NEWLINE (octal 12.)
    Po Lu>   The spacing interpretation of TAB is context dependent.  Other ASCII
    Po Lu>   control characters are explicitly not included in STRING at the
    Po Lu>   present time.

Iʼm not about to contradict the ICCCM, but `gui-get-selection' does
the following

                    ;; Guess at the charset for types like text/html
                    ;; -- it can be anything, and different
                    ;; applications use different encodings.
                    ((string-match-p "\\`text/" (symbol-name data-type))
                     (decode-coding-string
                      data (car (detect-coding-string data))))
                    ;; Do nothing.

I took a closer look, and `yank-media' does the wrong thing, but
`(yank-media-types t)' and selecting "text/html" does the right
thing. The difference is that the former uses
`gui-backend-get-selection', and the latter uses `gui-get-selection',
and thus does the auto-detection.

Robert
-- 



  reply	other threads:[~2023-06-22  9:07 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-06-21 15:51 default charset for text/html selection in X11 Robert Pluim
2023-06-21 17:13 ` Eli Zaretskii
2023-06-22  0:56 ` Po Lu
2023-06-22  3:37   ` Po Lu
2023-06-22  7:23     ` Robert Pluim
2023-06-22  7:57       ` Po Lu
2023-06-22  9:07         ` Robert Pluim [this message]
2023-06-22 11:48           ` Po Lu
2023-06-22 12:27             ` Robert Pluim
2023-06-22 10:08       ` Eli Zaretskii
2023-06-22 12:14         ` Robert Pluim
2023-06-22 12:26           ` Yuri Khan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87edm3g90e.fsf@gmail.com \
    --to=rpluim@gmail.com \
    --cc=emacs-devel@gnu.org \
    --cc=luangruo@yahoo.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).