From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Stefan Kangas Newsgroups: gmane.emacs.bugs Subject: bug#40794: 26.3; HTML entities ☆ and ★ (inter alia) are not parsed by libxml-parse-html-region Date: Wed, 9 Sep 2020 06:22:11 -0700 Message-ID: References: <87368uwd1f.fsf@passepartout.tim-landscheidt.de> <878sf23n9k.fsf@gnus.org> <874kpq3mtk.fsf@gnus.org> Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="6358"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) Cc: 40794@debbugs.gnu.org, Tim Landscheidt To: Lars Ingebrigtsen Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Wed Sep 09 15:23:13 2020 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1kG048-0001WV-U1 for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 09 Sep 2020 15:23:12 +0200 Original-Received: from localhost ([::1]:45718 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kG047-0001HI-RG for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 09 Sep 2020 09:23:11 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:60714) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kG03y-0001Dp-Lb for bug-gnu-emacs@gnu.org; Wed, 09 Sep 2020 09:23:02 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:46338) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1kG03y-0004Qg-BE for bug-gnu-emacs@gnu.org; Wed, 09 Sep 2020 09:23:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1kG03y-0001D2-6u for bug-gnu-emacs@gnu.org; Wed, 09 Sep 2020 09:23:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Stefan Kangas Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Wed, 09 Sep 2020 13:23:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 40794 X-GNU-PR-Package: emacs Original-Received: via spool by 40794-submit@debbugs.gnu.org id=B40794.15996577394545 (code B ref 40794); Wed, 09 Sep 2020 13:23:02 +0000 Original-Received: (at 40794) by debbugs.gnu.org; 9 Sep 2020 13:22:19 +0000 Original-Received: from localhost ([127.0.0.1]:57875 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kG03H-0001BD-43 for submit@debbugs.gnu.org; Wed, 09 Sep 2020 09:22:19 -0400 Original-Received: from mail-ej1-f48.google.com ([209.85.218.48]:33636) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kG03G-0001B1-07 for 40794@debbugs.gnu.org; Wed, 09 Sep 2020 09:22:18 -0400 Original-Received: by mail-ej1-f48.google.com with SMTP id j11so3566276ejk.0 for <40794@debbugs.gnu.org>; Wed, 09 Sep 2020 06:22:17 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:user-agent :mime-version:date:message-id:subject:to:cc; bh=e8AokCPQUlZwM8hvu+AgfQAvfWScWqC5IOSW1ApNrUE=; b=VXdJEUV9n3+wcSfRbCQvpz2uWCZJ4AEPMkeebPn5aq/IaKFd3PQhdpPrPpjTk99bRn qyKYTdcA1RahZvW2Cr6fj6DgldGy7Ny8ozyenllD2E60z8TlaOstZTtMSS4t3i89LaZj mNIEm6D0AhBgUiSgpCdg60iPoBlsRRHlQjXzvWcGpWSEyqliTdvmKUyE1MOetLKkHZ/I BjR8f+Q7FrkfEE4+SdaVOEuVqM1AthUmYZLU3fpOcsAWTb+EG+axDDrAWMdqGzAWzH5o irJ/l7WQ8r/bbtSDekY/Oy3E7ksZsEQ8Pasrm7BJ2DfQg2feRQAd+Zg+CAbbJgmcYKIp SpEQ== X-Gm-Message-State: AOAM530i5F00E3yb4JxxQl9C/1YyyZ/C3C2WfrZWVkyG3nLGMqtjh56g 14lu7nPfnbUnLMtPSFKStlLd+kk99Fviml7SIog= X-Google-Smtp-Source: ABdhPJz1aq4X9PnikieDpDNDC54MYM+LETmTsZswJa1RKOxNys/knFcf4ddKGyqiNiPsZ/3ZGTX9VQBTVzvoqEo+vIM= X-Received: by 2002:a17:906:16c8:: with SMTP id t8mr3432778ejd.272.1599657732174; Wed, 09 Sep 2020 06:22:12 -0700 (PDT) Original-Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Wed, 9 Sep 2020 06:22:11 -0700 In-Reply-To: <874kpq3mtk.fsf@gnus.org> (Lars Ingebrigtsen's message of "Wed, 29 Jul 2020 07:35:51 +0200") X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:187649 Archived-At: Lars Ingebrigtsen writes: > I had a look at the libxml2 sources. The logic isn't really explained, > but apparently they include all the <255-value entities, and then a > selected number of the other entities (about 160 of them). > > I have no idea what the logic behind this is... perhaps they've just > forgotten to add the new ones? Which makes me think that this is really > a libxml2 bug, and you should report it there instead. Agreed. Tim, could you please report this to the libxml2 developers?