From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from localhost (localhost [127.0.0.1]) by arlo.cworth.org (Postfix) with ESMTP id 7A8386DE0FE6 for ; Sun, 12 Nov 2017 13:04:33 -0800 (PST) X-Virus-Scanned: Debian amavisd-new at cworth.org X-Spam-Flag: NO X-Spam-Score: -2.331 X-Spam-Level: X-Spam-Status: No, score=-2.331 tagged_above=-999 required=5 tests=[RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01] autolearn=disabled Received: from arlo.cworth.org ([127.0.0.1]) by localhost (arlo.cworth.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id LSMc0FdS7uiV for ; Sun, 12 Nov 2017 13:04:32 -0800 (PST) Received: from nef2.ens.fr (nef2.ens.fr [129.199.96.40]) by arlo.cworth.org (Postfix) with ESMTP id 11E3A6DE0FD2 for ; Sun, 12 Nov 2017 13:04:31 -0800 (PST) Received: from geologie.ens.fr (geologie.ens.fr [129.199.70.34]) by nef2.ens.fr (8.13.6/1.01.28121999) with ESMTP id vACL4TZY037781 for ; Sun, 12 Nov 2017 22:04:29 +0100 (CET) Received: from lmd.ens.fr (strauss.ens.fr [129.199.71.3]) by geologie.ens.fr (8.13.8/8.13.1) with ESMTP id vACL4Skh003906 for ; Sun, 12 Nov 2017 22:04:29 +0100 Received: from localhost (localhost [127.0.0.1]) by lmd.ens.fr (Postfix) with ESMTP id 1BB21558002 for ; Sun, 12 Nov 2017 22:04:30 +0100 (CET) X-Virus-Scanned: amavisd-new at lmd.ens.fr Received: from lmd.ens.fr ([127.0.0.1]) by localhost (strauss.ens.fr [127.0.0.1]) (amavisd-new, port 10025) with LMTP id M0gftC2dh9ae for ; Sun, 12 Nov 2017 22:04:29 +0100 (CET) Received: from localhost (lns-bzn-30-82-253-128-101.adsl.proxad.net [82.253.128.101]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (Client did not present a certificate) (Authenticated sender: bderembl) by lmd.ens.fr (Postfix) with ESMTPSA id CF18F558001 for ; Sun, 12 Nov 2017 22:04:29 +0100 (CET) User-agent: mu4e 0.9.19; emacs 25.3.1 From: Bruno Deremble To: notmuch@notmuchmail.org Subject: accented characters Date: Sun, 12 Nov 2017 22:02:32 +0100 Message-ID: <87h8tz8b2v.fsf@ens.fr> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Scanned-By: MIMEDefang 2.79 on 129.199.70.34 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.4.3 (nef2.ens.fr [129.199.96.32]); Sun, 12 Nov 2017 22:04:29 +0100 (CET) X-Mailman-Approved-At: Sun, 12 Nov 2017 13:17:26 -0800 X-BeenThere: notmuch@notmuchmail.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Use and development of the notmuch mail system." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 12 Nov 2017 21:04:33 -0000 Hi, I am still new to notmuch and keep experimenting it; a lot of very interesting features. I realized that searching "été" and "ete" do not give the same answer which may be confusing in some situation (in case the sender has an accented name and may or may not sign his email with his accented name) A way to handle this could be to only index non accented words which requires to add a filter before the indexing process. I looked at the code and it seems that this should be handled by gmime? there are also libraries that are supposed to do that such as 'unac'. Is it something that you have been exploring already? thank you bruno