From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: John Mastro Newsgroups: gmane.emacs.help Subject: "Unidecode" functionality in Emacs Date: Mon, 19 Mar 2018 15:04:29 -0700 Message-ID: NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Trace: blaine.gmane.org 1521496999 28363 195.159.176.226 (19 Mar 2018 22:03:19 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Mon, 19 Mar 2018 22:03:19 +0000 (UTC) To: Help Gnu Emacs mailing list Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Mon Mar 19 23:03:15 2018 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ey2s7-0007HM-Cn for geh-help-gnu-emacs@m.gmane.org; Mon, 19 Mar 2018 23:03:15 +0100 Original-Received: from localhost ([::1]:44225 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ey2uA-0005cR-8m for geh-help-gnu-emacs@m.gmane.org; Mon, 19 Mar 2018 18:05:22 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:44977) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ey2tg-0005bK-KF for help-gnu-emacs@gnu.org; Mon, 19 Mar 2018 18:04:53 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ey2tf-0005of-BM for help-gnu-emacs@gnu.org; Mon, 19 Mar 2018 18:04:52 -0400 Original-Received: from mail-qt0-x236.google.com ([2607:f8b0:400d:c0d::236]:35174) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1ey2tf-0005oU-6G for help-gnu-emacs@gnu.org; Mon, 19 Mar 2018 18:04:51 -0400 Original-Received: by mail-qt0-x236.google.com with SMTP id s2so4596494qti.2 for ; Mon, 19 Mar 2018 15:04:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:from:date:message-id:subject:to :content-transfer-encoding; bh=MfNv00lBymmGaquCq1spe/5cqgkaKxdwm0qBnmLZtYQ=; b=b9GmBOYdqkw8Nim0c6sWRaE/aTgKf99IlN/q2oGo5lYodgdRZSYL71QZ43x1YYsLfx pue9R3fkvr1Wo+BD1wuBaN6EnrPCAibmLl4OWks/mT6QeLAW6HBSZ1gRw42rFgDceZXs 846z4M8bcv0tLHnF6uy/tpbyW277Z3QI6FE1Tb36uBrGNxQlP/3HctzukWAq2W8O46so xK4ZSDECU/I+ikdYoc7aC7IkRzSIw04QSBWdFcoJ7fhet3d2vtNdiuU73q9KcwL/8UW8 l6p8L3YWzmHMvGHAuKF5bVY6PvulU76I6cEqzfyQGCaW6cO+ShgzbIYdlZ15OqXXffcM oFNg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to :content-transfer-encoding; bh=MfNv00lBymmGaquCq1spe/5cqgkaKxdwm0qBnmLZtYQ=; b=i2n3VHWX3EK1lYXfJ3WF6nL3rVax1zv/iXnzJ7E+OYSP+gn3QtdQNw6cZlQmUSpEvF 9EI/KEAQALKT8O55M7ceUtnDS9+s5tW6yVocMhlXHYYCiJorbB4GAUSQmTWD9lRqqb4D fMhouw/EJsSaI80Yb12Cg26tiSuByAB94B1AygK/vQoKQ9TP0TMwDZ7YaEC5585VVfn8 Q4LetfdVzl8GMLcuJblKildIU43EVxip1eHLf7gpSp4we4wlGDZlC1w29ZVRBCXniSIQ pqcDDupjAun2xAEPUajQ5lCZXXhozYq8aLwU25QzRe7Ya9tPlv1bPjYPyzAKnSV4BPcd Cqkg== X-Gm-Message-State: AElRT7GV/qXVinDEm3fjwWUUbToErqJuL+aUJXLgHFhaIt4hZC5eFmdl poWFstKMrRAEeFnoH0AV7BlZ6z+Im9rYjczr0DYL1RfM X-Google-Smtp-Source: AG47ELspBAVpEYn620G01JMdfy5WvpOFbb5wISMMBA5w65KS9nbUFKEsMmpb97Wes1C67qCIZed9Hxpw4pJ9oJBY68I= X-Received: by 10.200.47.163 with SMTP id l32mr19872713qta.195.1521497090179; Mon, 19 Mar 2018 15:04:50 -0700 (PDT) Original-Received: by 10.200.48.179 with HTTP; Mon, 19 Mar 2018 15:04:29 -0700 (PDT) X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 2607:f8b0:400d:c0d::236 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "help-gnu-emacs" Xref: news.gmane.org gmane.emacs.help:116208 Archived-At: There are "Unidecode" packages for Perl[1], Python[2], and Emacs[3] (derived from one another in that order). They each transliterate Unicode text to ASCII, e.g.: (unidecode "D=C3=A9j=C3=A0 vu") ;=3D> "Deja vu" (unidecode "=E5=8C=97=E4=BA=B0") ;=3D> "Bei Jing " Does Emacs have equivalent functionality built-in? [ The context for this is that I recently submitted a change to the MELPA recipe, and Steve Purcell mentioned[4] that he would be surprised if Emacs doesn't already have such functionality. ] Thanks for any pointers John [1]: http://search.cpan.org/~sburke/Text-Unidecode-1.30/lib/Text/Unidecode.= pm [2]: https://pypi.python.org/pypi/Unidecode [3]: https://github.com/sindikat/unidecode [4]: https://github.com/melpa/melpa/pull/5351#issuecomment-373966218