From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Alex Ott Newsgroups: gmane.emacs.devel Subject: Re: Language identification Date: Fri, 28 Aug 2009 08:46:06 +0200 Organization: Alex Ott's Consulting Message-ID: <87my5kl9ld.fsf@alexott.dev.webwasher.com> References: <87skfczqc8.fsf@mail.jurta.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1251441963 22855 80.91.229.12 (28 Aug 2009 06:46:03 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 28 Aug 2009 06:46:03 +0000 (UTC) Cc: joakim@verona.se, Emacs Development To: Juri Linkov Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Aug 28 08:45:56 2009 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1MgvDf-0002kI-J1 for ged-emacs-devel@m.gmane.org; Fri, 28 Aug 2009 08:45:55 +0200 Original-Received: from localhost ([127.0.0.1]:54685 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MgvDe-0005Ma-OY for ged-emacs-devel@m.gmane.org; Fri, 28 Aug 2009 02:45:54 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1MgvDZ-0005JN-3C for emacs-devel@gnu.org; Fri, 28 Aug 2009 02:45:49 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1MgvDU-00059m-C9 for emacs-devel@gnu.org; Fri, 28 Aug 2009 02:45:48 -0400 Original-Received: from [199.232.76.173] (port=43114 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MgvDT-00059K-KG for emacs-devel@gnu.org; Fri, 28 Aug 2009 02:45:43 -0400 Original-Received: from mx20.gnu.org ([199.232.41.8]:31290) by monty-python.gnu.org with esmtps (TLS-1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1MgvDT-0007oN-05 for emacs-devel@gnu.org; Fri, 28 Aug 2009 02:45:43 -0400 Original-Received: from mail-fx0-f226.google.com ([209.85.220.226]) by mx20.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1MgvDR-0003df-UN for emacs-devel@gnu.org; Fri, 28 Aug 2009 02:45:42 -0400 Original-Received: by fxm26 with SMTP id 26so1685358fxm.42 for ; Thu, 27 Aug 2009 23:45:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:from:to:cc:subject :organization:references:date:in-reply-to:message-id:user-agent :mime-version:content-type; bh=3jAboX+K3JvY/uspMvKoeH10uk4pzHvqHCXAq9GSYK0=; b=nOwZ7CqsQToO8n/XHPvyJQsQSGsr6jNQmZBsmMFfQ8M175mn6MDFQDU5Ff95Oeg+Dt 6trRQz1vG1tudqsIWUHb4PH6fAt5vkHFiCc/Dxu7m7sqIbMpRRa0IYAqb3H2EKHmf1Py UaI3Qdz3fqsaKA1WNjw5USCVYYK3MhD2z+TJQ= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:to:cc:subject:organization:references:date:in-reply-to :message-id:user-agent:mime-version:content-type; b=aDLcuGWKCVFFwY6FaN4PzURz+xqzgZ9+mfxrzOmqCHp4RQUPhCz2x602p5+EECiyzj k8l9BIMF+sZXuB1DFumV5i2t+WCVPnLFmEz3p25J850N/4SdVmYdnmCUUQ+hRSVaUZqO zp+k/oKkzmoGn8Si1aGgb3XjS61+YIv+1qyFE= Original-Received: by 10.204.157.24 with SMTP id z24mr561000bkw.208.1251441940420; Thu, 27 Aug 2009 23:45:40 -0700 (PDT) Original-Received: from alexott.dev.webwasher.com (pdbfw01.securecomputing.com [80.66.20.180]) by mx.google.com with ESMTPS id z15sm1047556fkz.34.2009.08.27.23.45.39 (version=TLSv1/SSLv3 cipher=RC4-MD5); Thu, 27 Aug 2009 23:45:39 -0700 (PDT) In-Reply-To: <87skfczqc8.fsf@mail.jurta.org> (Juri Linkov's message of "Fri, 28 Aug 2009 03:27:35 +0300") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1.50 (gnu/linux) X-Detected-Operating-System: by mx20.gnu.org: GNU/Linux 2.6 (newer, 2) X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6, seldom 2.4 (older, 4) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:114723 Archived-At: Sorry, I skipped, that this was about programming languages, not real languages. Juri Linkov at "Fri, 28 Aug 2009 03:27:35 +0300" wrote: >> I often wish that files would open in Emacs with correct mode >> more often when there is no file extension. JL> In `auto-mode-alist' you can see that with the exception of JL> `archive-mode', `doc-view-mode' and `image-mode', all remaining JL> modes are programming text modes. It would be more useful JL> to identify file types for these modes that libmagic can't do. JL> Do you know a library that identifies programming languages? JL> Such a library might be implemented using a Bayesian classifier JL> trained on a sufficiently large corpus of different programming JL> languages. -- With best wishes, Alex Ott, MBA http://alexott.blogspot.com/ http://xtalk.msk.su/~ott/ http://alexott-ru.blogspot.com/