From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "Alan E. Davis" Newsgroups: gmane.emacs.help Subject: Re: local database types for personal use Date: Sun, 22 Jan 2012 07:51:16 -0800 Message-ID: References: <87hb0oi049.fsf@ambire.localdomain> <87d3bchz76.fsf@ambire.localdomain> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=0016e6dd9615a1c2d104b71fe318 X-Trace: dough.gmane.org 1327247517 19059 80.91.229.12 (22 Jan 2012 15:51:57 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Sun, 22 Jan 2012 15:51:57 +0000 (UTC) To: Soichi Ishida , help-gnu-emacs@gnu.org, Thien-Thi Nguyen Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Sun Jan 22 16:51:49 2012 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([140.186.70.17]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Rozhs-0004xd-HQ for geh-help-gnu-emacs@m.gmane.org; Sun, 22 Jan 2012 16:51:48 +0100 Original-Received: from localhost ([::1]:55195 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Rozhr-0002rP-DO for geh-help-gnu-emacs@m.gmane.org; Sun, 22 Jan 2012 10:51:47 -0500 Original-Received: from eggs.gnu.org ([140.186.70.92]:44404) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Rozhm-0002rJ-Ev for help-gnu-emacs@gnu.org; Sun, 22 Jan 2012 10:51:43 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Rozhj-0002a9-Qx for help-gnu-emacs@gnu.org; Sun, 22 Jan 2012 10:51:42 -0500 Original-Received: from mail-we0-f169.google.com ([74.125.82.169]:42465) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Rozhj-0002Zv-Et for help-gnu-emacs@gnu.org; Sun, 22 Jan 2012 10:51:39 -0500 Original-Received: by werl4 with SMTP id l4so1880210wer.0 for ; Sun, 22 Jan 2012 07:51:37 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=UpWZqiGDZFXjj8lZmtVbJ3BPfR7QfMANeXf+GMft494=; b=Uij6De+LLcVSfR49zWebM0Fri/sO1m2emCzrGdUlRWTz2kd99IvQztxNTvHzu+njgU qOfbQZ5FjghDl/wlzqfCHv+MQv5Xm5BqgE2M0kXsXx7llkrG6Cj+NjxLtqNwlwtMyDDH rPcX5m9DInul0X/E0lIuB1EWuTblqC1btc9xs= Original-Received: by 10.216.136.70 with SMTP id v48mr1991338wei.48.1327247497740; Sun, 22 Jan 2012 07:51:37 -0800 (PST) Original-Received: by 10.216.165.85 with HTTP; Sun, 22 Jan 2012 07:51:16 -0800 (PST) In-Reply-To: X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 74.125.82.169 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:83537 Archived-At: --0016e6dd9615a1c2d104b71fe318 Content-Type: text/plain; charset=ISO-8859-1 Did you find a format amenable to your needs? I had intended to send this short example of band format some months ago. Ban format is perfect for what you are doing. You can write simple utility routines to search or sort the database. Two entries in band format follow. A record does not need to occupy on line. "..hw" always begins a new record: ..hw lun kaeja .gs .ge Jellyfish (secondary etc names describe the specific type) .ec Jellyfish .cg cnidaria .la Mokil .src .n Father of a student .dt [2010-08-27 Fri] ..hw lagong .gs .ge Big, green fly .ec .cg Insect .la Cebuano .src FSD .n came up as an exception to the general name for fly. .dt [2010-06-11 Fri] .. begins a record ..hw headword tag .gs Scientific Name .ge English gloss .ec English common name .cg catagory .la language ..src Source .dt date I wrote some little macros / routines that converted a record into a typeset dictionary in LaTeX. A frontend, as it were. Each tag is typeset in a distinct font. Robert Hsu who was at the University of Hawaii linguistics department in the 90s, wrote a spitbol/snobol4 package for manipulating data in band format, used in producing several dictionaries of several pacific languages. Others have developed band formats distinct from this one. Alan On Mon, Dec 26, 2011 at 11:08 AM, Alan E. Davis wrote: > Band format is a free-form format used in linguistics for lexical data. > > The following is from a google search. > > > http://billposer.org/Linguistics/Computation/LectureNotes/ParsingLexica.html > > I have used my own version of this format for a lexicon. If you wish, I > can spend more time explaining how ti did it. > > Basically, each "field" is marked by a dot and one or more letters. Two > dots mark the beginning of a record. > > ..hw means "headword" (use anything you want) > .d could mean date > .ge could mean "english gloss" > .gd could mean "dutch gloss" > .p could mean part of speech. > > Two spaces preceed each .X > > New categories can be made up on the fly. > > Alan Davis > > > On Sun, Dec 25, 2011 at 5:57 PM, Thien-Thi Nguyen wrote: > >> () Soichi Ishida >> () Mon, 26 Dec 2011 10:47:58 +0900 >> >> I have totally forgotten EBD :) >> >> What if I have more than one table? >> >> Then your forgetfulness is ideal. >> >> > --0016e6dd9615a1c2d104b71fe318 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Did you find a format amenable to your needs?=A0=A0

I had intended = to send this short example of band format some months ago.=A0=A0=A0 Ban for= mat is perfect for what you are doing.=A0 You can write simple utility rout= ines to search or sort the database.=A0

Two entries in=A0 band format follow.=A0 A record does not need to occu= py on line.=A0 "..hw" always begins a new record:

..hw lun= kaeja=A0 .gs=A0=A0 .ge Jellyfish (secondary etc names describe the specifi= c type)=A0 .ec Jellyfish=A0 .cg cnidaria=A0 .la Mokil=A0 .src=A0 .n Father = of a student=A0=A0 .dt [2010-08-27 Fri]

..hw lagong=A0 .gs=A0=A0 .ge Big, green fly=A0 .ec=A0=A0 .cg Insect=A0 = .la Cebuano=A0 .src FSD .n came up as an exception to the general name for = fly.=A0=A0 .dt [2010-06-11 Fri]

.. begins a record
..hw=A0 headword tag
.gs=A0 Scientific Name .ge=A0 English gloss
.ec=A0 English common name
.cg=A0 catagory .la=A0 language
..src=A0 Source
.dt=A0 date

I wrote some lit= tle macros / routines that converted a record into a typeset dictionary in = LaTeX.=A0 A frontend, as it were.=A0 Each tag is typeset in a distinct font= .=A0 Robert Hsu who was at the University of Hawaii linguistics department = in the 90s, wrote a spitbol/snobol4 package for manipulating data in band f= ormat, used in producing several dictionaries of several pacific languages.= =A0 Others have developed band formats distinct from this one.=A0

Alan


On Mon, Dec 26, 2011 at 11:08 AM, Alan E= . Davis <lngndvs@gmail.com> wrote:
Band format is a free-form format used in linguistics for lexical data.=A0= =A0

The following is=A0 from a google search.

http://billposer.org/Linguistics/Computation/LectureNotes= /ParsingLexica.html

I have used my own version of this format for a lexicon.=A0 If you wish= , I can spend more time explaining how ti did it.

Basically, each &q= uot;field" is marked by a dot and one or more letters.=A0 Two dots mar= k the beginning of a record.

..hw=A0 means "headword"=A0 (use anything you want)
.d=A0= =A0=A0 could mean date
.ge=A0 could mean "english gloss"
.g= d=A0 could mean "dutch gloss"
.p=A0=A0=A0 could mean part of s= peech.

Two spaces preceed each .X=A0

New categories can be made up on the fly.=

Alan Davis


On Sun, Dec 25, 2011 at 5:57 PM, Thien-Thi Nguyen <<= a href=3D"mailto:ttn@gnuvola.org" target=3D"_blank">ttn@gnuvola.org>= wrote:
() Soichi Ishida <soujiro0725@gmail.com>
() Mon, 26 Dec 2011 10:47:58 +0900

=A0 I have totally forgotten EBD :)

=A0 What if I have more than one table?

Then your forgetfulness is ideal.



--0016e6dd9615a1c2d104b71fe318--