From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Amirouche Boubekki Newsgroups: gmane.lisp.guile.user Subject: neon: git for structured data [Was: Functional database] Date: Wed, 21 Feb 2018 15:49:00 +0100 Message-ID: <3bf20807996ce0bdc4e5ca6ea1d3776f@hypermove.net> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII; format=flowed Content-Transfer-Encoding: 7bit X-Trace: blaine.gmane.org 1519224487 28614 195.159.176.226 (21 Feb 2018 14:48:07 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Wed, 21 Feb 2018 14:48:07 +0000 (UTC) User-Agent: Roundcube Webmail/1.1.2 To: Guile User Original-X-From: guile-user-bounces+guile-user=m.gmane.org@gnu.org Wed Feb 21 15:48:03 2018 Return-path: Envelope-to: guile-user@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1eoVgg-00070F-42 for guile-user@m.gmane.org; Wed, 21 Feb 2018 15:48:02 +0100 Original-Received: from localhost ([::1]:33163 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1eoVii-0000LX-Cf for guile-user@m.gmane.org; Wed, 21 Feb 2018 09:50:08 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:57494) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1eoVhg-0008SZ-6K for guile-user@gnu.org; Wed, 21 Feb 2018 09:49:05 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1eoVhe-0006zF-VF for guile-user@gnu.org; Wed, 21 Feb 2018 09:49:04 -0500 Original-Received: from relay2-d.mail.gandi.net ([2001:4b98:c:538::194]:58940) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1eoVhe-0006xE-Ou for guile-user@gnu.org; Wed, 21 Feb 2018 09:49:02 -0500 Original-Received: from webmail.gandi.net (webmail4-d.mgt.gandi.net [10.58.1.144]) (Authenticated sender: amirouche@hypermove.net) by relay2-d.mail.gandi.net (Postfix) with ESMTPA id DF0FDC5A8E for ; Wed, 21 Feb 2018 15:49:00 +0100 (CET) X-Sender: amirouche@hypermove.net X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4b98:c:538::194 X-BeenThere: guile-user@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: General Guile related discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guile-user-bounces+guile-user=m.gmane.org@gnu.org Original-Sender: "guile-user" Xref: news.gmane.org gmane.lisp.guile.user:14455 Archived-At: I tried chez scheme and I think GNU Guile is a better platform for what I am trying to achieve, so I am back. I also know better what I want to achieve. I will create a triple store that comply with semantic web standard that is a RDF triple store. At [0] and [1] you will find a primer on what is RDF in the former and the concepts in the latter. [0] https://www.w3.org/TR/rdf11-primer/ [1] https://www.w3.org/TR/rdf11-concepts/ It will also be branch-able etc... like git. Also, I also plan to implement sparql. If you find sparql difficult I recommend the tutorial at data.world [2] in the mean time. It's not very difficult and looks like SQL. Hence I also plan to implement sparql [3]. [2] https://docs.data.world/tutorials/sparql/ [3] https://www.w3.org/TR/sparql11-overview/ What I want to do is something similar to data.world, that is a gitlab-like platform for data and replace the use of git in projects like datahub.io [4]. [4] http://datahub.io/core/registry Enough talking, what is the status? Well I finished porting what I had in chez and can now run the following scenario: - In master branch, I commit two triples - In other branch, that is orphan branch, I commit two triples among where one of them overlaps with master. - I can query both branch - In a merge commit, I fix the conflict between both branch. - I can query the resulting branch and get the expected result. The code might be easier to read [5] [5] https://github.com/amirouche/neon/blob/master/guile/neon.scm What is missing, in order of difficulty: - microkanren package https://framagit.org/a-guile-mind/microkanren - wiredtiger 3 package - Turtle aka. .ttl format parser https://www.w3.org/TR/turtle/ - sparql queries parser https://www.w3.org/TR/rdf-sparql-query/ - I am not sure of the status of guile-squee yet https://notabug.org/cwebber/guile-squee/ - pluggable backends If you want to work one of this item, send me an email. What I plan to work on next: There is a semantic difference between neon and RDF triple stores. In a triple store you can have as many times as you want the same attribute given a subject. That is (ref subject) doesn't return a proper alist. There is two other links that remain to be cited - https://www.w3.org/TR/rdf11-mt/ - https://www.w3.org/TR/2014/NOTE-rdf11-datasets-20140225/ Happy hacking, -- Amirouche ~ amz3 ~ http://www.hyperdev.fr