From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.devel Subject: Re: CSV parsing and other issues (Re: LC_NUMERIC) Date: Thu, 10 Jun 2021 17:10:51 -0400 Message-ID: References: <20210606233638.v7b7rwbufay5ltn7@E15-2016.optimum.net> <83a6o1hn9l.fsf@gnu.org> <20210608004510.usj7rw2i6tmx6qnw@E15-2016.optimum.net> <83h7i9f5ij.fsf@gnu.org> <73df2202-081b-5e50-677d-e4498b6782d4@gmail.com> <83eedcdw8k.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="10652"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) Cc: emacs-devel@gnu.org, boruch_baum@gmx.com To: Maxim Nikulin Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Fri Jun 11 00:29:09 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1lrTAj-0002Yt-Fr for ged-emacs-devel@m.gmane-mx.org; Fri, 11 Jun 2021 00:29:09 +0200 Original-Received: from localhost ([::1]:33948 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1lrTAh-0004eH-N5 for ged-emacs-devel@m.gmane-mx.org; Thu, 10 Jun 2021 18:29:07 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:47238) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1lrTA9-0003yB-Nb for emacs-devel@gnu.org; Thu, 10 Jun 2021 18:28:33 -0400 Original-Received: from mailscanner.iro.umontreal.ca ([132.204.25.50]:53368) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1lrTA6-0001or-W3 for emacs-devel@gnu.org; Thu, 10 Jun 2021 18:28:33 -0400 Original-Received: from pmg1.iro.umontreal.ca (localhost.localdomain [127.0.0.1]) by pmg1.iro.umontreal.ca (Proxmox) with ESMTP id 4F8A91007CA; Thu, 10 Jun 2021 18:28:29 -0400 (EDT) Original-Received: from mail01.iro.umontreal.ca (unknown [172.31.2.1]) by pmg1.iro.umontreal.ca (Proxmox) with ESMTP id 9861610028B; Thu, 10 Jun 2021 18:28:27 -0400 (EDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=iro.umontreal.ca; s=mail; t=1623364107; bh=Egp09l/PbY14H8HfJ6iRTrcAHwk2dS/7Upydm9ggULQ=; h=From:To:Cc:Subject:References:Date:In-Reply-To:From; b=IqGcReaAHFYrFhbOHsM/Iq1jzY24qt4a3mYm1UeveEpE7rvtoW+vAWq2rdXwPVbih vNkXifJNCeU1ngrIedqK65L0lomFfWRCK0Iu9Y71xEjjxdDHT8P/V97+9+LbsoLT1u TlkgO3O8y34F0QRhbJ//AwLORZZj22/JTXeMjFHdyXF+dXv40iNfWEYrLfmd8ZObNv IaBsHOaMj80jx9SPeVunm3HtT/tNH3h+mw5/aC5Nswe2uwlFgSDXQr6YfIQ6OtjuWF YYga8xzzCqAZF+yoHXg6h3sGC0vnsMM3DudPtHz1xuRDn8ppe9tcIWAM3UDAHqr/Jp +hwxKmaf5JWsA== Original-Received: from alfajor (69-196-163-239.dsl.teksavvy.com [69.196.163.239]) by mail01.iro.umontreal.ca (Postfix) with ESMTPSA id C1F421209C7; Thu, 10 Jun 2021 17:10:51 -0400 (EDT) In-Reply-To: (Maxim Nikulin's message of "Thu, 10 Jun 2021 23:28:59 +0700") Received-SPF: pass client-ip=132.204.25.50; envelope-from=monnier@iro.umontreal.ca; helo=mailscanner.iro.umontreal.ca X-Spam_score_int: -42 X-Spam_score: -4.3 X-Spam_bar: ---- X-Spam_report: (-4.3 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_MED=-2.3, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:270678 Archived-At: > There are plenty of CSV dialects. If decimal separator is "," then office > software uses ";" instead of comma as cell (field) separator. But there's no reason to presume that a given CSV file was generated in the same locale as the one we're currently using. So the locale could be one ingredient in the machinery used to guess which separator was used, but I'm not sure it would be of much help. [ BTW, I'll take the opportunity to advocate for the use of TSV instead, which is slightly less ill-defined. ] Stefan