From: moasenwood--- via Users list for the GNU Emacs text editor
Reply-To: Emanuel Berg
Newsgroups: gmane.emacs.help
To: help-gnu-emacs@gnu.org
Subject: Re: "split-sentences"?
Date: Sat, 23 Jan 2021 07:38:49 +0100
Message-ID: <87v9bo9myu.fsf@zoho.eu>
References: <87zh109r2d.fsf@zoho.eu>

moasenwood--- via Users list for the GNU Emacs text editor wrote:

> Can I parse/split a string into sentences based on
> human-language punctuation?
>
> Did anyone do that already?

Doing it purely mechanically is fine, no linguistics or anything.
So this string

  "'This sentence is spoken by Mr. W. E. B Dubois, Esq.!' played through amazon.com alexa speakers?"

would be

  ("'" "This sentence is spoken by Mr" "." "W" "." "E" "." "B Dubois" ","
   "Esq" "." "!" "'" "played through amazon" "." "com" "alexa speakers" "?")

-- 
underground experts united
http://user.it.uu.se/~embe8573
https://dataswamp.org/~incal
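
To pin down what "mechanically" means here, a minimal sketch follows. It is in Python rather than Emacs Lisp, purely for illustration. The function name `split_on_punctuation` and the punctuation set are assumptions made up for this example, not an existing Emacs or library function. The rule it implements: cut at every punctuation character, keep each mark as its own element, trim whitespace, and drop empty pieces.

```python
import re

# Assumed punctuation set for this sketch; adjust to taste.
PUNCTUATION = ".,!?;:'\""


def split_on_punctuation(text, punctuation=PUNCTUATION):
    """Split TEXT at punctuation, keeping each mark as a separate element."""
    # The capturing group makes re.split return the separators as well.
    pattern = "([" + re.escape(punctuation) + "])"
    pieces = re.split(pattern, text)
    # Trim surrounding whitespace and discard pieces that are then empty.
    return [piece.strip() for piece in pieces if piece.strip()]


if __name__ == "__main__":
    sample = ("'This sentence is spoken by Mr. W. E. B Dubois, Esq.!' "
              "played through amazon.com alexa speakers?")
    for element in split_on_punctuation(sample):
        print(repr(element))
```

Since there is no punctuation between "com" and "alexa", this rule returns "com alexa speakers" as a single element, whereas the list above shows "com" on its own. Getting that extra split would take a rule beyond plain punctuation. An apostrophe inside a word (as in "don't") is also treated as a split point by this sketch.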