From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: Boruch Baum Newsgroups: gmane.emacs.w3m,gmane.emacs.help Subject: [emacs-w3m:13611] Re: Browser Fingerprinting Date: Fri, 17 Apr 2020 04:57:16 -0400 Message-ID: <20200417085716.vbfkk5ibchc74ev5@E15-2016.optimum.net> References: <87lfmx8frv.fsf@ebih.ebihd> <873694mu9f.fsf@fliptop.i-did-not-set--mail-host-address--so-tickle-me> <20200417025514.5gotmp6vlvg3v25x@E15-2016.optimum.net> <87h7xio7dw.fsf@fliptop.i-did-not-set--mail-host-address--so-tickle-me> Reply-To: emacs-w3m@namazu.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="123778"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: NeoMutt/20180716 Cc: emacs-w3m@namazu.org, Emanuel Berg , help-gnu-emacs@gnu.org To: Tomas Nordin Original-X-From: emacs-w3m-admin@namazu.org Fri Apr 17 10:57:35 2020 Return-path: Envelope-to: gew-emacs-w3m-3@m.gmane-mx.org Original-Received: from vaj.namazu.org ([202.221.179.42]) by ciao.gmane.io with esmtp (Exim 4.92) (envelope-from ) id 1jPMoY-000VyP-Mq for gew-emacs-w3m-3@m.gmane-mx.org; Fri, 17 Apr 2020 10:57:34 +0200 Original-Received: from vaj.namazu.org (localhost [127.0.0.1]) by vaj.namazu.org (Postfix) with ESMTP id 565B6F40022; Fri, 17 Apr 2020 17:57:29 +0900 (JST) Original-Received: from mout.gmx.net (mout.gmx.net [212.227.15.18]) by vaj.namazu.org (Postfix) with ESMTP id 0A194F40021 for ; Fri, 17 Apr 2020 17:57:27 +0900 (JST) Original-Received: from E15-2016.optimum.net ([100.38.74.74]) by mail.gmx.com (mrgmx004 [212.227.17.184]) with ESMTPSA (Nemesis) id 1MRCK6-1jdbs52QjM-00N9WA; Fri, 17 Apr 2020 10:57:21 +0200 In-Reply-To: <87h7xio7dw.fsf@fliptop.i-did-not-set--mail-host-address--so-tickle-me> X-ML-Name: emacs-w3m X-Mail-Count: 13611 X-MLServer: fml [fml 4.0.3 release (20011202/4.0.3)]; post only (anyone can post) X-ML-Info: If you have a question, send e-mail with the body "help" (without quotes) to the address emacs-w3m-ctl@namazu.org; help= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1587113842; bh=Ad+CQWqvwHidXQD80+c7KNOE1NrwdcJ/yRWRRG1aVAA=; h=X-UI-Sender-Class:Date:From:To:Cc:Subject:References:In-Reply-To; b=SWjzNaLU4Osm+zYZvHKyympLpWTy8J9SOB+ivJvcTFvpzRzIhxm4+iKXeorDp+YUy ix6LcjNoHcufCFGZcZFZTwAx6Iw/5lt1u6ZmOsLiFCM3euiQp5gj/1WU7pT1qA9dex fMPDxhYOp/Rlzm+s5wGbrEjfpMipdxNU4KlNAqpY= X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c Content-Disposition: inline X-Provags-ID: V03:K1:UQech9kXVphYXMtEDCG4JK5bFxnCvxcch3kKqqWYyOBapCr90zA sY+L/U09TgkSAHES3sTPgQLBLJ75E53gwG+XyhZxehiymVXxaFB/vC3Uv7dOptgRFefxzfV hswK5DnhPl6+TBR76B98/kszi2FVj7Li5681R1VoFb9fczJfE090MeRaO2R0bA/H3PVs2aj KE80+ycuijZwbSBAA98Dg== X-UI-Out-Filterresults: notjunk:1;V03:K0:8E1/avfG14I=:skqlAuSAJ2YPwdlJ0b3+FT Yri/d/PHcqOSwSAf+SEUNWMdC/jyEXcozg5TZyPVXmW9C63/kXOx5zlbSeP0fsK+3s5d/Kthi ++9+MFUptFFBqFKzDel/t5Sx6HGl3vR27QEvCcdpI23lpfA4ssj7GEhOktfw6YXkM3MQDOtIZ r0Ak7LG0sKKIWYjmL79eDsoc+jftNzvU4GShvXPtPjfRik9PBh8aFwHSU2DM3Js88yUdNVmOB 20TcPV9MhG1i6b+f/FNvqrDMTMIAzJwqvWpsH3bXmbZdTOENyoO8a79+f79crIeqYpYVOZL3S mZm6gSXBv8sFJTLcLIaKS31m1xU1yggDrPnzQA8IB0P5ZTUeQyJnFV1ZaZhalXDL51In4SdGz 4XTAEUhD2i3G5CHrvv1VVuHaixrL/E+Za1C9NGJkRGNR4sc02OTJkzREe/CDf37q13VSnR9IH XEwx4BrfhWRHUqpFBDwCfvAYxIcI2hqekO6t9NROZlLJO8fyZo8mkVPvrhqZg46Akt1Oa9AS/ chnBf9kJJFVquurrr5fWV/YTerYcgxOKu+uIAxFXncN7eqNfZREUodY3wj2y2GBh75ldHPu3a MuZWqEoO9v3cWe6n+9ShS9h3R35YybAe5tKICdcFpcdReMmGE2mq7h9DcbG1K2QMJpGzfgaoR zIcse/KARMb+wLvNa75OaUzv+eU54n6N42Nf2H50KJ2l7nB/6V52n7NKTNMxjVrfD6DVHpz9C 9GfYitfoMfZm0LJNeQ+9aJ1nbN30hhlf5waI4BY2ZsWDDr9cab0WrTgGzQS//yg4L/4BnRwr Precedence: bulk Original-Lines: 46 List-Id: emacs-w3m.namazu.org List-Software: fml [fml 4.0.3 release (20011202/4.0.3)] List-Post: List-Owner: List-Help: List-Unsubscribe: Xref: news.gmane.io gmane.emacs.w3m:10491 gmane.emacs.help:122859 Archived-At: On 2020-04-17 10:15, Tomas Nordin wrote: > The fields tested as browser characteristtics were > ... Thanks. Good to know. > The most identifying characteristics is User Agent followed by > HTTP_ACCEPT Headers. Of course emacs-w3m and w3m have no choice but to send them, but ... > Browsing the web with a text based browser is not a common thing to do, > so from a browser fingerprinting point of view I guess the uniqeness is > to be expected. Right. That's why I suggested in my original e-mail... "What might be more useful is to set variable w3m-add-user-agent to t, and then set w3m-user-agent to some generic and popular user-agent string." That leaves us with the matter of the HTTP_ACCEPT Headers. My memory is that the information sent in that header is very limited, just a set of mime-types that the server can use to send data to the client. That doesn't seem to me to be much from which to create a fingerprint, but it would be great to do a comparison. emacs-w3m does have configurable variables for that feature that I guess ought to be what is in those headers (see, for example, w3m-content-type-alist and w3m-default-content-type) Otherwise, I don't know what identifying information is in that header. I don't have time to look at this now, but have added it to my (long) emacs-w3m to-do list. As is true with the case of the user-agent string, I would have to also see what w3m is sending. If you do more research on this, please keep me informed, and if I have any other insights, I'd be happy to share them. The simplest test would be check the uniqueness score after following my suggestion about using a false user-agent string. A second step would be to compare ACCEPT headers with a common browser, and see how changing the emacs-w3m variables can cause the results to yield a better result. =2D- hkp://keys.gnupg.net CA45 09B5 5351 7C11 A9D1 7286 0036 9E45 1595 8BC0