-
Notifications
You must be signed in to change notification settings - Fork 96
Is there evidence for referral tracking by CDNs? #99
Comments
Until explicitly stated by the CDN in its public documentation regarding privacy and their service, that they DON'T collect NOR use The WWW has a DNT (Do Not Track) header, but as many know that is not honored by those that WANT to track you for their own benefit... |
I noticed that the MathJax CDN is actually being run behind CloudFlare, which means that CloudFlare's privacy policy applies. From the CloudFlare privacy policy:
From a privacy perspective, it is also noteworthy that CloudFlare uses cookies to uniquely identify your device across networks. It is not clear if this information is logged or used outside of identifying your computer as "trusted" when you change networks:
While I am not a lawyer, it seems possible that a referer header could be considered part of a visitor interaction and therefore can/would be logged. |
Can anyone explain how a web-server would be able to separate if his visitor is a web crawler or a human if that crawler doesn't expose itself as a crawler via headers? What they describe in the part i quoted is that they will use ANY data they can get ahold of to assemble (eg. combine) data to determine the connection paths between websites on the internet.
|
TriMoon asked "how". Individual 'headers' exist as concatenated label:value pairs with an http request and their consecutive order varies among agents. This detail serves as a bit of a fingerprint (and extremely few bots, or headless browsers, bother to spoof this detail). for reference, find a downloadable copy of this 47page PDF: Also, google crawlers (and those operated by other search indexing entities) hail from a known/published set of IPaddresses, er, netranges. Many webservers employ "web application firewalls" which contain netrange lookup lists ~~ WAF will detect (and or react) if your request is a spoof, based on e.g. "no, googlebot would NEVER hail from that IPaddress". |
https://developers.google.com/ad-exchange/rtb/cookie-guide See: Cookie Syncing |
Is there any evidence to believe, or not believe, that CDN's are recording HTTP referrers (technically
referers
) from billions of hits?For those that aren't familiar, HTTP referrers (technically
referers
) are strings that are often transmitted as part of an HTTP request for a resource. That string tells the other resource which page requested it. Historically, most browsers sent them for almost every request, which posed massive privacy and security issues. That's been clamped down a little as of late, but they are still in wide use (and Google appears to use them for some surprising things).It is trivial to strip out those headers programmatically or via an extension, but most people do neither.
By the way, here is a Mozilla development page regarding this issue: https://wiki.mozilla.org/Privacy/Features/Shortened_HTTP_Referer_header
So, is there any evidence, one way or another, regarding the collection of this data by CDN's?
The text was updated successfully, but these errors were encountered: