Search Support

Avoid support scams. We will never ask you to call or text a phone number or share personal information. Please report suspicious activity using the “Report Abuse” option.

Learn More

Web Privacy Security Crawl Using Selenium Blocking Enhanced Tracking Protection

  • 1 phendula
  • 0 inale ngxaki
  • 8 views
  • Impendulo yokugqibela ngu msmay1

more options

Hi,

I am working on a web crawler that scrapes set of websites analyzing certain privacy strings that are present before and after Global Privacy Signals are sent. We make use of Selenium Web Driver to perform the crawl. Part of the data we collect is the urlClassification of third party sites that are tracking users on the site, via Firefox's Enhanced Tracking Protection. However when performing the crawl, Enhanced Tracking Protection data is no longer available while Selenium has automatic control over Firefox, even though there is indeed third party sites (fingerprinters, cross-site tracking cookies) active on these sites when they are manually visited not using our crawler. Does anyone have an idea why our software may be interfering with Firefox's Enhanced Tracking Protection? Thanks!

Hi, I am working on a web crawler that scrapes set of websites analyzing certain privacy strings that are present before and after Global Privacy Signals are sent. We make use of Selenium Web Driver to perform the crawl. Part of the data we collect is the urlClassification of third party sites that are tracking users on the site, via Firefox's Enhanced Tracking Protection. However when performing the crawl, Enhanced Tracking Protection data is no longer available while Selenium has automatic control over Firefox, even though there is indeed third party sites (fingerprinters, cross-site tracking cookies) active on these sites when they are manually visited not using our crawler. Does anyone have an idea why our software may be interfering with Firefox's Enhanced Tracking Protection? Thanks!

All Replies (1)

more options

Additional note for above: It is important to note that the crawler worked as expected in early June 2024, before failing to log the urlClassification data provided by ETP in July 2024.

Helpful?

Buza umbuzo

You must log in to your account to reply to posts. Please start a new question, if you do not have an account yet.