I work at an ad network company - and many of our publishers use WordPress and Cloudflare. They need to have their ads.txt files available to demand partners to crawl programmatically with tools like curl. Often this will fail using curl but not a normal web browser so the publisher thinks all is well but partners like Google and Appnexus are not able to crawl and injest this important file. When using curl on a site like this we get:
HTTP/1.1 403 Forbidden
Date: Wed, 10 May 2023 17:50:21 GMT
Content-Type: text/html; charset=UTF-8
Transfer-Encoding: chunked
Connection: close
Cross-Origin-Embedder-Policy: require-corp
Cross-Origin-Opener-Policy: same-origin
Cross-Origin-Resource-Policy: same-origin
Permissions-Policy: accelerometer=(),autoplay=(),camera=(),clipboard-read=(),clipboard-write=(),fullscreen=(),geolocation=(),gyroscope=(),hid=(),interest-cohort=(),magnetometer=(),microphone=(),payment=(),publickey-credentials-get=(),screen-wake-lock=(),serial=(),sync-xhr=(),usb=()
Referrer-Policy: same-origin
X-Frame-Options: SAMEORIGIN
cf-mitigated: challenge
Cache-Control: private, max-age=0, no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Expires: Thu, 01 Jan 1970 00:00:01 GMT
Report-To: {"endpoints":[{"url":"https:\/\/a.nel.cloudflare.com\/report\/v3?s=t3R4vOZNP6I42V6Z0sx4Za0pPh2%2F94bmTCTLdhatr%2B6L99iWlnif9LFgoKPS2kRl7twWSyZKko%2Bsmo9MVV%2B1XOzfzSomZ4rKcY1CTZDlPIObQXnrjtmVmFbwC1hH59Dp124xNcrvt2CVf6jn"}],"group":"cf-nel","max_age":604800}
NEL: {"success_fraction":0,"report_to":"cf-nel","max_age":604800}
Server: cloudflare
CF-RAY: 7c54062ab86e31df-LAX
alt-svc: h3=":443"; ma=86400, h3-29=":443"; ma=86400
<!DOCTYPE html>
<html lang="en-US">
<head>
<title>Just a moment...</title>
yada yada
<span id="challenge-error-text">
Enable JavaScript and cookies to continue
</span>
What can we tell users to do to keep this text file from being part of Cloudflare’s javascript challenge?