Increase in 429 errors since 19th July

What is the name of the domain?

http://petredisinvestmentadvisors.com, https://www.markrahnwm.com, http://kfgstl.com

What is the error number?

429

What is the error message?

Too Many Requests

What is the issue you’re encountering?

Page requests are being rejected

We crawl and archive websites for our clients on a daily basis. Since last Wednesday (19th July) we have seen a large increase in HTTP 429 errors when crawling some client sites and have been unable to archive them. All the sites that show this error use Cloudflare. Are there any global settings in Cloudflare, or any other changes from last week, that could have caused this issue?

A 429 response means that the request has been rate limited. You are going to need to reach out to your customers and work with them to make sure that your service is not blocked.

These blocks are normally set at the domain level, so your customers should be able to allow your traffic.
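For anyone running a similar crawler, here is a minimal sketch of backing off when a 429 comes back. It is not a fix for a block configured by the site owner, only a way to avoid hammering a site that is rate limiting you. The function names, the `example-archiver/1.0` user agent, and the base and cap values are all assumptions for illustration. `Retry-After` is an optional HTTP header, so the sketch falls back to exponential backoff when it is missing or unparseable.

```python
import time
import urllib.error
import urllib.request
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def retry_delay(retry_after, attempt, base=2.0, cap=300.0):
    """Return the number of seconds to wait before retrying a 429.

    retry_after is the raw Retry-After header value (or None).
    attempt is the zero-based retry count used for the fallback backoff.
    """
    if retry_after:
        value = retry_after.strip()
        if value.isdigit():
            # Retry-After given as a number of seconds.
            return min(float(value), cap)
        try:
            # Retry-After given as an HTTP date.
            when = parsedate_to_datetime(value)
            if when.tzinfo is None:
                when = when.replace(tzinfo=timezone.utc)
            wait = (when - datetime.now(timezone.utc)).total_seconds()
            return min(max(wait, 0.0), cap)
        except (TypeError, ValueError):
            pass  # Unparseable header: fall through to backoff.
    # No usable header: exponential backoff, capped.
    return min(base * (2 ** attempt), cap)


def fetch_with_backoff(url, max_attempts=4, user_agent="example-archiver/1.0"):
    """Fetch url, retrying only on 429 and re-raising any other HTTP error."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    for attempt in range(max_attempts):
        try:
            with urllib.request.urlopen(request, timeout=30) as response:
                return response.status, response.read()
        except urllib.error.HTTPError as err:
            if err.code != 429 or attempt == max_attempts - 1:
                raise
            time.sleep(retry_delay(err.headers.get("Retry-After"), attempt))
```

A 403 is deliberately not retried here, since repeating a request that a firewall rule rejected is unlikely to help.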

Thanks for your reply. We are now seeing 403 errors rather than 429, which presumably means we are being identified as a bot. This issue is affecting about 200 websites across about 60 different clients and the problem for these all started at the same time last week. We have previously asked all our clients to allowlist our IP addresses and these haven’t changed.

We have started the process of reaching out to clients to ask them to allow our traffic, but given the number of sites this could take weeks to resolve. It seemed strange that we saw this issue at the same time across so many websites that all use Cloudflare, hence the question if there was any global Cloudflare configuration that might have caused this and if there might be any quicker way to resolve this than contacting clients individually.

Hi @marcus.aidley

hence the question if there was any global Cloudflare configuration that might have caused this and if there might be any quicker way to resolve this than contacting clients individually.

I don’t think there is such a configuration.

Could you please confirm if the 403s that you are seeing are Cloudflare branded?

If so, could you please provide us an example of a Ray ID so that we may further investigate this issue?

We are now seeing 403 errors rather than 429, which presumably means we are being identified as a bot.

Not necessarily. Cloudflare will serve a 403 response if the request violates either a default WAF managed rule enabled for all orange-clouded (proxied) Cloudflare domains or a WAF managed rule enabled for that particular zone.

I hope this helps.

Looking forward to your next reply.
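To make the two requests above easier to answer from a crawler's logs, here is a rough sketch that records whether a rejected response came through Cloudflare and what its Ray ID was. It relies on the `Server: cloudflare` and `CF-RAY` response headers. The function name and the result keys are made up for illustration, and the Ray ID pattern (16 hex characters, a hyphen, a three-letter data center code) is an assumption based on the examples in this thread. Note that these headers only show the response passed through Cloudflare; whether the 403 page itself is Cloudflare branded still has to be checked in the response body.

```python
import re

# Assumed Ray ID shape, taken from examples such as 89a55bc4cc5c3070-SEA.
RAY_ID_PATTERN = re.compile(r"^([0-9a-f]{16})-([A-Z]{3})$")


def describe_block(status, headers):
    """Summarise a rejected response for a support request.

    headers is any mapping of response header names to values.
    """
    # Header names are case-insensitive, so normalise them first.
    lowered = {name.lower(): value for name, value in headers.items()}
    server = lowered.get("server", "").lower()
    ray_id = lowered.get("cf-ray")

    result = {
        "status": status,
        "via_cloudflare": server == "cloudflare" or ray_id is not None,
        "ray_id": ray_id,
        "colo": None,  # data center code parsed from the Ray ID suffix
    }
    if ray_id:
        match = RAY_ID_PATTERN.match(ray_id.strip())
        if match:
            result["colo"] = match.group(2)
    return result


if __name__ == "__main__":
    example = describe_block(
        403, {"Server": "cloudflare", "CF-RAY": "89a55bc4cc5c3070-SEA"}
    )
    print(example)
```

Logging this for every 403 or 429 gives you a URL and Ray ID pair ready to paste into a support thread.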


Thanks for your reply. As an example, we crawled http://www.bcgadvisor.com/ at 12:14 UTC today and got a 403. The Ray ID is 89a55bc4cc5c3070-SEA

The issue is still ongoing, and we have asked our clients to reach out to you directly to resolve it. In the meantime, here are some more Ray IDs from yesterday:

| URL | Ray ID |
| --- | --- |
| http://bcafunds.com/ | 89c5293af9a8ba09-SEA |
| http://www.reaplegacy.com/ | 89c570b85b6d936c-SEA |
| http://www.rmcifinancial.com/ | 89c5601249063084-SEA |
| https://www.coppolawealth.com/ | 89c508267d31a3bf-SEA |

| URL | Ray ID |
| --- | --- |
| https://www.intergenerationalplanning.com/ | 89c594b898dd6a11-SEA |
| https://www.johnstonwealth.com/ | 89c5691bae81681e-SEA |
| https://www.networthadv.com/ | 89c53e0f2ab1ba2d-SEA |
| https://www.vtretirementplanners.com/ | 89c53da21aa6c380-SEA |

Apologies for the multiple replies, but I can’t post more than 4 links in one reply:

| URL | Ray ID |
| --- | --- |
| https://fiduciaryfinancialpartners.com/ | 89c528ae9bcfc495-SEA |
| https://www.ramfinancialservices.com/ | 89c55763c93c7690-SEA |

Put the URLs between backticks so that they don’t get turned into links and appear as </> Preformatted text instead.

If you write: `example.com`

You get: example.com

Things are looking much better today: sites that had started rejecting our requests last week are all working again. It seems unlikely that all our clients updated their configuration to allowlist our IPs at the same time (and they should have already been allowlisted anyway), so I’m still inclined to believe there was some sort of global issue that was causing our requests to be rejected.
