When I looked at the googlebot.json
file, I initially saw that it was last updated on 2024-12-31
, or, roughly 4 days before your thread.
Looking right now, they claim that they last updated it on 2025-01-07
.
Unfortuantely, I do not have any verified bot my self, but … when I go through the Add a bot link on the Verified Bots page, I see multiple verification methods, such as e.g. Reverse DNS and IP List, but only one of them are selectable at a time.
I don’t see it documented more than, e.g. this, for “IP List”:
Provide extra information for your selected Verification Method (e.g. IP list URL(s))
Whether it will accept only a plain text file, e.g. with one IP address or IP subnet per line, or it will be able to parse the JSON that Google provides, remains unknown to me.
In addition, whether it will be possible for bot owners to add or adjust a specific frequency, where Cloudflare will automatically be updating a such IP list, is likewise unknown to me.
That said, -
To my knowledge, it is the bot vendor (e.g. Google, in this case) that needs to keep their bot registrations up-to-date, including submitting requests for updating IP ranges.
And again, to my knowledge, too, I also know very well that this does often not happen from many organisations, that are often forgetting it, apparently because they’re believing it will be some sort of “fire-and-forget” kind of registration.
Fortunately though, - Google is both well-documenting bot their bot crawlers, as well as the IP ranges that their bots are using, so I’m wondering if everything needed, is simply to make Cloudflare request (and update) the list more frequently than they eventually (already) are doing.
I do not see that Google is documenting their expected update frequency though.
And I didn’t keep a note of the timestamp for 2024-12-31
, however, with 2025-01-07 15:46:02
, that I assume is UTC, it looks a little bit like the frequency for updating googlebot.json
might be weekly, at Tuesday’s, and I suspect that, if it isn’t already being done, then a weekly update, for example at Tuesday, 18:00:00 UTC, might eventually be able to limit the potential issues, at least a bit.
I’ll poke around, and try to see if I can get someone to look in to this.