Googlebot Blocked with 429 & 472 Errors – Product Page Not Crawlable

What is the name of the domain?

The name of your domain is: goldsun.in

What is the issue you’re encountering

We are encountering a problem where Googlebot is unable to crawl certain product pages on our website. The pages are live and publicly accessible to users, but Google Merchant Center reports a 404 error, saying the product page is unavailable. After server-side investigation, it’s confirmed: The server is not blocking access robots.txt and .htaccess are correctly configured The issue is likely due to Cloudflare returning 429 and 472 errors when Googlebot tries to access the product pages This is affecting our product visibility in Google Shopping, especially for users in India.

What is the current SSL/TLS setting?

Off

What are the steps to reproduce the issue?

Hi Community,

We’re currently facing a crawl issue where Googlebot receives 429 and 472 status codes when trying to access certain product pages on our website.

This is causing products to be disapproved in Google Merchant Center, showing the error:

“Product page unavailable – Page returned a 404 when crawled by Google”

However:

The page is working fine on both desktop and mobile for real users.

Our server support has confirmed there are no firewall blocks, redirects, or robots.txt restrictions.

The issue started in June and continues into July.

robots.txt allows Googlebot access.

After checking server logs, we found that Cloudflare returns 429 and 472 to Googlebot’s requests, which likely triggers this issue.

Hi,

Thank you for reaching out to us.

Have you tried searching within your Security Events to see what is triggering this block? Security Events · Cloudflare Web Application Firewall (WAF) docs

Also, you can try deploy a custom rule like this example: Allow traffic from search engine bots and other verified bots · Cloudflare Web Application Firewall (WAF) docs

Please let us know if you still have questions.