Redirect with unicode URLs doesn't work

Hello,

I have trouble redirecting urls with unicode, here is an example of redirection:
from: https://www.coralgranddivers.com/post/ใช-หน-ากากย-งไง-ให-ด

to: https://th.coralgranddivers.com/post/ใช-หน-ากากย-งไง-ให-ด

here is the curl command to test with encoded url:

curl -I "https://www.coralgranddivers.com/post/%E0%B9%83%E0%B8%8A-%E0%B8%AB%E0%B8%99-%E0%B8%B2%E0%B8%81%E0%B8%B2%E0%B8%81%E0%B8%A2-%E0%B8%87%E0%B9%84%E0%B8%87-%E0%B9%83%E0%B8%AB-%E0%B8%94"

Can you share a screenshot of the URL Normalisation settings on your Cloudflare Dashboard?

https://developers.cloudflare.com/rules/normalization/manage/

OK. With those settings you just use the URLEncoded version in the Page Rule:

% curl communitymvp.ga/ใช-หน-ากากย-งไง-ให-ด --dump-header - --silent | egrep -i '^HTTP|^Location'
HTTP/1.1 302 Moved Temporarily
Location: https://example.com/urlencoded

Thanks, it works!

So using page rules fixed it but it wouldn’t work with bulk redirects feature.

I guess there is a bug in Bulk redirects somewhere, from this page I can see only one URL is properly encoded, that might be a clue:

I was using Page Rules. Looks like there is a bug in Bulk Redirects. Calling @smarsh for backup.

A Source of https://communitymvp.ga/bulk/%E0%B9%83%E0%B8%8A-%E0%B8%AB%E0%B8%99-%E0%B8%B2%E0%B8%81%E0%B8%B2%E0%B8%81%E0%B8%A2-%E0%B8%87%E0%B9%84%E0%B8%87-%E0%B9%83%E0%B8%AB-%E0%B8%94 in Bulk redirects is converted in the UI to https://communitymvp.ga/bulk/ใช-หน-ากากย-งไง-ให-ด. Requesting either format does not get a URL match.

Not a bug; Bulk Redirects doesnt support non-ASCII input. We have a feature request open but currently not prioritised. Will update here as soon as we are able to pick it up.

FWIW we dont support non-ASCII Page Rules either.

1 Like

Ah, I think i see what you mean now. So i’ve got a URL, encoded it to www.seriesoftubes.co.uk%2Fa%2Fb%2Fc, saved it in my redirect list (with the first %2F converted to a /, for validation) and once saved I can see its been translated to www.seriesoftubes.co.uk/a/b/c.

Let me chat w/ the team - I suspect it all falls into the same bucket of ‘ASCII only’.

1 Like