As designed Romanize titles in URLs

ge66

Well-known member
Affected version: XF 2.0
There is a problem with how /src/vendor/dokuwiki/utf8.php is used.
This bit doesn't seem to be used:
Code:
global $UTF8_ROMANIZATION;
if(empty($UTF8_ROMANIZATION)) $UTF8_ROMANIZATION = array(
  // scandinavian - differs from what we do in deaccent
  'å'=>'a','Å'=>'A','ä'=>'a','Ä'=>'A','ö'=>'o','Ö'=>'O',
  // ... (rest of the table not quoted here)
);

Instead, the generated URLs use, for example, 'ä' => 'ae'.

Might there be a language setting or other setting I have missed?
 
I'm fairly sure this is expected.

We don't just romanize the URL; we also deaccent it first. The deaccent process (IMO correctly) converts ä to ae. We then run the result through the romanization process, which takes care of any remaining non-romanized characters.
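To make the ordering concrete, here is a minimal sketch of why the Scandinavian romanization entries never fire. The two tables are abbreviated stand-ins, not the real ones from utf8.php: the 'ä' => 'ae' entry comes from this thread, the 'ö' => 'oe' entry is assumed by analogy, and the function name is hypothetical.

```php
<?php
// Abbreviated, hypothetical deaccent table. 'ä' => 'ae' is taken from the
// thread; 'ö' => 'oe' is an assumption made for illustration.
const SKETCH_DEACCENT = ['ä' => 'ae', 'Ä' => 'Ae', 'ö' => 'oe', 'Ö' => 'Oe'];

// Scandinavian entries as quoted from the romanization table above.
const SKETCH_ROMANIZE = [
    'å' => 'a', 'Å' => 'A', 'ä' => 'a', 'Ä' => 'A', 'ö' => 'o', 'Ö' => 'O',
];

// Deaccent first, then romanize whatever is left.
function sketchDeaccentThenRomanize(string $title): string
{
    $deaccented = strtr($title, SKETCH_DEACCENT);
    return strtr($deaccented, SKETCH_ROMANIZE);
}

// 'ä' is already 'ae' by the time romanization runs, so the 'ä' => 'a'
// entry has nothing to match; only 'å' is still there to be romanized.
echo sketchDeaccentThenRomanize('Västerås'), "\n"; // Vaesteras
```

The sketch assumes precomposed UTF-8 characters and a UTF-8 source file; `strtr()` with an array replaces each match once and does not rescan its own output.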
 
In Sweden it should be 'å'=>'a','Å'=>'A','ä'=>'a','Ä'=>'A','ö'=>'o','Ö'=>'O' IMO; as the comment says, Scandinavian differs.
Since utf8.php already recognizes the Scandinavian case, could that mapping be triggered in some way, so it doesn't have to be changed by hand after every XF update?
 
The only other solution I can think of is romanising before deaccenting.

The thing is, either way, I don’t think there’s a one-size-fits-all solution, so it won’t be “right” in some contexts.
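As a rough illustration of that trade-off, the sketch below romanizes first and deaccents second. The tables are abbreviated and partly assumed: the romanization entries are the ones quoted earlier in the thread, 'ä' => 'ae' is from the thread, and 'ü' => 'ue' is an assumption added for illustration. The function name is hypothetical.

```php
<?php
// Scandinavian entries as quoted from the romanization table in this thread.
const ALT_ROMANIZE = [
    'å' => 'a', 'Å' => 'A', 'ä' => 'a', 'Ä' => 'A', 'ö' => 'o', 'Ö' => 'O',
];

// Abbreviated, hypothetical deaccent table; 'ü' => 'ue' is an assumption.
const ALT_DEACCENT = ['ä' => 'ae', 'ü' => 'ue'];

// Reversed order: romanize first, then deaccent whatever is left.
function altRomanizeThenDeaccent(string $title): string
{
    $romanized = strtr($title, ALT_ROMANIZE);
    return strtr($romanized, ALT_DEACCENT);
}

echo altRomanizeThenDeaccent('Västerås'), "\n"; // Vasteras (Swedish expectation met)
echo altRomanizeThenDeaccent('Müller'), "\n";   // Mueller  (untouched by romanization)
echo altRomanizeThenDeaccent('Bär'), "\n";      // Bar      (a German reader would expect Baer)
```

With this order the Swedish title comes out as requested, but the German word loses its 'ae', which is the "not right in some contexts" case.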
 