[erlang-questions] Erlang 3000?
Johnny Billquist
bqt@REDACTED
Wed Nov 19 12:03:12 CET 2008
Hasan Veldstra wrote:
>
> On 19 Nov 2008, at 09:33AM, Johnny Billquist wrote:
>
>> Richard O'Keefe wrote:
>>> On 17 Nov 2008, at 1:44 pm, Håkan Stenholm wrote:
>>>> Conditionals that have more than 2 branches where none of them
>>>> branch on
>>>> patterns - e.g. something like the to_upper code below:
>>>>
>>>> to_upper_char(C) when is_integer(C) ->
>>>> if
>>>> $a =< C, C =< $z -> C - 32;
>>>> 16#E0 =< C, C =< 16#F6 -> C - 32;
>>>> 16#F8 =< C, C =< 16#FE -> C - 32;
>>>> true -> C
>>>> end.
>>>
>>> If this is supposed to convert ISO Latin 1 characters
>>> to upper case, it's wrong. (In at least two different
>>> ways, both of which are left as an exercise for the
>>> reader.) Code for such purposes simply shouldn't be
>>> written by hand these days, but should be generated by
>>> some sort of script from the (relevant portion of the)
>>> Unicode data base. As such, whether it's written using
>>> 'case', 'if', or embedded Intercal should be of no interest.
>>
>> ??? That's a perfectly correct translation from lowercase to uppercase
>> for Latin-1.
>> Unicode is something else.
>>
>> Or is there just some bug in the code as such that I fail to see?
>
> "straße" - German for "street" - can be encoded in ISO Latin 1, but both
> your code and the one in the standard string module will fail to convert
> it to upper-case properly.
Well, Latin-1 don't have an uppercase version of ß, so that letter can't
be converted, if you want to stay in Latin-1. The rest will do just fine.
So, within the limitations of Latin-1, the function works fine, as far
as I can tell.
Johnny
More information about the erlang-questions
mailing list