[erlang-questions] Downcase Accented characters

Loïc Hoguin essen@REDACTED
Sun Oct 21 21:46:17 CEST 2012


This only works for letters found in latin1, not for all the uppercases 
found in unicode. If that's good enough for you then you don't need ux. :)

On 10/21/2012 09:39 PM, Roberto Ostinelli wrote:
> For the records, this just works..
>
> start() ->
> Unicode = list_to_binary("∞-HOpe@☺.EXAMple.com/My❤"),
> Result = list_to_binary(string:to_lower(binary_to_list(Unicode))),
> "∞-hope@☺.example.com/my <http://example.com/my>❤" = binary_to_list(Result).
>
> any downsides I'm not seeing?
>
> On Sun, Oct 21, 2012 at 12:25 PM, Roberto Ostinelli <roberto@REDACTED
> <mailto:roberto@REDACTED>> wrote:
>
>     Thank you Loïc,
>
>     did you happen to benchmark it? Would that be better/faster than a
>     simple list_to_binary(string:to_lower(binary_to_list(Bin)))?
>
>
>     On Sun, Oct 21, 2012 at 12:18 PM, Loïc Hoguin <essen@REDACTED
>     <mailto:essen@REDACTED>> wrote:
>
>         On 10/21/2012 09:14 PM, Roberto Ostinelli wrote:
>
>             Dear list,
>
>             I've a binary string which includes accented characters and
>             unicode,
>             that i need to downcase.
>
>             Is my real best option here to convert everything to list
>             and downcase that?
>
>
>         Your current best option is ux_string:to_lower/1 from the ux
>         library which will properly lower all characters, not just A-Z.
>
>         Should be at https://github.com/erlang-__unicode/ux
>         <https://github.com/erlang-unicode/ux>
>
>         --
>         Loďc Hoguin
>         Erlang Cowboy
>         Nine Nines
>         http://ninenines.eu
>
>
>


-- 
Loïc Hoguin
Erlang Cowboy
Nine Nines
http://ninenines.eu



More information about the erlang-questions mailing list