[erlang-questions] Downcase Accented characters
Loïc Hoguin
essen@REDACTED
Sun Oct 21 21:46:17 CEST 2012
This only works for letters found in latin1, not for all the uppercases
found in unicode. If that's good enough for you then you don't need ux. :)
On 10/21/2012 09:39 PM, Roberto Ostinelli wrote:
> For the records, this just works..
>
> start() ->
> Unicode = list_to_binary("∞-HOpe@☺.EXAMple.com/My❤"),
> Result = list_to_binary(string:to_lower(binary_to_list(Unicode))),
> "∞-hope@☺.example.com/my <http://example.com/my>❤" = binary_to_list(Result).
>
> any downsides I'm not seeing?
>
> On Sun, Oct 21, 2012 at 12:25 PM, Roberto Ostinelli <roberto@REDACTED
> <mailto:roberto@REDACTED>> wrote:
>
> Thank you Loïc,
>
> did you happen to benchmark it? Would that be better/faster than a
> simple list_to_binary(string:to_lower(binary_to_list(Bin)))?
>
>
> On Sun, Oct 21, 2012 at 12:18 PM, Loïc Hoguin <essen@REDACTED
> <mailto:essen@REDACTED>> wrote:
>
> On 10/21/2012 09:14 PM, Roberto Ostinelli wrote:
>
> Dear list,
>
> I've a binary string which includes accented characters and
> unicode,
> that i need to downcase.
>
> Is my real best option here to convert everything to list
> and downcase that?
>
>
> Your current best option is ux_string:to_lower/1 from the ux
> library which will properly lower all characters, not just A-Z.
>
> Should be at https://github.com/erlang-__unicode/ux
> <https://github.com/erlang-unicode/ux>
>
> --
> Loďc Hoguin
> Erlang Cowboy
> Nine Nines
> http://ninenines.eu
>
>
>
--
Loïc Hoguin
Erlang Cowboy
Nine Nines
http://ninenines.eu
More information about the erlang-questions
mailing list