[erlang-bugs] xmerl default encoding

Bertil Karlsson <>
Wed Jan 23 12:31:17 CET 2008


Thank you for reporting this problem. I will take a look at it soon.

/Bertil Karlsson


Paul Mineiro wrote:
> when i run the attached document (a simple xml document that lacks an
> encoding declaration) through xmerl_scan:file/1 the result contains the
> iso-8859-1 encoding of tilde n (\361).  however the original contains the
> utf-8 encoding of tilde n (\303\261) and the character set change
> suprised me.  adding a { encoding, "utf-8" } option to xmerl_scan:file/2
> fixed things but the reference manual (and xml spec) say the utf-8 is
> the default.
>
> thanks,
>
> -- p
>
> Eshell V5.5.5  (abort with ^G)
> 1> xmerl_scan:file ("noencodingdecl.xml").
> {{xmlElement,'Actor',
>              'Actor',
>              [],
>              {xmlNamespace,[],[]},
>              [],
>              1,
>              [],
>              [{xmlText,[{'Actor',1}],1,[],"Elizabeth Pe\361a",text}],
>              [],
>              ".",
>              undeclared},
>  []}
> 2> xmerl_scan:file ("noencodingdecl.xml", [ { encoding, "utf-8" } ]).
> {{xmlElement,'Actor',
>              'Actor',
>              [],
>              {xmlNamespace,[],[]},
>              [],
>              1,
>              [],
>              [{xmlText,[{'Actor',1}],1,[],"Elizabeth Pe\303\261a",text}],
>              [],
>              ".",
>              undeclared},
>  []}
> ------------------------------------------------------------------------
>
> _______________________________________________
> erlang-bugs mailing list
> 
> http://www.erlang.org/mailman/listinfo/erlang-bugs




More information about the erlang-bugs mailing list