Re: Unicode escapes and String literals?
Knute Johnson wrote:
Thomas Richter wrote:
Knute Johnson wrote:
I just had a great revelation as I was putting together my SSCCE for the
question I was going to ask. So it has changed my question. How do I do
the conversion of unicode [sic] escape sequences to a String that are done by
string literals?
They aren't done by String literals.
String s = "\u0066\u0065\u0064";
becomes "fed" but if you create a String with \u0066\u0065\u0064 in it
Exactly how?
without using the literal it stays \u0066\u0065\u0064. Is there a built
in mechanism in Java for doing that translation to a String?
No.
Yes. It's called "compiler". The same part of the compiler that
That's not exactly correct, and it certainly is not the same part that translates '\t'.
translates a "\t" in a string literal to the TAB control character also
replaces the unicode sequences in the string literal to the
corresponding unicode encoding.
Nope.
I want to be able to do it to a String not to a string literal.
You want to do what, exactly? I'm not clear on what you're trying to accomplish.
'\u' sequences are pre-compile, not during compile. Their presence is exactly equivalent
to typing the corresponding Unicode character directly.
You can embed them in identifiers, directives, anywhere the corresponding character can go.
Not just literals.
For that matter, you can use them in numeric literals.
<sscce>
package temp;
/**
* ShowUnicodeEscapes.
*/
public class ShowUnicodeEscapes {
static final \u0069nt COUN\u0054 = \u0030\u003b
/**
* main.
*
* @param args String array of arguments.
*/
public static void main(String[] args) {
System.out.println("COUNT = \u0022+ COUNT);
}
}
</sscce>