Re: reading filenames from stdin - with umlauts?
On Mon, 28 Jul 2008 00:32:23 +0000, Stefan Ram wrote:
Dan Stromberg <dstromberglists@gmail.com> writes:
isr = (new InputStreamReader(System.in, "ISO-8859-1")
Now, I become aware of another fact:
??java.lang.System.in?? already has an encoding.
You might try to use this as a base instead:
http://download.java.net/jdk7/docs/api/java/io/FileDescriptor.html#in
I tried this but still I get file not found with OpenJDK. gcj seems fine
though:
FileReader fr = null;
// isr = (new InputStreamReader(System.in, "ISO-8859-1"));
// isr = (new InputStreamReader(System.in, "UTF-8"));
fr = (new FileReader(java.io.FileDescriptor.in));
System.err.println("Encoding on fr is " + fr.getEncoding());
//BufferedReader stdin = new BufferedReader (fr);
StringBuffer line;
char ch;
int int_char;
try
{
while (true)
{
line = new StringBuffer("");
while(true)
{
int_char = fr.read();
if (int_char == -1)
{
break;
}
ch = (char)int_char;
System.out.println("" + ch);
if (ch == (char)10)
{
break;
}
line.append(ch);
}
if (int_char == -1)
{
break;
}
System.out.println(new String(line));
lst.add(new Sortable_file(new String(line)));
}
}
catch(java.io.IOException e)
{
BTW, this code says the encoding is ASCII when I run it, whether using
OpenJDK or gcj.
Is the java String type -always- 16 bits per character? That is, if I
try to stick an 8 bit value into a String, is it always going to be
converted to a different encoding that maps back most of the time, but
not always?
Do java strings of any sort have an associated but variable encoding?
Are there different string types that have different encodings?
Is there any way of opening a filename that isn't stored in a String?
Short of something like SWIG, JNI or ctypes that is?