Re: How to get file name from URL object

From:

"Oliver Wong" <owong@castortech.com>

Newsgroups:

comp.lang.java.help

Date:

Mon, 6 Nov 2006 11:55:02 -0500

Message-ID:

<HNJ3h.47046$Jv3.598611@weber.videotron.net>

"John W. Kennedy" <jwkenne@attglobal.net> wrote in message
news:FMs3h.1183$AQ6.957@newsfe11.lga...

carlbernardi@gmail.com wrote:

Hi,

I am trying to save web pages to file but sometimes the URL doesn't
have the file name with it as www.google.com doesn't for example. In
this case I don't know how to get the file name of the web page since
it is not specified. If there is way to use the URL class or other
classes to get the URL to look like so http://www.google.com/index.html
then my problem would be solved.

Just in case you didn't know, sometimes the contents of webpages come
from multiple files, and sometimes the contents of webpages don't come from
a file at all.

Not in general. If the server chooses to regard "/" as an adequate
resource name -- and most do --, there is nothing the client can do about
it.

Possible alternative designs include using heuristics (like what FireFox
does when you try to save a webpage), or somehow establishing a 1-to-1
correspondance between URLs and legal filenames, perhaps via some sort of
encoding process. For example, you might use base64 encoding so that the URL
"http://www.google.com/" is encoded to the filename
"aHR0cDovL3d3dy5nb29nbGUuY29tLw==.html".

- Oliver