[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [ajug-members]: Parsing HTML
- To: ajug-members@ajug.org
- Subject: Re: [ajug-members]: Parsing HTML
- From: Rob Kischuk <rkischuk@gttx.org>
- Date: Thu, 13 May 2004 11:38:29 -0400
- In-Reply-To: <20040512140715.12534.qmail@web50206.mail.yahoo.com>
- References: <20040512140715.12534.qmail@web50206.mail.yahoo.com>
- User-Agent: Mozilla Thunderbird 0.5 (Windows/20040207)
Although I've heard that some people hava a beef with it, I've found
that the Swing HTML parser (doesn't need to be in a swing app) has met
my needs. It has a nice and simple callback API that lets you choose
what tags you care about and ignore everything else:
http://java.sun.com/products/jfc/tsc/articles/bookmarks/
-Rob
Mike Barnes wrote:
>I have a project that needs to be able to analyze html
>files.
>
>What is the best way to do this. Is it to first run
>HTML through JTIDY
>and then an XML parser? Or is there a more efficient
>way?
>
>
>Thanks in advance.
>
>Mike Barnes
>
>
>
>
>__________________________________
>Do you Yahoo!?
>Yahoo! Movies - Buy advance tickets for 'Shrek 2'
>http://movies.yahoo.com/showtimes/movie?mid=1808405861
>
>