jsoup For Mac Publisher's description
from Jonathan Hedley
jsoup: Java HTML Parser
jsoup is a Java library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jquery-like methods.
* parse HTML from a URL, file, or string
* find and extract data, using DOM traversal or CSS selectors
* manipulate the HTML elements, attributes, and text
* clean user-submitted content against a safe white-list
jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree.
What's New in This Release:В· Added .before(html) and .after(html) methods to Element and Elements, to insert sibling HTML
В· Added :contains(text) selector, to search for elements containing the specified text
В· Added :has(selector) pseudo-selector
В· Added Element#parents and Elements#parents to retrieve an element\'s ancestor chain
В· Fixes an issue where appending / prepending rows to a table (or to similar implicit element structures) would create a redundant wrapping elements
В· Improved implicit close tag heuristic detection when parsing malformed HTML
В· Fixes an issue where text content after a script (or other data-node) was incorrectly added to the data node.
В· Fixes an issue where text order was incorrect when parsing pre-document HTML.
System Requirements:В· Java 1.5 or later
В· Apache Commons Lang
Program Release Status:
Program Install Support: Install and Uninstall