TagSoup Clean-Up Service
- Base URL
- Request Methods
- Request Parameters
- Response Codes
- Response Format
- Implementation Notes
The XAK TagSoup Clean-Up Service provides a simple online service to allow on the fly correction of malformed and dodgy HTML documents found in the wild. This allows them to be processed further, e.g. to extract metadata or apply XSLT transformations.
The service is based on John Cowan's TagSoup Parser.
The Base URL of the query service is:
This service currently only supports the HTTP
|html-uri||URL of HTML data to process||Yes||1|
TagSoup supports a number of other parameters, these same parameters can be applied to this service. Consult the TagSoup documentation for a complete list of options (see section "TagSoup as a stand-alone program").
200-- successful transformation
400-- missing parameter
500-- error fixing data or fetching data
The service currently returns all responses with a
html option is specified, in which case the response is served as
TagSoup also supports responses in PYX format.
These responses are returned as
This service has been implemented using TagSoup 1.0 Release Candidate 6.