Parsers Layer

Parsing classes should simply take some input stream and convert it into a valid DOM. For text/html and text/xml the result is a logical grove or tree of Elements. For parsing text/plain, the parser could simply create a DOM of one element whose contents would be all the text in the file. Other files types which could be parsed into a tree structure include: C, Java, (any programming language), mail, news, amigaguide, (any hypertext format which could be transmitted over the WWW), etc.

Possible Class/Interfaces

pedantiWEB.parsers.GenericParser
For parsing things. It would be given an input stream and a string describing the type. Based on that it would use the proper specific class.
pedantiWEB.parsers.MIME.text.GenericParser
For text of unknown type.
pedantiWEB.parsers.MIME.text.plain.GenericParser
pedantiWEB.parsers.MIME.text.html.GenericParser
pedantiWEB.parsers.MIME.text.xml.GenericParser
pedantiWEB.parsers.MIME.text.css.GenericParser
pedantiWEB.parsers.MIME.application.rtf.GenericParser
pedantiWEB.parsers.MIME.application.x-latex.GenericParser
pedantiWEB.parsers.MIME.application.x-troff.GenericParser
Parsers for handling things of that specific MIME type.

Go to pedantiWEB, a GPL, Java web browser

Bill Bereza - bereza@pobox.com
<URL:http://www.pobox.com/~bereza/pedantiWEB/parsers.html>

Last Updated on Wednesday, December 03, 1997.

$Id: parsers.html,v 1.3 1997/12/12 16:36:25 bereza Exp bereza $

HTML Checked!