Class XmlParser
- java.lang.Object
-
- com.microstar.xml.XmlParser
-
public class XmlParser extends java.lang.Object
Parse XML documents and return parse events through call-backs.You need to define a class implementing the
XmlHandler
interface: an object belonging to this class will receive the callbacks for the events. (As an alternative to implementing the full XmlHandler interface, you can simply extend theHandlerBase
convenience class.)Usage (assuming that
MyHandler
is your implementation of theXmlHandler
interface):XmlHandler handler = new MyHandler(); XmlParser parser = new XmlParser(); parser.setHandler(handler); try { parser.parse("http://www.host.com/doc.xml", null); } catch (Exception e) { [do something interesting] }
Alternatively, you can use the standard SAX interfaces with the
SAXDriver
class as your entry point.- Since:
- Ptolemy II 0.2
- Version:
- 1.1
- Author:
- Copyright (c) 1997, 1998 by Microstar Software Ltd., Written by David Megginson <dmeggins@microstar.com>
- See Also:
XmlHandler
,HandlerBase
-
-
Field Summary
Fields Modifier and Type Field Description static int
ATTRIBUTE_CDATA
Constant: the attribute value is a string value.static int
ATTRIBUTE_DEFAULT_FIXED
Constant: the attribute was declared #FIXED.static int
ATTRIBUTE_DEFAULT_IMPLIED
Constant: the attribute was declared #IMPLIED.static int
ATTRIBUTE_DEFAULT_REQUIRED
Constant: the attribute was declared #REQUIRED.static int
ATTRIBUTE_DEFAULT_SPECIFIED
Constant: the attribute has a literal default value specified.static int
ATTRIBUTE_DEFAULT_UNDECLARED
Constant: the attribute is not declared.static int
ATTRIBUTE_ENTITIES
Constant: the attribute value is a list of entity names.static int
ATTRIBUTE_ENTITY
Constant: the attribute value is the name of an entity.static int
ATTRIBUTE_ENUMERATED
Constant: the attribute value is a token from an enumeration.static int
ATTRIBUTE_ID
Constant: the attribute value is a unique identifier.static int
ATTRIBUTE_IDREF
Constant: the attribute value is a reference to a unique identifier.static int
ATTRIBUTE_IDREFS
Constant: the attribute value is a list of ID references.static int
ATTRIBUTE_NMTOKEN
Constant: the attribute value is a name token.static int
ATTRIBUTE_NMTOKENS
Constant: the attribute value is a list of name tokens.static int
ATTRIBUTE_NOTATION
Constant: the attribute is the name of a notation.static int
ATTRIBUTE_UNDECLARED
Constant: the attribute has not been declared for this element type.static int
CONTENT_ANY
Constant: the element has a content model of ANY.static int
CONTENT_ELEMENTS
Constant: the element has element content.static int
CONTENT_EMPTY
Constant: the element has declared content of EMPTY.static int
CONTENT_MIXED
Constant: the element has mixed content.static int
CONTENT_UNDECLARED
Constant: an element has not been declared.static int
ENTITY_INTERNAL
Constant: the entity is internal.static int
ENTITY_NDATA
Constant: the entity is external, non-XML data.static int
ENTITY_TEXT
Constant: the entity is external XML data.static int
ENTITY_UNDECLARED
Constant: the entity has not been declared.
-
Constructor Summary
Constructors Constructor Description XmlParser()
Construct a new parser with no associated handler.
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description java.util.Enumeration
declaredAttributes(java.lang.String elname)
Get the declared attributes for an element type.java.util.Enumeration
declaredElements()
Get the declared elements for an XML document.java.util.Enumeration
declaredEntities()
Get declared entities.java.util.Enumeration
declaredNotations()
Get declared notations.java.lang.String
getAttributeDefaultValue(java.lang.String name, java.lang.String aname)
Retrieve the default value of a declared attribute.int
getAttributeDefaultValueType(java.lang.String name, java.lang.String aname)
Retrieve the default value type of a declared attribute.java.lang.String
getAttributeEnumeration(java.lang.String name, java.lang.String aname)
Retrieve the allowed values for an enumerated attribute type.java.lang.String
getAttributeExpandedValue(java.lang.String name, java.lang.String aname)
Retrieve the expanded value of a declared attribute.int
getAttributeType(java.lang.String name, java.lang.String aname)
Retrieve the declared type of an attribute.int
getColumnNumber()
Return the current column number.java.lang.String
getCurrentElement()
Return the current element.java.lang.String
getElementContentModel(java.lang.String name)
Look up the content model of an element.int
getElementContentType(java.lang.String name)
Look up the content type of an element.java.lang.String
getEntityNotationName(java.lang.String eName)
Get the notation name associated with an NDATA entity.java.lang.String
getEntityPublicId(java.lang.String ename)
Return an external entity's public identifier, if any.java.lang.String
getEntitySystemId(java.lang.String ename)
Return an external entity's system identifier.int
getEntityType(java.lang.String ename)
Find the type of an entity.java.lang.String
getEntityValue(java.lang.String ename)
Return the value of an internal entity.int
getLineNumber()
Return the current line number.java.lang.String
getNotationPublicId(java.lang.String nname)
Look up the public identifier for a notation.java.lang.String
getNotationSystemId(java.lang.String nname)
Look up the system identifier for a notation.java.lang.String
intern(char[] ch, int start, int length)
Create an internalised string from a character array.java.lang.String
intern(java.lang.String s)
Return an internalised version of a string.void
parse(java.lang.String systemId, java.lang.String publicId, java.io.InputStream stream, java.lang.String encoding)
Parse an XML document from a byte stream.void
parse(java.lang.String systemId, java.lang.String publicId, java.io.Reader reader)
Parse an XML document from a character stream.void
parse(java.lang.String systemId, java.lang.String publicId, java.lang.String encoding)
Parse an XML document from a URI.void
setHandler(XmlHandler handler)
Set the handler that will receive parsing events.
-
-
-
Field Detail
-
CONTENT_UNDECLARED
public static final int CONTENT_UNDECLARED
Constant: an element has not been declared.
-
CONTENT_ANY
public static final int CONTENT_ANY
Constant: the element has a content model of ANY.
-
CONTENT_EMPTY
public static final int CONTENT_EMPTY
Constant: the element has declared content of EMPTY.
-
CONTENT_MIXED
public static final int CONTENT_MIXED
Constant: the element has mixed content.
-
CONTENT_ELEMENTS
public static final int CONTENT_ELEMENTS
Constant: the element has element content.
-
ENTITY_UNDECLARED
public static final int ENTITY_UNDECLARED
Constant: the entity has not been declared.
-
ENTITY_INTERNAL
public static final int ENTITY_INTERNAL
Constant: the entity is internal.
-
ENTITY_NDATA
public static final int ENTITY_NDATA
Constant: the entity is external, non-XML data.
-
ENTITY_TEXT
public static final int ENTITY_TEXT
Constant: the entity is external XML data.
-
ATTRIBUTE_UNDECLARED
public static final int ATTRIBUTE_UNDECLARED
Constant: the attribute has not been declared for this element type.
-
ATTRIBUTE_CDATA
public static final int ATTRIBUTE_CDATA
Constant: the attribute value is a string value.
-
ATTRIBUTE_ID
public static final int ATTRIBUTE_ID
Constant: the attribute value is a unique identifier.
-
ATTRIBUTE_IDREF
public static final int ATTRIBUTE_IDREF
Constant: the attribute value is a reference to a unique identifier.
-
ATTRIBUTE_IDREFS
public static final int ATTRIBUTE_IDREFS
Constant: the attribute value is a list of ID references.
-
ATTRIBUTE_ENTITY
public static final int ATTRIBUTE_ENTITY
Constant: the attribute value is the name of an entity.
-
ATTRIBUTE_ENTITIES
public static final int ATTRIBUTE_ENTITIES
Constant: the attribute value is a list of entity names.
-
ATTRIBUTE_NMTOKEN
public static final int ATTRIBUTE_NMTOKEN
Constant: the attribute value is a name token.
-
ATTRIBUTE_NMTOKENS
public static final int ATTRIBUTE_NMTOKENS
Constant: the attribute value is a list of name tokens.
-
ATTRIBUTE_ENUMERATED
public static final int ATTRIBUTE_ENUMERATED
Constant: the attribute value is a token from an enumeration.
-
ATTRIBUTE_NOTATION
public static final int ATTRIBUTE_NOTATION
Constant: the attribute is the name of a notation.
-
ATTRIBUTE_DEFAULT_UNDECLARED
public static final int ATTRIBUTE_DEFAULT_UNDECLARED
Constant: the attribute is not declared.
-
ATTRIBUTE_DEFAULT_SPECIFIED
public static final int ATTRIBUTE_DEFAULT_SPECIFIED
Constant: the attribute has a literal default value specified.
-
ATTRIBUTE_DEFAULT_IMPLIED
public static final int ATTRIBUTE_DEFAULT_IMPLIED
Constant: the attribute was declared #IMPLIED.
-
ATTRIBUTE_DEFAULT_REQUIRED
public static final int ATTRIBUTE_DEFAULT_REQUIRED
Constant: the attribute was declared #REQUIRED.
-
ATTRIBUTE_DEFAULT_FIXED
public static final int ATTRIBUTE_DEFAULT_FIXED
Constant: the attribute was declared #FIXED.
-
-
Method Detail
-
setHandler
public void setHandler(XmlHandler handler)
Set the handler that will receive parsing events.- Parameters:
handler
- The handler to receive callback events.- See Also:
parse(java.lang.String, java.lang.String, java.lang.String)
,XmlHandler
-
parse
public void parse(java.lang.String systemId, java.lang.String publicId, java.lang.String encoding) throws java.lang.Exception
Parse an XML document from a URI.You may parse a document more than once, but only one thread may call this method for an object at one time.
- Parameters:
systemId
- The URI of the document.publicId
- The public identifier of the document, or null.encoding
- The suggested encoding, or null if unknown.- Throws:
java.lang.Exception
- Any exception thrown by your own handlers, or any derivation of java.io.IOException thrown by the parser itself.
-
parse
public void parse(java.lang.String systemId, java.lang.String publicId, java.io.InputStream stream, java.lang.String encoding) throws java.lang.Exception
Parse an XML document from a byte stream.The URI that you supply will become the base URI for resolving relative links, but Ælfred will actually read the document from the supplied input stream.
You may parse a document more than once, but only one thread may call this method for an object at one time.
- Parameters:
systemId
- The base URI of the document, or null if not known.publicId
- The public identifier of the document, or null if not known.stream
- A byte input stream.encoding
- The suggested encoding, or null if unknown.- Throws:
java.lang.Exception
- Any exception thrown by your own handlers, or any derivation of java.io.IOException thrown by the parser itself.
-
parse
public void parse(java.lang.String systemId, java.lang.String publicId, java.io.Reader reader) throws java.lang.Exception
Parse an XML document from a character stream.The URI that you supply will become the base URI for resolving relative links, but Ælfred will actually read the document from the supplied input stream.
You may parse a document more than once, but only one thread may call this method for an object at one time.
- Parameters:
systemId
- The base URI of the document, or null if not known.publicId
- The public identifier of the document, or null if not known.reader
- A character stream.- Throws:
java.lang.Exception
- Any exception thrown by your own handlers, or any derivation of java.io.IOException thrown by the parser itself.
-
intern
public java.lang.String intern(java.lang.String s)
Return an internalised version of a string.Ælfred uses this method to create an internalised version of all names and attribute values, so that it can test equality with
==
instead ofString.equals()
.If you want to be able to test for equality in the same way, you can use this method to internalise your own strings first:
String PARA = handler.intern("PARA");
Note that this will not return the same results as String.intern().
- Parameters:
s
- The string to internalise.- Returns:
- An internalised version of the string.
- See Also:
intern(char[],int,int)
,String.intern()
-
intern
public java.lang.String intern(char[] ch, int start, int length)
Create an internalised string from a character array.This is much more efficient than constructing a non-internalised string first, and then internalising it.
Note that this will not return the same results as String.intern().
- Parameters:
ch
- an array of characters for building the string.start
- the starting position in the array.length
- the number of characters to place in the string.- Returns:
- an internalised string.
- See Also:
intern(String)
,String.intern()
-
declaredElements
public java.util.Enumeration declaredElements()
Get the declared elements for an XML document.The results will be valid only after the DTD (if any) has been parsed.
- Returns:
- An enumeration of all element types declared for this document (as Strings).
- See Also:
getElementContentType(java.lang.String)
,getElementContentModel(java.lang.String)
-
getElementContentType
public int getElementContentType(java.lang.String name)
Look up the content type of an element.- Parameters:
name
- The element type name.- Returns:
- An integer constant representing the content type.
- See Also:
getElementContentModel(java.lang.String)
,CONTENT_UNDECLARED
,CONTENT_ANY
,CONTENT_EMPTY
,CONTENT_MIXED
,CONTENT_ELEMENTS
-
getElementContentModel
public java.lang.String getElementContentModel(java.lang.String name)
Look up the content model of an element.The result will always be null unless the content type is CONTENT_ELEMENTS or CONTENT_MIXED.
- Parameters:
name
- The element type name.- Returns:
- The normalised content model, as a string.
- See Also:
getElementContentType(java.lang.String)
-
declaredAttributes
public java.util.Enumeration declaredAttributes(java.lang.String elname)
Get the declared attributes for an element type.- Parameters:
elname
- The name of the element type.- Returns:
- An Enumeration of all the attributes declared for a specific element type. The results will be valid only after the DTD (if any) has been parsed.
- See Also:
getAttributeType(java.lang.String, java.lang.String)
,getAttributeEnumeration(java.lang.String, java.lang.String)
,getAttributeDefaultValueType(java.lang.String, java.lang.String)
,getAttributeDefaultValue(java.lang.String, java.lang.String)
,getAttributeExpandedValue(java.lang.String, java.lang.String)
-
getAttributeType
public int getAttributeType(java.lang.String name, java.lang.String aname)
Retrieve the declared type of an attribute.- Parameters:
name
- The name of the associated element.aname
- The name of the attribute.- Returns:
- An integer constant representing the attribute type.
- See Also:
ATTRIBUTE_UNDECLARED
,ATTRIBUTE_CDATA
,ATTRIBUTE_ID
,ATTRIBUTE_IDREF
,ATTRIBUTE_IDREFS
,ATTRIBUTE_ENTITY
,ATTRIBUTE_ENTITIES
,ATTRIBUTE_NMTOKEN
,ATTRIBUTE_NMTOKENS
,ATTRIBUTE_ENUMERATED
,ATTRIBUTE_NOTATION
-
getAttributeEnumeration
public java.lang.String getAttributeEnumeration(java.lang.String name, java.lang.String aname)
Retrieve the allowed values for an enumerated attribute type.- Parameters:
name
- The name of the associated element.aname
- The name of the attribute.- Returns:
- A string containing the token list.
- See Also:
ATTRIBUTE_ENUMERATED
,ATTRIBUTE_NOTATION
-
getAttributeDefaultValue
public java.lang.String getAttributeDefaultValue(java.lang.String name, java.lang.String aname)
Retrieve the default value of a declared attribute.- Parameters:
name
- The name of the associated element.aname
- The name of the attribute.- Returns:
- The default value, or null if the attribute was #IMPLIED or simply undeclared and unspecified.
- See Also:
getAttributeExpandedValue(java.lang.String, java.lang.String)
-
getAttributeExpandedValue
public java.lang.String getAttributeExpandedValue(java.lang.String name, java.lang.String aname)
Retrieve the expanded value of a declared attribute.All general entities will be expanded.
- Parameters:
name
- The name of the associated element.aname
- The name of the attribute.- Returns:
- The expanded default value, or null if the attribute was #IMPLIED or simply undeclared
- See Also:
getAttributeDefaultValue(java.lang.String, java.lang.String)
-
getAttributeDefaultValueType
public int getAttributeDefaultValueType(java.lang.String name, java.lang.String aname)
Retrieve the default value type of a declared attribute.- Parameters:
name
- The name of the element.aname
- The name of the attribute.- Returns:
- ATTRIBUTE_DEFAULT_UNDECLARED if the attribute cannot be found, otherwise return an integer.
- See Also:
ATTRIBUTE_DEFAULT_SPECIFIED
,ATTRIBUTE_DEFAULT_IMPLIED
,ATTRIBUTE_DEFAULT_REQUIRED
,ATTRIBUTE_DEFAULT_FIXED
-
declaredEntities
public java.util.Enumeration declaredEntities()
Get declared entities.- Returns:
- An Enumeration of all the entities declared for this XML document. The results will be valid only after the DTD (if any) has been parsed.
- See Also:
getEntityType(java.lang.String)
,getEntityPublicId(java.lang.String)
,getEntitySystemId(java.lang.String)
,getEntityValue(java.lang.String)
,getEntityNotationName(java.lang.String)
-
getCurrentElement
public java.lang.String getCurrentElement()
Return the current element.- Returns:
- The current Element.
-
getEntityType
public int getEntityType(java.lang.String ename)
Find the type of an entity.- Parameters:
ename
- The name of the entity.- Returns:
- An integer constant representing the entity type.
- See Also:
ENTITY_UNDECLARED
,ENTITY_INTERNAL
,ENTITY_NDATA
,ENTITY_TEXT
-
getEntityPublicId
public java.lang.String getEntityPublicId(java.lang.String ename)
Return an external entity's public identifier, if any.- Parameters:
ename
- The name of the external entity.- Returns:
- The entity's system identifier, or null if the entity was not declared, if it is not an external entity, or if no public identifier was provided.
- See Also:
getEntityType(java.lang.String)
-
getEntitySystemId
public java.lang.String getEntitySystemId(java.lang.String ename)
Return an external entity's system identifier.- Parameters:
ename
- The name of the external entity.- Returns:
- The entity's system identifier, or null if the entity was not declared, or if it is not an external entity.
- See Also:
getEntityType(java.lang.String)
-
getEntityValue
public java.lang.String getEntityValue(java.lang.String ename)
Return the value of an internal entity.- Parameters:
ename
- The name of the internal entity.- Returns:
- The entity's value, or null if the entity was not declared, or if it is not an internal entity.
- See Also:
getEntityType(java.lang.String)
-
getEntityNotationName
public java.lang.String getEntityNotationName(java.lang.String eName)
Get the notation name associated with an NDATA entity.- Parameters:
eName
- The NDATA entity name.- Returns:
- The associated notation name, or null if the entity was not declared, or if it is not an NDATA entity.
- See Also:
getEntityType(java.lang.String)
-
declaredNotations
public java.util.Enumeration declaredNotations()
Get declared notations.- Returns:
- An Enumeration of all the notations declared for this XML document. The results will be valid only after the DTD (if any) has been parsed.
- See Also:
getNotationPublicId(java.lang.String)
,getNotationSystemId(java.lang.String)
-
getNotationPublicId
public java.lang.String getNotationPublicId(java.lang.String nname)
Look up the public identifier for a notation. You will normally use this method to look up a notation that was provided as an attribute value or for an NDATA entity.- Parameters:
nname
- The name of the notation.- Returns:
- A string containing the public identifier, or null if none was provided or if no such notation was declared.
- See Also:
getNotationSystemId(java.lang.String)
-
getNotationSystemId
public java.lang.String getNotationSystemId(java.lang.String nname)
Look up the system identifier for a notation. You will normally use this method to look up a notation that was provided as an attribute value or for an NDATA entity.- Parameters:
nname
- The name of the notation.- Returns:
- A string containing the system identifier, or null if no such notation was declared.
- See Also:
getNotationPublicId(java.lang.String)
-
getLineNumber
public int getLineNumber()
Return the current line number.- Returns:
- The current line number.
-
getColumnNumber
public int getColumnNumber()
Return the current column number.- Returns:
- The current column number.
-
-