|
|||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectorg.jasen.core.StandardParserData
Holds the information obtained from parsing and tokenizing the message.
Constructor Summary | |
StandardParserData()
|
Method Summary | |
int |
getConcealedHtmlCount()
Gets the number of occurrances of concealed HTML. |
int |
getFalseAnchorCount()
Gets the count of anchor tags whose text was URL text (eg http://...) . |
String |
getHtmlAsText()
Gets the HTML part of the message as plain text |
int |
getImageCount()
Gets the number of images in the email body. |
String[] |
getMessageTokens()
Gets the single (word) tokens extracted from the message |
int |
getObfuscatedCharacterCount()
Gets the number of character obfuscation observations. |
List |
getPorts()
Gets the list of TCP ports found appended to URLs in the HTML body of the message. |
int |
getSrcCgiCount()
Gets the number of occurrances of SRC cgi references. |
int |
getSrcPortCount()
Gets the list of anchor or image src (or href) attributes which had alternate TCP ports appended. |
String |
getTextParsed()
Gets the text part of the message as "clean" text |
TokenErrorRecorder |
getTokenErrorRecorder()
Gets the recorder used to record errors or anomalies found while tokenizing during a parse |
void |
setConcealedHtmlCount(int concealedHtmlCount)
Sets the number of occurrances of concealed HTML. |
void |
setFalseAnchorCount(int falseAnchorCount)
Sets the count of anchor tags whose text was URL text (eg http://...) but did not match the href attribute . |
void |
setHtmlAsText(String parsedHtml)
Sets the parsed html. |
void |
setImageCount(int imageCount)
Sets the number of images in the email body. |
void |
setMessageTokens(String[] htmlTokens)
Sets the message tokens obtained from tokenization. |
void |
setPorts(List ports)
Sets the list of TCP ports found appended to URLs in the HTML body of the message. |
void |
setSrcCgiCount(int srcCgiCount)
Sets the number of occurrances of SRC cgi references. |
void |
setSrcPortCount(int srcPortCount)
Sets the list of anchor or image src (or href) attributes which had alternate TCP ports appended. |
void |
setTextParsed(String textParsed)
Sets the parsed (cleaned) text resulting from the message parse. |
Methods inherited from class java.lang.Object |
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
public StandardParserData()
Method Detail |
public String[] getMessageTokens()
ParserData
getMessageTokens
in interface ParserData
public void setMessageTokens(String[] htmlTokens)
htmlTokens
- EmailTokenizer
public String getHtmlAsText()
ParserData
getHtmlAsText
in interface ParserData
public void setHtmlAsText(String parsedHtml)
parsedHtml
- public int getConcealedHtmlCount()
public void setConcealedHtmlCount(int concealedHtmlCount)
concealedHtmlCount
- The concealedHtmlCount to set.public int getImageCount()
public void setImageCount(int imageCount)
imageCount
- The imageCount to set.public int getSrcCgiCount()
That is, occurrances of HTML tags where a SRC (or other remote reference) which would normally be expected to be a flat file (eg the IMG tag) was found to reference a cgi script or similar. This often indicates the presence of mail bugs
public void setSrcCgiCount(int srcCgiCount)
srcCgiCount
- The srcCgiCount to set.getSrcCgiCount()
public String getTextParsed()
ParserData
getTextParsed
in interface ParserData
public void setTextParsed(String textParsed)
textParsed
- public List getPorts()
public void setPorts(List ports)
ports
- A list of String objectspublic int getSrcPortCount()
public void setSrcPortCount(int srcPortCount)
srcPortCount
- The number of occurrances.public int getFalseAnchorCount()
public void setFalseAnchorCount(int falseAnchorCount)
falseAnchorCount
- The number of occurrances.public int getObfuscatedCharacterCount()
public TokenErrorRecorder getTokenErrorRecorder()
ParserData
getTokenErrorRecorder
in interface ParserData
|
|||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |