Class OdfTextExtractor
java.lang.Object
org.odftoolkit.odfdom.dom.DefaultElementVisitor
org.odftoolkit.odfdom.incubator.doc.text.OdfTextExtractor
- All Implemented Interfaces:
ElementVisitor
- Direct Known Subclasses:
OdfEditableTextExtractor
It's a sub class of DefaultElementVisitor. It provides a method to get the display text of a
single element.
If you pass the content root as the parameter, the whole document content will be returned, without any tag information.
It implements part of white space handling fuctions: text:p, text:h, text:s, text:tab, text:linebreak are processed according to ODF specification.
-
Field Summary
FieldsModifier and TypeFieldDescriptionprotected StringBuilder
protected static final char
protected static final char
-
Constructor Summary
ConstructorsModifierConstructorDescriptionprotected
Default constructorprotected
OdfTextExtractor
(OdfElement element) Constructor with an ODF element as paramter -
Method Summary
Modifier and TypeMethodDescriptionprotected void
Append the text content of this element to string buffer.getText()
Return the text content as a stringstatic OdfTextExtractor
newOdfTextExtractor
(OdfElement element) An instance of OdfTextExtractor will be created to extract the text content of an ODF element.void
visit
(TextHElement ele) A method to visit "text:h" elementsvoid
A method to visit "text:line-break" elementsvoid
visit
(TextPElement ele) A method to visit "text:p" elementsvoid
visit
(TextSElement ele) A method to visit "text:s" elementsvoid
visit
(TextTabElement ele) A method to visit "text:tab" elementsvoid
visit
(OdfElement element) Visit odf element and process some operationsMethods inherited from class org.odftoolkit.odfdom.dom.DefaultElementVisitor
visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit
-
Field Details
-
mTextBuilder
-
NewLineChar
protected static final char NewLineChar- See Also:
-
TabChar
protected static final char TabChar- See Also:
-
-
Constructor Details
-
OdfTextExtractor
protected OdfTextExtractor()Default constructor -
OdfTextExtractor
Constructor with an ODF element as paramter- Parameters:
element
- the ODF element whose text would be extracted.
-
-
Method Details
-
appendElementText
Append the text content of this element to string buffer.- Parameters:
ele
- the ODF element whose text will be appended.
-
newOdfTextExtractor
An instance of OdfTextExtractor will be created to extract the text content of an ODF element.- Parameters:
element
- the ODF element whose text will be extracted.- Returns:
- An instance of OdfTextExtractor
-
visit
Description copied from interface:ElementVisitor
Visit odf element and process some operations- Specified by:
visit
in interfaceElementVisitor
- Specified by:
visit
in classDefaultElementVisitor
- Parameters:
element
- ODF element
-
visit
Description copied from class:DefaultElementVisitor
A method to visit "text:p" elements- Overrides:
visit
in classDefaultElementVisitor
- Parameters:
ele
- - an instance of TextPElement
-
visit
Description copied from class:DefaultElementVisitor
A method to visit "text:h" elements- Overrides:
visit
in classDefaultElementVisitor
- Parameters:
ele
- - an instance of TextHElement
-
visit
Description copied from class:DefaultElementVisitor
A method to visit "text:s" elements- Overrides:
visit
in classDefaultElementVisitor
- Parameters:
ele
- - an instance of TextSElement
-
visit
Description copied from class:DefaultElementVisitor
A method to visit "text:tab" elements- Overrides:
visit
in classDefaultElementVisitor
- Parameters:
ele
- - an instance of TextTabElement
-
visit
Description copied from class:DefaultElementVisitor
A method to visit "text:line-break" elements- Overrides:
visit
in classDefaultElementVisitor
- Parameters:
ele
- - an instance of TextLineBreakElement
-
getText
Return the text content as a string- Returns:
- the text content as a string
-