Package com.simplicite.util.tools
Class DocumentParser
java.lang.Object
    com.simplicite.util.tools.DocumentParser
public class DocumentParser extends java.lang.Object
Document content parser, currently based on Apache Tika.
Constructor Summary
Constructor         Description
DocumentParser()
Method Summary
Modifier and Type         Method                                       Description
static java.lang.String   parse(DocumentDB doc)                        Parse document as text
static java.lang.String   parse(java.io.File file)                     Parse document as text
static java.lang.String   parse(java.lang.String path)                 Parse document as text
static java.lang.String   parse(java.lang.String path, byte[] data)    Parse document as text
Method Detail
-
parse
public static java.lang.String parse(DocumentDB doc)
Parse document as text
Parameters:
doc - Document
Returns:
Extracted text
-
parse
public static java.lang.String parse(java.lang.String path)
Parse document as text
Parameters:
path - Document path (if relative, the local doc dir is used as the base directory)
Returns:
Extracted text
-
parse
public static java.lang.String parse(java.io.File file)
Parse document as text
Parameters:
file - Document file
Returns:
Extracted text
-
parse
public static java.lang.String parse(java.lang.String path, byte[] data)
Parse document as text
Parameters:
path - Document path
data - Document data
Returns:
Extracted text
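Usage sketch (illustrative, not part of the documented API):
The following sketch shows one way the text returned by a parse overload might be post-processed, for example to build a short preview. The class ParsePreview and the method preview are hypothetical names introduced here. So that the sketch runs without the Simplicité libraries, the extractor is passed in as a function; inside a Simplicité module one would presumably pass DocumentParser::parse instead, assuming parse(java.io.File) declares no checked exceptions, as the signature above suggests.

```java
import java.io.File;
import java.util.function.Function;

public class ParsePreview {
    /**
     * Hypothetical helper: extracts text from a file with the given extractor,
     * collapses runs of whitespace, and truncates to at most maxChars characters.
     */
    static String preview(File file, Function<File, String> extractor, int maxChars) {
        String text = extractor.apply(file);
        if (text == null) {
            return "";
        }
        String normalized = text.trim().replaceAll("\\s+", " ");
        return normalized.length() <= maxChars
            ? normalized
            : normalized.substring(0, maxChars);
    }

    public static void main(String[] args) {
        // Stand-in extractor so the sketch runs on its own.
        // In a Simplicité module this would presumably be DocumentParser::parse (assumption).
        Function<File, String> extractor =
            f -> "  Hello\n\n  world from " + f.getName() + "  ";
        System.out.println(preview(new File("report.pdf"), extractor, 20));
    }
}
```

Taking the extractor as a parameter keeps the helper independent of which parse overload, or which parsing backend, supplies the text.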