public class ooxmlParser extends AbstractParser implements Parser
Parser.Failure
Modifier and Type | Field and Description |
---|---|
private static java.lang.ThreadLocal<javax.xml.parsers.SAXParser> |
tlSax |
log, SUPPORTED_EXTENSIONS, SUPPORTED_MIME_TYPES
Constructor and Description |
---|
ooxmlParser() |
Modifier and Type | Method and Description |
---|---|
private static javax.xml.parsers.SAXParser |
getParser() |
private Document[] |
parse(DigestURL location,
java.lang.String mimeType,
java.lang.String charset,
java.io.File dest) |
Document[] |
parse(DigestURL location,
java.lang.String mimeType,
java.lang.String charset,
VocabularyScraper scraper,
int timezoneOffset,
java.io.InputStream source)
parse an input stream
|
equals, getName, hashCode, singleList, supportedExtensions, supportedMimeTypes
clone, finalize, getClass, notify, notifyAll, toString, wait, wait, wait
equals, getName, hashCode, supportedExtensions, supportedMimeTypes
private static javax.xml.parsers.SAXParser getParser() throws org.xml.sax.SAXException
org.xml.sax.SAXException
private Document[] parse(DigestURL location, java.lang.String mimeType, java.lang.String charset, java.io.File dest) throws Parser.Failure, java.lang.InterruptedException
Parser.Failure
java.lang.InterruptedException
public Document[] parse(DigestURL location, java.lang.String mimeType, java.lang.String charset, VocabularyScraper scraper, int timezoneOffset, java.io.InputStream source) throws Parser.Failure, java.lang.InterruptedException
Parser
parse
in interface Parser
location
- the url of the sourcemimeType
- the mime type of the source, if knowncharset
- the charset of the source, if knownscraper
- an entity scraper to detect facets from text annotation contextsource
- a input streamParser.Failure
java.lang.InterruptedException