public class sitemapParser extends AbstractParser implements Parser
Modifier and Type | Class and Description |
---|---|
static class |
sitemapParser.SitemapEntry |
static class |
sitemapParser.SitemapReader
for schemas see:
http://www.sitemaps.org/schemas/sitemap/0.9
http://www.google.com/schemas/sitemap/0.84
|
static class |
sitemapParser.URLEntry |
Parser.Failure
Modifier and Type | Field and Description |
---|---|
static sitemapParser.URLEntry |
POISON_URLEntry |
log, SUPPORTED_EXTENSIONS, SUPPORTED_MIME_TYPES
Constructor and Description |
---|
sitemapParser() |
Modifier and Type | Method and Description |
---|---|
static sitemapParser.SitemapReader |
parse(DigestURL sitemapURL,
ClientIdentification.Agent agent) |
Document[] |
parse(DigestURL location,
java.lang.String mimeType,
java.lang.String charset,
VocabularyScraper scraper,
int timezoneOffset,
java.io.InputStream source)
parse an input stream
|
private static java.lang.String |
val(org.w3c.dom.Element parent,
java.lang.String label,
java.lang.String dflt) |
equals, getName, hashCode, singleList, supportedExtensions, supportedMimeTypes
clone, finalize, getClass, notify, notifyAll, toString, wait, wait, wait
equals, getName, hashCode, supportedExtensions, supportedMimeTypes
public static final sitemapParser.URLEntry POISON_URLEntry
public Document[] parse(DigestURL location, java.lang.String mimeType, java.lang.String charset, VocabularyScraper scraper, int timezoneOffset, java.io.InputStream source) throws Parser.Failure, java.lang.InterruptedException
Parser
parse
in interface Parser
location
- the url of the sourcemimeType
- the mime type of the source, if knowncharset
- the charset of the source, if knownscraper
- an entity scraper to detect facets from text annotation contextsource
- a input streamParser.Failure
java.lang.InterruptedException
public static sitemapParser.SitemapReader parse(DigestURL sitemapURL, ClientIdentification.Agent agent) throws java.io.IOException
java.io.IOException
private static java.lang.String val(org.w3c.dom.Element parent, java.lang.String label, java.lang.String dflt)