|Modifier and Type||Method and Description|
Iterator<URL> getEmbeddedResourceURLs(String userAgent, byte responseData, URL baseUrl, String encoding) throws LinkExtractorParseException
URLs should not appear twice in the returned iterator.
Malformed URLs can be reported to the caller by having the Iterator return the corresponding RL String. Overall problems parsing the html should be reported by throwing an HTMLParseException.
userAgent- User Agent
responseData- Response data
baseUrl- Base URL from which the HTML code was obtained
LinkExtractorParseException- when extracting the links fails
Copyright © 1998-2021 Apache Software Foundation. All Rights Reserved.