linkcheck.parser.sitemap

Main functions for link parsing

Functions

parse_sitemap(url_data)

Parse XML sitemap data.

parse_sitemapindex(url_data)

Parse XML sitemap index data.

Classes

XmlTagUrlParser(tag)

Parse XML files and find URLs in text content of a tag name.

class linkcheck.parser.sitemap.XmlTagUrlParser(tag)[source]

Bases: object

Parse XML files and find URLs in text content of a tag name.

Initialize the parser.

add_url()[source]

Add non-empty URLs to the queue.

char_data(data)[source]

If inside the wanted tag, append data to URL.

end_element(name)[source]

If end tag is our tag, call add_url().

parse(url_data)[source]

Parse XML URL data.

start_element(name, attrs)[source]

Set tag status for start element.

linkcheck.parser.sitemap.parse_sitemap(url_data)[source]

Parse XML sitemap data.

linkcheck.parser.sitemap.parse_sitemapindex(url_data)[source]

Parse XML sitemap index data.