linkcheck.cache.urlqueue
Handle a queue of URLs to check.
Classes
UrlQueue: A queue supporting several consumer tasks.
Exceptions
Exception raised by get().
Raised by join().
- class linkcheck.cache.urlqueue.UrlQueue(max_allowed_urls=None)[source]
Bases: object
A queue supporting several consumer tasks. The task_done() idea is from the Python 2.5 implementation of Queue.Queue().
Initialize the queue state and task counters.
- empty()[source]
Return True if the queue is empty, False otherwise. The call itself is thread-safe, but the result is not reliable, since another thread may change the queue before the result is returned.
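The caveat above matters for any code that tests empty() and then acts on the answer. The following is a minimal sketch of the safer pattern, which attempts the retrieval directly and handles failure. It uses the standard library's queue.Queue as a stand-in, not UrlQueue itself; the drain() helper and the example URLs are hypothetical.

```python
import queue


def drain(q):
    """Collect items until the queue reports that nothing is left."""
    items = []
    while True:
        try:
            # Attempt the retrieval directly instead of testing empty() first:
            # another thread could take the last item between test and get.
            items.append(q.get_nowait())
        except queue.Empty:
            return items


# Stand-in queue from the standard library (not linkcheck's UrlQueue).
q = queue.Queue()
for url in ("http://example.com/a", "http://example.com/b"):
    q.put(url)

drained = drain(q)
```

Note that UrlQueue.get() is documented differently from queue.Queue.get_nowait(), so the exception handling shown here would need adapting.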
- get(timeout=None)[source]
Get the first not-in-progress URL from the queue and return it. If no such URL is available, return None.
- join(timeout=None)[source]
Blocks until all items in the Queue have been gotten and processed.
The count of unfinished tasks goes up whenever an item is added to the queue. The count goes down whenever a consumer thread calls task_done() to indicate the item was retrieved and all work on it is complete.
When the count of unfinished tasks drops to zero, join() unblocks.
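The counting scheme described for join() can be sketched with a condition variable. This is an illustration under stated assumptions, not the linkcheck implementation: the TaskCounter class and its method names are hypothetical, and this sketch reports an expired timeout by returning False, whereas the module documents a dedicated exception raised by join().

```python
import threading


class TaskCounter:
    """Hypothetical helper (not part of linkcheck): counts unfinished tasks."""

    def __init__(self):
        self._cond = threading.Condition()
        self._unfinished = 0

    def added(self):
        """Record that one item was added to the queue."""
        with self._cond:
            self._unfinished += 1

    def done(self):
        """Record that work on one item is complete."""
        with self._cond:
            if self._unfinished <= 0:
                raise ValueError("done() called too many times")
            self._unfinished -= 1
            if self._unfinished == 0:
                # Wake every thread blocked in join().
                self._cond.notify_all()

    def join(self, timeout=None):
        """Return True once the count is zero, False if the timeout expired."""
        with self._cond:
            return self._cond.wait_for(lambda: self._unfinished == 0, timeout)


counter = TaskCounter()
counter.added()
timed_out = not counter.join(timeout=0.01)  # one task is still pending
counter.done()
finished = counter.join(timeout=0.01)       # count is zero, returns at once
```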
- task_done(url_data)[source]
Indicate that a formerly enqueued task is complete.
Used by Queue consumer threads. For each get() used to fetch a task, a subsequent call to task_done() tells the queue that the processing on the task is complete.
If a join() is currently blocking, it will resume when all items have been processed (meaning that a task_done() call was received for every item that had been put() into the queue).
Raises a ValueError if called more times than there were items placed in the queue.
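Since the class description credits the standard library's Queue for the task_done() idea, the consumer protocol can be illustrated with queue.Queue. This is a sketch of the general pattern only: the stdlib task_done() takes no argument, whereas UrlQueue.task_done() expects url_data, and the worker function, sentinel value, and item names below are assumptions made for the example.

```python
import queue
import threading

q = queue.Queue()
results = []
results_lock = threading.Lock()


def worker():
    """Consume items until a None sentinel arrives."""
    while True:
        item = q.get()
        if item is None:
            q.task_done()  # the sentinel counts as an item too
            return
        try:
            with results_lock:
                results.append(item.upper())
        finally:
            # Exactly one task_done() per successful get().
            q.task_done()


threads = [threading.Thread(target=worker) for _ in range(2)]
for t in threads:
    t.start()

for name in ("a", "b", "c"):
    q.put(name)
for _ in threads:
    q.put(None)  # one sentinel per worker

q.join()  # blocks until every put() has a matching task_done()
for t in threads:
    t.join()

# One call too many: the unfinished-task count would drop below zero.
try:
    q.task_done()
    extra_call_error = None
except ValueError as exc:
    extra_call_error = exc
```

Calling task_done() in a finally clause keeps the count accurate even when processing an item raises, so a blocked join() is not left waiting indefinitely.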