| Name | Type | Description |
|---|---|---|
urls | Optional[Sequence[str]] | Default: NoneURLs for RSS feeds to load. Each articles in the feed is loaded into its own document. |
opml | Optional[str] | Default: NoneOPML file to load feed urls from. Only one of urls or opml should be provided. The value |
continue_on_failure | bool | Default: True |
show_progress_bar | bool | Default: False |
**newsloader_kwargs | Any | Default: {} |
Load news articles from RSS feeds using Unstructured.
Example:
.. code-block:: python
from langchain_community.document_loaders import RSSFeedLoader
loader = RSSFeedLoader(
urls=["
The loader uses feedparser to parse RSS feeds. The feedparser library is not installed by default so you should install it if using this loader: https://pythonhosted.org/feedparser/
If you use OPML, you should also install listparser: https://pythonhosted.org/listparser/
Finally, newspaper is used to process each article: https://newspaper.readthedocs.io/en/latest/
If True, continue loading documents even if loading fails for a particular URL.
If True, use tqdm to show a loading progress bar. Requires
tqdm to be installed, pip install tqdm.
Any additional named arguments to pass to NewsURLLoader.