# DocusaurusLoader

> **Class** in `langchain_community`

📖 [View in docs](https://reference.langchain.com/python/langchain-community/document_loaders/docusaurus/DocusaurusLoader)

Load from Docusaurus Documentation.

It leverages the SitemapLoader to loop through the generated pages of a
Docusaurus Documentation website and extracts the content by looking for specific
HTML tags. By default, the parser searches for the main content of the Docusaurus
page, which is normally the <article>. You can also define your own
custom HTML tags by providing them as a list, for example: ["div", ".main", "a"].

## Signature

```python
DocusaurusLoader(
    self,
    url: str,
    custom_html_tags: Optional[List[str]] = None,
    **kwargs: Any = {},
)
```

## Parameters

| Name | Type | Required | Description |
|------|------|----------|-------------|
| `url` | `str` | Yes | The base URL of the Docusaurus website. |
| `custom_html_tags` | `Optional[List[str]]` | No | Optional custom html tags to extract content from pages. (default: `None`) |
| `kwargs` | `Any` | No | Additional args to extend the underlying SitemapLoader, for example: filter_urls, blocksize, meta_function, is_local, continue_on_failure (default: `{}`) |

## Extends

- `SitemapLoader`

## Constructors

```python
__init__(
    self,
    url: str,
    custom_html_tags: Optional[List[str]] = None,
    **kwargs: Any = {},
)
```

| Name | Type |
|------|------|
| `url` | `str` |
| `custom_html_tags` | `Optional[List[str]]` |


## Properties

- `custom_html_tags`

---

[View source on GitHub](https://github.com/langchain-ai/langchain-community/blob/4b280287bd55b99b44db2dd849f02d66c89534d5/libs/community/langchain_community/document_loaders/docusaurus.py#L8)