# BS4HTMLParser

> **Class** in `langchain_community`

📖 [View in docs](https://reference.langchain.com/python/langchain-community/document_loaders/parsers/html/bs4/BS4HTMLParser)

Parse HTML files using `Beautiful Soup`.

## Signature

```python
BS4HTMLParser(
    self,
    *,
    features: str = 'lxml',
    get_text_separator: str = '',
    **kwargs: Any,
)
```

## Extends

- `BaseBlobParser`

## Constructors

```python
__init__(
    self,
    *,
    features: str = 'lxml',
    get_text_separator: str = '',
    **kwargs: Any,
) -> None
```

| Name | Type | Default |
|------|------|---------|
| `features` | `str` | `'lxml'` |
| `get_text_separator` | `str` | `''` |
| `**kwargs` | `Any` | |

## Properties

- `bs_kwargs`
- `get_text_separator`

## Methods

- [`lazy_parse()`](https://reference.langchain.com/python/langchain-community/document_loaders/parsers/html/bs4/BS4HTMLParser/lazy_parse)

---

[View source on GitHub](https://github.com/langchain-ai/langchain-community/blob/4b280287bd55b99b44db2dd849f02d66c89534d5/libs/community/langchain_community/document_loaders/parsers/html/bs4.py#L14)