# PDFPlumberParser

> **Class** in `langchain_community`

📖 [View in docs](https://reference.langchain.com/python/langchain-community/document_loaders/parsers/pdf/PDFPlumberParser)

Parse PDF files with `pdfplumber`.

## Signature

```python
PDFPlumberParser(
    text_kwargs: Optional[Mapping[str, Any]] = None,
    dedupe: bool = False,
    extract_images: bool = False,
)
```

## Parameters

| Name | Type | Required | Description |
|------|------|----------|-------------|
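| `text_kwargs` | `Optional[Mapping[str, Any]]` | No | Keyword arguments passed to `pdfplumber.Page.extract_text()` (default: `None`) |
| `dedupe` | `bool` | No | If `True`, removes duplicate characters before text extraction, avoiding doubled characters in the output (default: `False`) |
| `extract_images` | `bool` | No | Whether to extract images from the PDF (default: `False`) |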

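As a minimal sketch of how the parameters above might be combined, the helper below builds a parser with custom extraction settings. `make_parser` is a hypothetical name introduced for illustration and is not part of the library; the import path is taken from the source link on this page, and `x_tolerance` / `y_tolerance` are `pdfplumber` text-extraction options shown only as example values.

```python
from typing import Any, Mapping, Optional


def make_parser(
    text_kwargs: Optional[Mapping[str, Any]] = None,
    dedupe: bool = False,
    extract_images: bool = False,
):
    """Build a PDFPlumberParser (hypothetical helper, not part of the library)."""
    # Imported lazily so this module still loads when the optional
    # langchain-community / pdfplumber dependencies are not installed.
    from langchain_community.document_loaders.parsers.pdf import PDFPlumberParser

    return PDFPlumberParser(
        text_kwargs=text_kwargs,
        dedupe=dedupe,
        extract_images=extract_images,
    )


# Example call (requires langchain-community and pdfplumber):
# parser = make_parser(text_kwargs={"x_tolerance": 2, "y_tolerance": 2}, dedupe=True)
```

Leaving `text_kwargs` as `None` keeps the default `pdfplumber` extraction behavior, so the options only need to be set when the default spacing or duplicate handling produces poor text.
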
## Extends

- `BaseBlobParser`

## Constructors

```python
__init__(
    self,
    text_kwargs: Optional[Mapping[str, Any]] = None,
    dedupe: bool = False,
    extract_images: bool = False,
) -> None
```

| Name | Type |
|------|------|
| `text_kwargs` | `Optional[Mapping[str, Any]]` |
| `dedupe` | `bool` |
| `extract_images` | `bool` |


## Properties

- `text_kwargs`
- `dedupe`
- `extract_images`

## Methods

- [`lazy_parse()`](https://reference.langchain.com/python/langchain-community/document_loaders/parsers/pdf/PDFPlumberParser/lazy_parse)

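A possible way to use `lazy_parse()` is sketched below, assuming the parser receives a `Blob` and yields `Document` objects whose text is in `page_content`, as is usual for `BaseBlobParser` subclasses. `iter_page_texts` and the file name in the usage comment are hypothetical and serve only as illustration.

```python
from typing import Iterator


def iter_page_texts(pdf_path: str, dedupe: bool = False) -> Iterator[str]:
    """Yield the extracted text of each parsed document (hypothetical helper)."""
    # Imported lazily so the optional dependencies are only needed
    # once iteration actually starts.
    from langchain_community.document_loaders.blob_loaders import Blob
    from langchain_community.document_loaders.parsers.pdf import PDFPlumberParser

    parser = PDFPlumberParser(dedupe=dedupe)
    blob = Blob.from_path(pdf_path)

    # lazy_parse() returns an iterator, so documents are produced one at a
    # time instead of loading the whole result into memory up front.
    for document in parser.lazy_parse(blob):
        yield document.page_content


# Example usage (requires langchain-community, pdfplumber, and a real file):
# for text in iter_page_texts("example.pdf", dedupe=True):
#     print(text[:200])
```
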
---

[View source on GitHub](https://github.com/langchain-ai/langchain-community/blob/a6a6079511ac8a5c1293337f88096b8641562e77/libs/community/langchain_community/document_loaders/parsers/pdf.py#L1394)