| Name | Type | Description |
|---|---|---|
| handlers* | Mapping[str, BaseBlobParser] | |
| fallback_parser | Optional[BaseBlobParser] | Default: None |
| Name | Type |
|---|---|
| handlers | Mapping[str, BaseBlobParser] |
| fallback_parser | Optional[BaseBlobParser] |
Parser that uses mime-types to parse a blob.
This parser is useful for simple pipelines where the mime-type is sufficient to determine how to parse a blob.
To use, configure handlers based on mime-types and pass them to the initializer.
Example:

.. code-block:: python

    from langchain_community.document_loaders.parsers.generic import MimeTypeBasedParser

    parser = MimeTypeBasedParser(
        handlers={
            "application/pdf": ...,
        },
        fallback_parser=...,
    )

handlers: A mapping from mime-types to parsers that take a blob, parse it, and return a document.
fallback_parser: A fallback parser to use if the mime-type is not found in the handlers. If provided, it is used to parse blobs of every mime-type not found in the handlers. If not provided, a ValueError is raised when the mime-type is not found in the handlers.
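
The dispatch rule described above (use the registered handler, else the fallback, else raise a ValueError) can be illustrated with a minimal standard-library sketch. This is not the library's implementation: `Blob`, `Parser`, and `parse_by_mimetype` are hypothetical stand-ins introduced here, and documents are represented as plain strings to keep the example self-contained.

```python
from dataclasses import dataclass
from typing import Callable, List, Mapping, Optional


@dataclass
class Blob:
    """Hypothetical stand-in for a blob: raw text plus its mime-type."""

    data: str
    mimetype: Optional[str] = None


# Hypothetical stand-in for a parser: takes a blob, returns documents (strings here).
Parser = Callable[[Blob], List[str]]


def parse_by_mimetype(
    blob: Blob,
    handlers: Mapping[str, Parser],
    fallback_parser: Optional[Parser] = None,
) -> List[str]:
    """Pick a parser by mime-type, mirroring the documented fallback rule."""
    if blob.mimetype in handlers:
        # A handler is registered for this mime-type: use it.
        return handlers[blob.mimetype](blob)
    if fallback_parser is not None:
        # No handler matched, but a fallback was provided.
        return fallback_parser(blob)
    # No handler and no fallback: the mime-type is unsupported.
    raise ValueError(f"Unsupported mime type: {blob.mimetype}")


# Illustrative handlers only; real handlers would be blob parsers.
handlers = {"text/plain": lambda blob: [blob.data.upper()]}
plain_docs = parse_by_mimetype(Blob("hello", "text/plain"), handlers)
other_docs = parse_by_mimetype(
    Blob("raw", "application/pdf"), handlers, fallback_parser=lambda blob: [blob.data]
)
```

In this sketch, calling `parse_by_mimetype` on a blob whose mime-type has no handler and without a `fallback_parser` raises a ValueError, which matches the behavior described for the real parser.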
Load documents from a blob.