Module contains a PDF parser based on Document AI from Google Cloud.
You need to install two libraries to use this parser:
pip install google-cloud-documentai
pip install google-cloud-documentai-toolbox
Returns a client info object with a custom user agent header.
A dataclass to store Document AI parsing results.
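As a rough illustration of what such a results container can look like, the sketch below defines a small dataclass that pairs an input location with the location of its parsed output. The class name `ParsingResultSketch` and the fields `source_path` and `parsed_path` are assumptions made for this example, not a statement of the module's actual definition.

```python
from dataclasses import dataclass


@dataclass
class ParsingResultSketch:
    """Hypothetical stand-in for a Document AI parsing result record."""

    source_path: str  # assumed field: location of the input document
    parsed_path: str  # assumed field: location where the parsed output was written


# Example record; the bucket and paths are placeholders.
result = ParsingResultSketch(
    source_path="gs://example-bucket/input/report.pdf",
    parsed_path="gs://example-bucket/output/report/",
)
print(result.source_path, "->", result.parsed_path)
```

A dataclass suits this role because it provides a constructor, a readable repr, and equality comparison without extra code.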
Google Cloud Document AI parser.
Inherits from BaseBlobParser.
Parses documents using Google Cloud Document AI for text extraction and layout analysis.
See the Document AI documentation for detailed information.
Requires additional dependencies:
pip install langchain-google-community[docai]
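A minimal usage sketch follows, assuming the parser is exposed as `DocAIParser` in `langchain_google_community` and accepts the keyword arguments `location`, `processor_name`, and `gcs_output_path`; check these against the installed version. The helpers `processor_resource_name`, `build_parser`, and `parse_pdf` are hypothetical names introduced only for this example, and the third-party imports are deferred so the block loads without the optional dependencies.

```python
def processor_resource_name(project: str, location: str, processor_id: str) -> str:
    """Build a Document AI processor resource name (hypothetical helper)."""
    return f"projects/{project}/locations/{location}/processors/{processor_id}"


def build_parser(project: str, location: str, processor_id: str, gcs_output_path: str):
    """Create a parser instance; the keyword arguments are assumed, not confirmed."""
    # Deferred import: requires `pip install langchain-google-community[docai]`.
    from langchain_google_community import DocAIParser

    return DocAIParser(
        location=location,
        processor_name=processor_resource_name(project, location, processor_id),
        gcs_output_path=gcs_output_path,
    )


def parse_pdf(parser, gcs_uri: str) -> list:
    """Parse one PDF stored in Cloud Storage and return the resulting documents."""
    # Deferred import: Blob wraps a reference to the raw file for blob parsers.
    from langchain_core.documents.base import Blob

    return list(parser.lazy_parse(Blob(path=gcs_uri)))


# Building the resource name needs no cloud access; all values are placeholders.
name = processor_resource_name("my-project", "us", "abc123")
print(name)
```

Calling `build_parser` and `parse_pdf` requires Google Cloud credentials, an existing processor, and a writable output bucket, so they are defined here but not invoked.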