BibtexLoader(
    self,
    file_path: str,
    *,
    parser: Optional[BibtexparserWrapper] = None,
    max_docs: Optional[int] = None,
    max_content_chars: Optional[int] = 4000,
    load_extra_metadata: bool = False,
    file_pattern: str = '[^:]+\\.pdf',
)

| Name | Type | Description |
|---|---|---|
| file_path* | str | Path to the bibtex file. |
| parser | Optional[BibtexparserWrapper] | Default: None. The parser to use. If None, a default parser is used. |
| max_docs | Optional[int] | Default: None |
| max_content_chars | Optional[int] | Default: 4000 |
| load_extra_metadata | bool | Default: False |
| file_pattern | str | Default: '[^:]+\\.pdf' |

Load the bibtex file using bibtexparser and return the article texts plus the article metadata. See https://bibtexparser.readthedocs.io/en/master/
Load a bibtex file.
Each document represents one entry from the bibtex file.
If an entry's `file` field points to a PDF file, the original PDF
is loaded into the document text. If no such file entry is present,
the `abstract` field is used instead.
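
The fallback described above can be sketched in plain Python. This is a minimal illustration, not the loader's actual implementation: `choose_content` is a hypothetical helper, `pdf_text` stands in for whatever text was extracted from the attached PDF, and the way the `max_content_chars` limit is applied is an assumption based on the parameter description.

```python
from typing import Optional


def choose_content(
    entry: dict,
    pdf_text: Optional[str],
    max_content_chars: Optional[int] = 4000,
) -> str:
    """Pick the text for one bibtex entry (hypothetical helper, not library code)."""
    # Prefer the text extracted from the attached PDF; if there is none,
    # fall back to the entry's abstract (or an empty string).
    content = pdf_text or entry.get("abstract", "")
    # Assumed behaviour: truncate only when a limit is set.
    if max_content_chars:
        content = content[:max_content_chars]
    return content


entry = {"title": "An Example", "abstract": "A short abstract."}
print(choose_content(entry, pdf_text=None))  # A short abstract.
print(choose_content(entry, pdf_text="Full PDF text ...", max_content_chars=8))  # Full PDF
```

The helper takes the already-extracted PDF text as an argument so that the fallback rule can be read and tested without a PDF library installed.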
max_docs: Maximum number of associated documents to load. Use -1 for no limit.
max_content_chars: Maximum number of characters to load from the PDF.
load_extra_metadata: Whether to load extra metadata from the PDF.
file_pattern: Regex pattern to match the file name in the bibtex entry's file field.
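
To see what the default `file_pattern` matches, the regex can be tried on its own with the standard `re` module. The `file` field value below is a made-up example in an assumed "label:path:mime-type" layout, and using `findall` is an illustrative choice rather than a statement about how the loader applies the pattern.

```python
import re

# Default pattern from the parameter table: one or more non-colon
# characters followed by ".pdf".
FILE_PATTERN = re.compile(r"[^:]+\.pdf")

# Hypothetical `file` field with two attachments separated by ";",
# each written as "label:path:mime-type".
file_field = (
    "Full Text PDF:files/12/smith2020.pdf:application/pdf;"
    "Snapshot:files/13/index.html:text/html"
)

pdf_paths = FILE_PATTERN.findall(file_field)
print(pdf_paths)  # ['files/12/smith2020.pdf']
```

Because the pattern excludes colons, the label before the path and the MIME type after it are left out of the match, and the HTML snapshot is ignored.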