A class that extends the BufferLoader class. It represents a document
loader that loads documents from PDF files.
class PDFLoaderBufferLoaderconst loader = new PDFLoader("path/to/bitcoin.pdf");
const docs = await loader.load();
console.log({ docs });A method that loads the text file or blob and returns a promise that
resolves to an array of Document instances. It reads the text from
the file or blob using the readFile function from the
node:fs/promises module or the text() method of the blob. It then
parses the text using the parse() method and creates a Document
instance for each parsed page. The metadata includes the source of the
text (file path or blob) and, if there are multiple pages, the line
number of each page.
A method that takes a raw buffer and metadata as parameters and
returns a promise that resolves to an array of Document instances. It
uses the extractRawText function from the mammoth module or
extract method from the word-extractor module to extract
the raw text content from the buffer. If the extracted text content is
empty, it returns an empty array. Otherwise, it creates a new
Document instance with the extracted text content and the provided
metadata, and returns it as an array.
A static method that imports the readFile function from the
node:fs/promises module. It is used to dynamically import the
function when needed. If the import fails, it throws an error
indicating that the fs/promises module is not available in the
current environment.