langchain.js

    IMSDBLoader extends the CheerioWebBaseLoader class to load web pages from the IMSDB (Internet Movie Script Database) website.

    Properties

    caller: AsyncCaller
    headers?: HeadersInit
    selector?: SelectorType
    textDecoder?: TextDecoder
    timeout: number
    webPath: string

    Methods

    • Asynchronously loads the web page using the scrape() method inherited from the base class. It uses the $ function provided by Cheerio to select the element with the class 'scrtext' and extracts its text content, then creates a single Document instance with that text as the page content and the source as metadata.

      Returns Promise<Document[]>

      A Promise that resolves to an array containing the single Document instance.
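
The steps described above can be sketched without the library. This is only an illustration under stated assumptions, not LangChain's implementation: `ScriptDocument`, `QueryFn`, `toDocuments`, and `fakeQuery` are hypothetical stand-ins for `Document`, the Cheerio `$` function, the body of the loading method, and a scraped page. Only the 'scrtext' class and the "text as page content, source as metadata" shape come from the description.

```typescript
// Hypothetical stand-in for LangChain's Document: text plus metadata.
interface ScriptDocument {
  pageContent: string;
  metadata: { source: string };
}

// Hypothetical stand-in for the Cheerio `$` function: given a CSS selector,
// it returns an object whose text() yields the matched elements' text.
type QueryFn = (selector: string) => { text(): string };

// Mirrors the described steps: select the element with class "scrtext",
// take its text, and wrap it in a single document tagged with its source.
function toDocuments(query: QueryFn, webPath: string): ScriptDocument[] {
  const text = query(".scrtext").text();
  return [{ pageContent: text, metadata: { source: webPath } }];
}

// A fake query function that only knows ".scrtext", so the sketch runs offline.
const fakeQuery: QueryFn = (selector) => ({
  text: () => (selector === ".scrtext" ? "INT. OFFICE - DAY" : ""),
});

const docs = toDocuments(fakeQuery, "https://example.com/script.html");
for (const doc of docs) {
  console.log(doc.metadata.source, doc.pageContent.length);
}
```

Keeping the selection step behind a small function type makes the extraction logic easy to exercise without a network request.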

    • Fetches the web document at webPath and loads it using Cheerio.

      Returns Promise<CheerioAPI>

      A Promise that resolves to a CheerioAPI instance.

    • Parameters

      • url: string
      • caller: AsyncCaller
      • timeout: undefined | number
      • Optional textDecoder: TextDecoder
      • Optional options: CheerioOptions & { headers?: HeadersInit }

      Returns Promise<CheerioAPI>

    • A static method that dynamically imports the Cheerio library and returns the load function. If the import fails, it throws an error.

      Returns Promise<
          {
              load: (
                  content: string | Buffer<ArrayBufferLike> | AnyNode | AnyNode[],
                  options?: null | CheerioOptions,
                  isDocument?: boolean,
              ) => CheerioAPI;
          },
      >

      A Promise that resolves to an object containing the load function from the Cheerio library.
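
The import-or-throw pattern described above might look roughly like the following. It is a sketch, not the library's code: `importLoad`, the simplified `LoadFn` type, the injected `importer` parameter, and the wording of the error message are all assumptions made so the example runs without Cheerio installed.

```typescript
// Simplified type for the load function; the real one accepts more input
// types and returns a CheerioAPI.
type LoadFn = (content: string) => unknown;

// `importer` is a hypothetical injection point; real code would pass
// something like () => import("cheerio").
async function importLoad(
  importer: () => Promise<{ load: LoadFn }>,
): Promise<{ load: LoadFn }> {
  try {
    const { load } = await importer();
    return { load };
  } catch {
    // Replace the low-level module error with an actionable one.
    // This message text is an assumption, not the library's wording.
    throw new Error("Could not import cheerio. Is it installed as a dependency?");
  }
}

// Simulate both outcomes with fake importers.
importLoad(async () => ({ load: (html: string) => html.length }))
  .then(({ load }) => console.log("loaded:", typeof load));

importLoad(async () => {
  throw new Error("Cannot find module 'cheerio'");
}).catch((err: Error) => console.log("failed:", err.message));
```

Deferring the import until it is needed means the dependency is only required by code paths that actually parse HTML.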

    • Fetches web documents from the given array of URLs and loads each of them using Cheerio.

      Parameters

      • urls: string[]

        An array of URLs to fetch and load.

      • caller: AsyncCaller
      • timeout: undefined | number
      • Optional textDecoder: TextDecoder
      • Optional options: CheerioOptions & { headers?: HeadersInit }

      Returns Promise<CheerioAPI[]>

      A Promise that resolves to an array of CheerioAPI instances.
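
One plausible way to turn an array of URLs into an array of parsed pages is to map each URL to a single-page scrape and await them together. The description above does not say whether requests run concurrently or in sequence, so the use of `Promise.all` here is an assumption. `scrapeEach`, `ScrapeOne`, and `fakeScrape` are hypothetical names, and the caller, timeout, and decoder parameters are omitted for brevity.

```typescript
// Hypothetical stand-in for a single-URL scrape that resolves to a parsed page.
type ScrapeOne<T> = (url: string) => Promise<T>;

// Start every scrape at once and wait for all of them. Results keep the
// order of `urls`, and the first rejection rejects the whole batch.
function scrapeEach<T>(urls: string[], scrapeOne: ScrapeOne<T>): Promise<T[]> {
  return Promise.all(urls.map((url) => scrapeOne(url)));
}

// Fake scraper so the sketch runs offline: it "parses" a URL into a label.
const fakeScrape: ScrapeOne<string> = async (url) => `parsed:${url}`;

scrapeEach(["https://example.com/a", "https://example.com/b"], fakeScrape)
  .then((pages) => console.log(pages.length, "pages scraped"));
```

If partial results matter more than fail-fast behavior, `Promise.allSettled` would be the natural alternative in this sketch.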