LangChain Reference home pageLangChain ReferenceLangChain Reference
  • GitHub
  • Main Docs
Deep Agents
LangChain
LangGraph
Integrations
LangSmith
  • Overview
  • MCP Adapters
    • Overview
    • Agents
    • Callbacks
    • Chains
    • Chat models
    • Embeddings
    • Evaluation
    • Globals
    • Hub
    • Memory
    • Output parsers
    • Retrievers
    • Runnables
    • LangSmith
    • Storage
    Standard Tests
    Text Splitters
    ⌘I

    LangChain Assistant

    Ask a question to get started

    Enter to send•Shift+Enter new line

    Menu

    MCP Adapters
    OverviewAgentsCallbacksChainsChat modelsEmbeddingsEvaluationGlobalsHubMemoryOutput parsersRetrieversRunnablesLangSmithStorage
    Standard Tests
    Text Splitters
    Language
    Theme
    Pythonlangchain-classicdocument_loaders
    Module●Since v1.0

    document_loaders

    Document Loaders are classes to load Documents.

    Document Loaders are usually used to load a lot of Documents in a single run.

    Attributes

    attribute
    DEPRECATED_LOOKUP: dict

    Functions

    function
    create_importer

    Create a function that helps retrieve objects from their new locations.

    The goal of this function is to help users transition from deprecated imports to new imports.

    The function will raise deprecation warning on loops using deprecated_lookups or fallback_module.

    Module lookups will import without deprecation warnings (used to speed up imports from large namespaces like llms or chat models).

    This function should ideally only be used with deprecated imports not with existing imports that are valid, as in addition to raising deprecation warnings the dynamic imports can create other issues for developers (e.g., loss of type information, IDE support for going to definition etc).

    Modules

    module
    obs_file
    module
    notebook
    module
    s3_directory
    module
    airbyte_json
    module
    sitemap
    module
    tensorflow_datasets
    module
    azure_blob_storage_container
    module
    org_mode
    module
    hugging_face_dataset
    module
    roam
    module
    dataframe
    module
    dropbox
    module
    onenote
    module
    telegram
    module
    ifixit
    module
    word_document
    module
    obs_directory
    module
    snowflake_loader
    module
    bigquery
    module
    rtf
    module
    gcs_file
    module
    pdf
    module
    open_city_data
    module
    xorbits
    module
    lakefs
    module
    onedrive
    module
    baiducloud_bos_file
    module
    epub
    module
    chatgpt
    module
    browserless
    module
    chromium
    module
    async_html
    module
    conllu
    module
    url
    module
    image_captions
    module
    notion
    module
    iugu
    module
    azure_ai_data
    module
    fauna
    module
    googledrive
    module
    mongodb
    module
    web_base
    module
    gcs_directory
    module
    directory
    module
    arcgis_loader
    module
    quip
    module
    concurrent
    module
    assemblyai
    module
    toml
    module
    airtable
    module
    college_confidential
    module
    google_speech_to_text
    module
    azure_blob_storage_file
    module
    polars_dataframe
    module
    geodataframe
    module
    generic
    module
    arxiv
    module
    bibtex
    module
    youtube
    module
    rss
    module
    cube_semantic
    module
    larksuite
    module
    joplin
    module
    max_compute
    module
    twitter
    module
    datadog_logs
    module
    couchbase
    module
    spreedly
    module
    imsdb
    module
    figma
    module
    base_o365
    module
    confluence
    module
    airbyte
    module
    readthedocs
    module
    slack_directory
    module
    azlyrics
    module
    obsidian
    module
    evernote
    module
    python
    module
    hn
    module
    markdown
    module
    weather
    module
    helpers
    module
    sharepoint
    module
    nuclia
    module
    powerpoint
    module
    srt
    module
    diffbot
    module
    tencent_cos_directory
    module
    pyspark_dataframe
    module
    rocksetdb
    module
    duckdb_loader
    module
    apify_dataset
    module
    gitbook
    module
    csv_loader
    module
    blackboard
    module
    gutenberg
    module
    acreom
    module
    stripe
    module
    xml
    module
    merge
    module
    baiducloud_bos_directory
    module
    facebook_chat
    module
    html_bs
    module
    tsv
    module
    s3_file
    module
    image
    module
    json_loader
    module
    url_playwright
    module
    tomarkdown
    module
    blockchain
    module
    docusaurus
    module
    onedrive_file
    module
    mediawikidump
    module
    rst
    module
    mastodon
    module
    recursive_url_loader
    module
    text
    module
    mhtml
    module
    git
    module
    wikipedia
    module
    odt
    module
    news
    module
    reddit
    module
    url_selenium
    module
    trello
    module
    modern_treasury
    module
    pubmed
    module
    unstructured
    module
    etherscan
    module
    html
    module
    whatsapp_chat
    module
    email
    module
    rspace
    module
    brave_search
    module
    notiondb
    module
    tencent_cos_file
    module
    discord
    module
    psychic
    module
    github
    module
    bilibili
    module
    docugami
    module
    excel
    module
    base
    module
    blob_loaders
    module
    parsers
    View source on GitHub