Module●Since v1.0

content

Standard, multimodal content blocks for Large Language Model I/O.

This module provides standardized data structures for representing inputs to and outputs from LLMs. The core abstraction is the Content Block, a TypedDict.

Rationale

Different LLM providers use distinct and incompatible API schemas. This module provides a unified, provider-agnostic format to facilitate these interactions. A message to or from a model is simply a list of content blocks, allowing for the natural interleaving of text, images, and other content in a single ordered sequence.

An adapter for a specific provider is responsible for translating this standard list of blocks into the format required by its API.

Extensibility

Data not yet mapped to a standard block may be represented using the NonStandardContentBlock, which allows for provider-specific data to be included without losing the benefits of type checking and validation.

Furthermore, provider-specific fields within a standard block are fully supported by default in the extras field of each block. This allows for additional metadata to be included without breaking the standard structure. For example, Google's thought signature:

AIMessage(
    content=[
        {
            "type": "text",
            "text": "J'adore la programmation.",
            "extras": {"signature": "EpoWCpc..."},  # Thought signature
        }
    ], ...
)

Note

Following widespread adoption of PEP 728, we intend to add extra_items=Any as a param to Content Blocks. This will signify to type checkers that additional provider-specific fields are allowed outside of the extras field, and that will become the new standard approach to adding provider-specific metadata.

Note

Example with PEP 728 provider-specific fields:

# Content block definition
# NOTE: `extra_items=Any`
class TextContentBlock(TypedDict, extra_items=Any):
    type: Literal["text"]
    id: NotRequired[str]
    text: str
    annotations: NotRequired[list[Annotation]]
    index: NotRequired[int]

from langchain_core.messages.content import TextContentBlock

# Create a text content block with provider-specific fields
my_block: TextContentBlock = {
    # Add required fields
    "type": "text",
    "text": "Hello, world!",
    # Additional fields not specified in the TypedDict
    # These are valid with PEP 728 and are typed as Any
    "openai_metadata": {"model": "gpt-4", "temperature": 0.7},
    "anthropic_usage": {"input_tokens": 10, "output_tokens": 20},
    "custom_field": "any value",
}

# Mutating an existing block to add provider-specific fields
openai_data = my_block["openai_metadata"]  # Type: Any

Example Usage

# Direct construction
from langchain_core.messages.content import TextContentBlock, ImageContentBlock

multimodal_message: AIMessage(
    content_blocks=[
        TextContentBlock(type="text", text="What is shown in this image?"),
        ImageContentBlock(
            type="image",
            url="https://www.langchain.com/images/brand/langchain_logo_text_w_white.png",
            mime_type="image/png",
        ),
    ]
)

# Using factories
from langchain_core.messages.content import create_text_block, create_image_block

multimodal_message: AIMessage(
    content=[
        create_text_block("What is shown in this image?"),
        create_image_block(
            url="https://www.langchain.com/images/brand/langchain_logo_text_w_white.png",
            mime_type="image/png",
        ),
    ]
)

Factory functions offer benefits such as:

Automatic ID generation (when not provided)
No need to manually specify the type field

Attributes

attribute

KNOWN_BLOCK_TYPES: set

These are block types known to langchain-core >= 1.0.0.

If a block has a type not in this set, it is considered to be provider-specific.

Functions

function

ensure_id

Ensure the ID is a valid string, generating a new UUID if not provided.

Auto-generated UUIDs are prefixed by 'lc_' to indicate they are LangChain-generated IDs.

function

is_data_content_block

Check if the provided content block is a data content block.

Returns True for both v0 (old-style) and v1 (new-style) multimodal data blocks.

function

create_text_block

Create a TextContentBlock.

function

create_image_block

Create an ImageContentBlock.

function

create_video_block

Create a VideoContentBlock.

function

create_audio_block

Create an AudioContentBlock.

function

create_file_block

Create a FileContentBlock.

function

create_plaintext_block

Create a PlainTextContentBlock.

function

create_tool_call

Create a ToolCall.

function

create_reasoning_block

Create a ReasoningContentBlock.

function

create_citation

Create a Citation.

function

create_non_standard_block

Create a NonStandardContentBlock.

content

Attributes

Functions

Classes

Type Aliases

Menu

content

Attributes

Functions

Classes

Type Aliases