Client to NVIDIA embeddings models.
Model information.
LangChain LLM that uses the Completions API with NVIDIA NIMs.
LangChain Document Compressor that uses the NVIDIA NeMo Retriever Reranking API.
NVIDIA chat model.
Callback Handler that tracks OpenAI info.
Convert a LangChain message to a dictionary.
Register a model as a known model.
Lookup a model by name, using only the table of known models.
Determine the model to use based on a name, using only the table of known models.
Parse thinking content from text.
Standardize the model name to a format that can be used in the OpenAI API.
Get the cost in USD for a given model and number of tokens.
Get the OpenAI callback handler in a context manager.