● PHANTOM

🇮🇳 IN

✕

LangChain Reference home page

GitHub
Main Docs

Google GenAI (Gemini)

Google Vertex AI

IBM

Overview
Chat Models
LLMs
Embeddings
Rerankers

⌘I

LangChain Assistant

Ask a question to get started

Enter to send•Shift+Enter new line

Menu

Google GenAI (Gemini)

Google Vertex AI

Overview Chat Models LLMs Embeddings Rerankers

Language

Theme

Pythonlangchain-nvidia-ai-endpoints

langchain-nvidia-ai-endpoints

Description

`langchain-nvidia-ai-endpoints`

Classes

ChatNVIDIA

NVIDIA chat model.

UsageCallbackHandler

Callback Handler that tracks OpenAI info.

NVIDIAEmbeddings

Client to NVIDIA embeddings models.

Ranking

NVIDIARerank

LangChain Document Compressor that uses the NVIDIA NeMo Retriever Reranking API.

NVIDIA

LangChain LLM that uses the Completions API with NVIDIA NIMs.

Model

Model information.

ChatNVIDIADynamo

ChatNVIDIA subclass that injects nvext.agent_hints into requests

Functions

convert_message_to_dict

Convert a LangChain message to a dictionary.

parse_thinking_content

Parse thinking content from text.

standardize_model_name

Standardize the model name to a format that can be used in the OpenAI API.

get_token_cost_for_model

Get the cost in USD for a given model and number of tokens.

get_usage_callback

Get the OpenAI callback handler in a context manager.

register_model

Register a model as a known model.

lookup_model

Lookup a model by name, using only the table of known models.

determine_model

Determine the model to use based on a name, using only the table of known models.

Modules

langchain_nvidia_ai_endpoints

LangChain NVIDIA AI Foundation Model Playground Integration

chat_models

Chat Model Components Derived from ChatModel/NVIDIA

callbacks

Callback Handler that prints to std out.

embeddings

reranking

llm

chat_models_dynamo

ChatNVIDIA subclass with Dynamo KV cache optimization support.