NVIDIA chat model.
Callback Handler that tracks OpenAI info.
Client to NVIDIA embeddings models.
LangChain Document Compressor that uses the NVIDIA NeMo Retriever Reranking API.
LangChain LLM that uses the Completions API with NVIDIA NIMs.
Model information.
ChatNVIDIA subclass that injects nvext.agent_hints into requests.
Convert a LangChain message to a dictionary.
Parse thinking content from text.
Standardize the model name to a format that can be used in the OpenAI API.
Get the cost in USD for a given model and number of tokens.
Get the OpenAI callback handler in a context manager.
Register a model as a known model.
Look up a model by name, using only the table of known models.
Determine the model to use based on a name, using only the table of known models.
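
The last three entries (register, look up, determine) all operate on a table of known models. The sketch below illustrates how such a table could behave. It is not the package's implementation: `KnownModel`, `register_known`, `lookup_known`, `determine_known`, and the `aliases` field are hypothetical names chosen for illustration, and alias resolution in the "determine" step is an assumption about how it differs from a plain lookup.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class KnownModel:
    """Hypothetical stand-in for a model information record."""

    id: str
    aliases: Tuple[str, ...] = ()  # assumed: alternative names for the same model


# Hypothetical table of known models, keyed by canonical model id.
_KNOWN: Dict[str, KnownModel] = {}


def register_known(model: KnownModel) -> None:
    """Add a model to the table, replacing any entry with the same id."""
    _KNOWN[model.id] = model


def lookup_known(name: str) -> Optional[KnownModel]:
    """Exact match on the canonical id only; returns None if absent."""
    return _KNOWN.get(name)


def determine_known(name: str) -> Optional[KnownModel]:
    """Try an exact match first, then fall back to a scan of aliases."""
    exact = lookup_known(name)
    if exact is not None:
        return exact
    for model in _KNOWN.values():
        if name in model.aliases:
            return model
    return None


if __name__ == "__main__":
    register_known(KnownModel(id="org/model-a", aliases=("model-a",)))
    print(lookup_known("model-a"))     # alias is not a canonical id
    print(determine_known("model-a"))  # alias resolves to the registered model
```

In this sketch neither function consults a remote service, which matches the "using only the table of known models" wording: a name that is neither a registered id nor an alias yields `None`.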