
Using Langroid with Non-OpenAI LLMs

Langroid was initially written to work with OpenAI models via their API. This may sound limiting, but fortunately:

  • Many open-source LLMs can be served via OpenAI-compatible endpoints. See the Local LLM Setup guide for details.
  • There are tools like LiteLLM that provide an OpenAI-like API for hundreds of non-OpenAI LLM providers (e.g. Anthropic's Claude, Google's Gemini).
  • AI gateways like LangDB, Portkey, and OpenRouter provide unified access to multiple LLM providers with additional features like cost control, observability, caching, and fallback strategies.

Below we show how you can use these various options with Langroid.

Create an OpenAIGPTConfig object with chat_model = "litellm/..."

Install litellm extra

To use litellm you need to install Langroid with the litellm extra, e.g.: pip install "langroid[litellm]"

Next, look up the instructions in the LiteLLM docs for the specific model you are interested in. Here we take the example of Anthropic's claude-instant-1 model. Set up the necessary environment variables as specified in the LiteLLM docs; e.g., for the claude-instant-1 model, you will need to set ANTHROPIC_API_KEY:

export ANTHROPIC_API_KEY=my-api-key

Now you are ready to create an instance of OpenAIGPTConfig with chat_model set to litellm/<model_spec>, where model_spec is based on the LiteLLM docs. For example, for the claude-instant-1 model, you would set chat_model to litellm/claude-instant-1. If you are using the model via a third-party provider (e.g., Amazon Bedrock), the model_spec may also need a provider part, e.g. litellm/bedrock/anthropic.claude-instant-v1. The LiteLLM docs indicate which form to use for each model.

import langroid.language_models as lm

llm_config = lm.OpenAIGPTConfig(
    chat_model="litellm/claude-instant-1",
    chat_context_length=8000,  # adjust according to the model
)

A similar process works for the Gemini 1.5 Pro LLM:

  • get the API key here
  • set the GEMINI_API_KEY environment variable in your .env file or shell
  • set chat_model="litellm/gemini/gemini-1.5-pro-latest" in the OpenAIGPTConfig object

For other Gemini models supported by LiteLLM, see their docs.
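
For example, here is a minimal config sketch for this LiteLLM route (the context length below is illustrative; check your model's actual limit):

import langroid.language_models as lm

llm_config = lm.OpenAIGPTConfig(
    chat_model="litellm/gemini/gemini-1.5-pro-latest",
    chat_context_length=128_000,  # illustrative value; adjust to the model's context window
)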

Gemini LLMs via OpenAI client, without LiteLLM

This is now the recommended way to use Gemini LLMs with Langroid, where you don't need to use LiteLLM. As of 11/20/2024, these models are available via the OpenAI client.

To use Langroid with Gemini LLMs, all you have to do is:

  • set the GEMINI_API_KEY environment variable in your .env file or shell
  • set chat_model="gemini/<model_name>" in the OpenAIGPTConfig object,
    where <model_name> is one of "gemini-1.5-flash", "gemini-1.5-flash-8b", or "gemini-1.5-pro"

See here for details on Gemini models.

For example, you can use this llm_config:

llm_config = lm.OpenAIGPTConfig(
    chat_model="gemini/" + lm.OpenAIChatModel.GEMINI_1_5_FLASH,
)

In most tests you can switch to a Gemini model with --m gemini/gemini-1.5-flash, e.g.:

pytest -xvs tests/main/test_llm.py --m gemini/gemini-1.5-flash

Many of the example scripts allow switching the model using -m or --model, e.g.

python3 examples/basic/chat.py -m gemini/gemini-1.5-flash

AI Gateways for Multiple LLM Providers

In addition to LiteLLM, Langroid integrates with AI gateways that provide unified access to multiple LLM providers with additional enterprise features:

LangDB

LangDB is an AI gateway offering OpenAI-compatible APIs to access 250+ LLMs with cost control, observability, and performance benchmarking. LangDB enables seamless model switching while providing detailed analytics and usage tracking.

To use LangDB with Langroid:

  • Set up your LANGDB_API_KEY and LANGDB_PROJECT_ID environment variables
  • Set chat_model="langdb/<provider>/<model_name>" in the OpenAIGPTConfig (e.g., "langdb/anthropic/claude-3.7-sonnet")
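
For example, assuming LANGDB_API_KEY and LANGDB_PROJECT_ID are set, a minimal config sketch looks like:

import langroid.language_models as lm

llm_config = lm.OpenAIGPTConfig(
    chat_model="langdb/anthropic/claude-3.7-sonnet",  # provider/model spec per LangDB's catalog
)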

For detailed setup and usage instructions, see the LangDB integration guide.

Portkey

Portkey is a comprehensive AI gateway that provides access to 200+ models from various providers through a unified API. It offers advanced features like intelligent caching, automatic retries, fallback strategies, and comprehensive observability tools for production deployments.

To use Portkey with Langroid:

  • Set up your PORTKEY_API_KEY environment variable (plus provider API keys like OPENAI_API_KEY)
  • Set chat_model="portkey/<provider>/<model_name>" in the OpenAIGPTConfig (e.g., "portkey/openai/gpt-4o-mini")
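
For example, assuming PORTKEY_API_KEY (and the relevant provider key, e.g. OPENAI_API_KEY) are set, a minimal sketch:

import langroid.language_models as lm

llm_config = lm.OpenAIGPTConfig(
    chat_model="portkey/openai/gpt-4o-mini",  # provider/model routed through Portkey
)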

For detailed setup and usage instructions, see the Portkey integration guide.

OpenRouter

OpenRouter provides access to a wide variety of both open and proprietary LLMs through a unified API. It features automatic routing and load balancing, making it particularly useful for accessing larger open LLMs without local resources and for using multiple providers through a single interface.

To use OpenRouter with Langroid:

  • Set up your OPENROUTER_API_KEY environment variable
  • Set chat_model="openrouter/<model_name>" in the OpenAIGPTConfig
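
For example, assuming OPENROUTER_API_KEY is set (the model spec below is illustrative; substitute any model from OpenRouter's catalog):

import langroid.language_models as lm

llm_config = lm.OpenAIGPTConfig(
    chat_model="openrouter/mistralai/mistral-7b-instruct",  # illustrative model spec
)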

For more details, see the Local LLM Setup guide.

Working with the created OpenAIGPTConfig object

From here you can proceed as usual, creating instances of OpenAIGPT, ChatAgentConfig, ChatAgent, and Task.

E.g. you can create an object of class OpenAIGPT (which represents any LLM with an OpenAI-compatible API) and interact with it directly:

from langroid.language_models import LLMMessage, Role

llm = lm.OpenAIGPT(llm_config)
messages = [
    LLMMessage(content="You are a helpful assistant", role=Role.SYSTEM),
    LLMMessage(content="What is the capital of Ontario?", role=Role.USER),
]
response = llm.chat(messages, max_tokens=50)

When you interact directly with the LLM, you are responsible for keeping the dialog history. Also, you will often want an LLM to have access to tools/functions and external data/documents (e.g. a vector DB or traditional DB). An Agent class simplifies managing all of these. For example, you can create an Agent powered by the above LLM, wrap it in a Task, and have it run as an interactive chat app:

import langroid as lr

agent_config = lr.ChatAgentConfig(llm=llm_config, name="my-llm-agent")
agent = lr.ChatAgent(agent_config)

task = lr.Task(agent, name="my-llm-task")
task.run()

Example: Simple Chat script with a non-OpenAI proprietary model

Many of the Langroid example scripts have a convenient -m flag that lets you easily switch to a different model. For example, you can run the chat.py script in the examples/basic folder with the litellm/claude-instant-1 model:

python3 examples/basic/chat.py -m litellm/claude-instant-1

Quick testing with non-OpenAI models

There are numerous tests in the main Langroid repo that involve LLMs. Once you set up the dev environment as described in the repo's README, you can run any of those tests (which by default run against an OpenAI GPT-4 model) against local/remote models proxied by LiteLLM (or served locally via the options mentioned above, such as oobabooga, ollama, or llama-cpp-python), using the --m <model-name> option, where model-name takes one of the forms above. Some examples:

pytest -s tests/test_llm.py --m local/localhost:8000
pytest -s tests/test_llm.py --m litellm/claude-instant-1

When the --m option is omitted, the default OpenAI GPT-4 model is used.

chat_context_length is not affected by --m

Be aware that --m only switches the model; it does not affect the chat_context_length parameter in the OpenAIGPTConfig object, which you may need to adjust for different models. So this option is only meant for quickly testing against different models, not as a way to switch between models in a production environment.
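
For example, when switching models in code rather than via --m, a sketch like this pairs the model with an appropriate context length (the value below is an assumption; verify it for your model):

import langroid.language_models as lm

llm_config = lm.OpenAIGPTConfig(
    chat_model="litellm/claude-instant-1",
    chat_context_length=100_000,  # assumption: Claude Instant's ~100k-token window; verify for your model
)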