Stream and capture reasoning content, in addition to the final answer, from Reasoning LLMs

As of v0.35.0, when using certain Reasoning LLM APIs (e.g. deepseek/deepseek-reasoner):

  • You can see both the reasoning (dim green) and final answer (bright green) text in the streamed output.
  • When directly calling the LLM (without using an Agent), the LLMResponse object now contains a reasoning field, in addition to the existing message field.
  • When using ChatAgent.llm_response, extract the reasoning text from the ChatDocument object's reasoning field (in addition to extracting the final answer, as usual, from the content field).

Below is a simple example, also in this script:

Some notes:

  • To get the reasoning trace from Deepseek-R1 via OpenRouter, you must include the extra_body parameter with include_reasoning=True, as shown below.
  • When using the OpenAI o3-mini model, you can set the reasoning_effort parameter to "high", "medium", or "low" to control the reasoning effort.
  • As of Feb 9, 2025, OpenAI reasoning models (o1, o1-mini, o3-mini) do not expose reasoning trace in the API response.
import langroid as lr
import langroid.language_models as lm

llm_config = lm.OpenAIGPTConfig(
    chat_model="deepseek/deepseek-reasoner",
    # inapplicable params are automatically removed by Langroid
    params=lm.OpenAICallParams(
        reasoning_effort="low",  # only supported by o3-mini
        # below lets you get the reasoning trace when using
        # openrouter/deepseek/deepseek-r1
        extra_body=dict(include_reasoning=True),
    ),
)

# (1) Direct LLM interaction
llm = lm.OpenAIGPT(llm_config)

response = llm.chat("Is 9.9 bigger than 9.11?")

# extract reasoning
print(response.reasoning)
# extract answer
print(response.message)

# (2) Using an agent
agent = lr.ChatAgent(
    lr.ChatAgentConfig(
        llm=llm_config,
        system_message="Solve the math problem given by the user",
    )
)

response = agent.llm_response(
    """
    10 years ago, Jack's dad was 5 times as old as Jack.
    Today, Jack's dad is 40 years older than Jack.
    How old is Jack today?
    """
)

# extract reasoning
print(response.reasoning)
# extract answer
print(response.content)
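
If you route Deepseek-R1 through OpenRouter instead of calling the DeepSeek API directly, the only changes are the chat_model prefix and the extra_body parameter from the notes above. A minimal sketch (the variable name llm_config_openrouter is just illustrative, and it assumes your OpenRouter API key is configured):

# variant of the config above, pointed at Deepseek-R1 via OpenRouter
llm_config_openrouter = lm.OpenAIGPTConfig(
    chat_model="openrouter/deepseek/deepseek-r1",
    params=lm.OpenAICallParams(
        # required to get the reasoning trace back from OpenRouter
        extra_body=dict(include_reasoning=True),
    ),
)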

Displaying Reasoning in UI Callbacks

When using Langroid with UI frameworks like Chainlit, the reasoning content from LLM responses is automatically passed to the callback methods. This allows you to display the chain-of-thought reasoning in your UI.

The following callback methods receive a reasoning parameter:

  • show_llm_response(content, tools_content, is_tool, cached, language, reasoning) - For non-streaming LLM responses
  • finish_llm_stream(content, tools_content, is_tool, reasoning) - For streaming LLM responses

Chainlit Integration

When using ChainlitAgentCallbacks or ChainlitTaskCallbacks, reasoning content is automatically displayed as a nested message under the main LLM response. The reasoning appears with a "💭 Reasoning" label in the author field.
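
For example, inside a Chainlit app you attach the callbacks to an agent once, and the reasoning display then happens automatically. A minimal sketch, reusing llm_config from the example above (assuming a standard Chainlit on_chat_start handler):

import chainlit as cl
import langroid as lr
from langroid.agent.callbacks.chainlit import ChainlitAgentCallbacks

@cl.on_chat_start
async def on_chat_start():
    agent = lr.ChatAgent(lr.ChatAgentConfig(llm=llm_config))
    # attaching the callbacks is all that's needed: reasoning content
    # will show up as a nested "💭 Reasoning" message under each response
    ChainlitAgentCallbacks(agent)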

Custom Callback Implementation

If you're implementing custom callbacks, you can access the reasoning parameter to display it however you prefer:

from langroid.agent.base import Agent, AgentConfig

def my_show_llm_response(
    content: str,
    tools_content: str = "",
    is_tool: bool = False,
    cached: bool = False,
    language: str | None = None,
    reasoning: str = "",
) -> None:
    # Display the main response
    print(f"Response: {content}")

    # Display reasoning if available
    if reasoning:
        print(f"Reasoning: {reasoning}")

# Attach to an agent
agent = Agent(AgentConfig())  # any agent with callbacks works here
agent.callbacks.show_llm_response = my_show_llm_response
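
For streamed responses, a custom finish_llm_stream callback receives the same reasoning parameter (per the signature listed above). A sketch along the same lines:

def my_finish_llm_stream(
    content: str,
    tools_content: str = "",
    is_tool: bool = False,
    reasoning: str = "",
) -> None:
    # called once the stream completes, with the full accumulated text
    print(f"Final streamed response: {content}")
    if reasoning:
        print(f"Reasoning: {reasoning}")

agent.callbacks.finish_llm_stream = my_finish_llm_stream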