llms.py

Latest Features

Latest features and updates in llms.py

Feb 9, 2026

Custom User and Agent Avatars

Personalize your chat experience with custom avatars for both yourself and AI agents. Upload images via the Settings page or manually add them to ~/.llms/users/ - supports .png, .svg, and auto-converts from other formats.

User Avatar

User Avatar

Click to view full size

Agent Avatar

Agent Avatar

Click to view full size

Compact Tools with Smart Summarization

The new Compact Tools feature automatically summarizes long tool arguments and outputs in the Chat UI to keep conversations concise while still providing access to important information as needed.

Tools Expanded

Tools Expanded

Click to view full size

Tools Collapsed

Tools Collapsed

Click to view full size

Previously, even with long Tool Call Arguments minimized, you could still only see a few on a page. Now that they're collapsed by default, you can see more at a glance and expand only the ones you need.

Feb 8, 2026

Support for Voice Input

Added Voice Input extension with speech-to-text transcription via a microphone button or ALT+D shortcut, supporting three modes: local transcription with voxtype, custom transcribe executable, and cloud-based voxtral-mini-latest via Mistral.

  • Added tok/s metrics in Chat UI on a per-message and per-thread basis

Feb 5, 2026

Voxtral Audio Models

Added support for Mistral's Voxtral audio transcription models - use the audio input filter in the model selector to find them.

Both the Chat Completion and dedicated Audio Transcription APIs deliver impressive speed, with the dedicated transcription endpoint returning results near-instantly.

Voxtral Chat

Voxtral Chat

Click to view full size

Voxtral Audio Transcription

Voxtral Audio Transcription

Click to view full size

Compact Threads

Added Compact Threads feature for managing long conversations - it summarizes the current thread into a new, condensed thread targeting 30% of the original context size. The compact button appears when a conversation exceeds 10 messages or uses more than 40% of the model's context limit.

Compact Button

Compact Button

Click to view full size

Compact Button Intensity

Compact Button Intensity

Click to view full size

The compaction model and prompts are fully customizable in ~/.llms/llms.json.

  • Fix OpenRouter provider after models.dev switched to use @openrouter/ai-sdk-provider. Remove llms.json to reset to default configuration:
rm ~/.llms/llms.json

Feb 3, 2026

  • Removed duplicate filesystem tools from Core Tools, they're now only included in File System Tools

  • Add sort_by and max_result options in search_files and made path and optional parameter to improve utility and reduce tool use error rates. path now defaults to the first allowed directory (project dir).

Feb 3, 2026

  • Add support for overridable ClientTimeout limits in ~/.llms/llms.json:
{
    "limits": {
        "client_timeout": 120
    }
}
  • Show proceed button for assistant messages without content but with reasoning

Feb 2, 2026

Multi User Skills

When Auth is enabled, each user manages their own skill collection at ~/.llms/user/<user>/skills and can enable or disable skills independently. Shared global & project-level skills remain accessible but read-only.

Jan 31, 2026

Jan 30, 2026

  • Support for tool calling for models returned by local Ollama instances

  • New openai-local provider for custom OpenAI-compatible endpoints

  • Fix computer tool issues in Docker by only loading computer tool if run in environment with a display

Jan 29, 2026

Skills Management

Added a full Skills Management UI for creating, editing, and deleting skills directly from the browser.

Skills package domain-specific instructions, scripts, references & assets that enhance your AI agent.

Browse & Install Skills

Added a Skill Browser with access to the top 5,000 community skills from skills.sh. Search, browse, and install pre-built skills directly into your personal collection.

Browse Skills

Browse Skills

Click to view full size

Installing Skill

Installing Skill

Click to view full size

Jan 28, 2026

  • Use a barebones fallback markdown render when markdown renders like KaTex fail

  • Use sanitizeHtml to avoid breaking layout when displaying rendered html

Jan 26, 2026

  • Add copy button to TextViewer popover menu

  • Add proceed and retry buttons at the bottom of Threads to continue agent loop

  • Add filesystem tools in computer extension

  • Add a simple sendUserMessage API in UI to simulate a new user message on the thread

  • Implement TextViewer component for displaying Tool Args, Tool Output + SystemPrompt

Jan 24, 2026

  • Auto collapse long tool args content and add ability to min/maximize text content

Jan 23, 2026


v3 Released

See v3 release notes for details on the major new features and improvements in v3.