v1.9.6

github-actions released this 27 Jun 14:59
· 2 commits to master since this release

Changes

Llama.cpp support

  • Add llama.cpp support with pre-compiled Vulkan libraries for Windows, which should work out of the box with any modern GPU. Mac and Linux users still need to install llama-cpp-python manually. You can now use RWKV GGUF models as well as other GGUF models such as DeepSeek, Qwen3, Gemma3, and Phi4. Select the llama.cpp tag on the Models page to download the required models with one click, or place downloaded GGUF models in the models directory.
  • The software's preset configs have been streamlined and now include some GGUF format presets. You can click the reset button to fetch the latest presets.
  • The RWKV-Runner Python server in llama.cpp mode has been optimized. After the model is loaded onto the GPU, the server process occupies only about 200 MB of memory on Windows.
  • When calling the /switch-model API on the server, pass a file path ending with .gguf in the model field to load the model in llama.cpp mode.
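
As a rough illustration of the /switch-model call described above, here is a minimal Python sketch. Only the /switch-model path and the model field come from these notes. The server address, the use of POST with a JSON body, the helper names, and the example file path are assumptions made for illustration, and your server may expect additional fields.

```python
import json
import urllib.request

# Assumption: the server listens on this address; adjust to your setup.
BASE_URL = "http://127.0.0.1:8000"


def build_switch_model_request(model_path: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a request for /switch-model that carries a GGUF file path.

    Hypothetical helper: assumes the endpoint accepts POST with a JSON body.
    """
    # Per the release notes, a path ending with .gguf selects llama.cpp mode.
    if not model_path.endswith(".gguf"):
        raise ValueError("llama.cpp mode expects a file path ending with .gguf")
    body = json.dumps({"model": model_path}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/switch-model",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def switch_model(model_path: str, base_url: str = BASE_URL) -> str:
    """Send the request and return the raw response body as text."""
    request = build_switch_model_request(model_path, base_url)
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")


# Example (hypothetical path; requires a running server):
# print(switch_model("models/example.gguf"))
```

Building the request separately from sending it makes the payload easy to inspect before it reaches the server.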

Features

  • llama.cpp support
  • Add a setting to save the full RWKV-Runner client state, rather than storing only the necessary settings. This option is enabled by default. To restore the previous version's behavior, disable it and restart the software.
  • Add a share button to save a screenshot of your chat

Improvements

  • reduce peak memory usage when loading rwkv7 in cuda mode
  • increase the maximum value of the top_k API parameter to 100
  • remove language tags from the Models page, as all new models are multilingual
  • remove useless/disabled resources
  • other small improvements
  • You can run RWKV-Runner on Windows 7 by installing the patches from the link below. Note that you still need to install Python 3.8 and dependencies manually. https://github.com/josStorer/wails/releases/tag/v2.9.2x

Fixes

  • fix the issue of failing to load the state for RWKV7
  • Fix the abnormal behavior when passing a Tool Definition array. This is a frontend only parameter construction issue.
  • fix the issue where the model list did not refresh automatically after downloading the model when using a custom model path

Note: If you encounter WebView2 crashes, open Windows Settings, click Apps, search for WebView2, then click Modify -> Repair to update your WebView2 runtime.

Install