Skip to content

Releases: infiniflow/ragflow

nightly

01 Dec 13:17
e89afba

Choose a tag to compare

nightly Pre-release
Pre-release

Release nightly created from e89afba at 2026-06-13 21:37:30+08:00

v0.26.0

11 Jun 11:04
92c4b76

Choose a tag to compare

Summary

New features

  • Model providers
    • Implements auto-populated model lists for multiple providers, eliminating the need to type model names manually. This feature currently supports: Ollama, OpenRouter, vLLM, OpenAI-API-Compatible, LM-Studio, VolcEngine, Xinference, LocalAI, BaiduYiyan, GPUStack, and Fish Audio.
    • Allows configuring multiple API keys for the same model provider. #14595
    • Dynamically populates model selection dropdowns in the UI by fetching the currently available models directly from remote model providers. #15711
  • Data source connectors: Implements new data source connectors for Outlook, OneDrive, Microsoft Teams, Slack, SharePoint, Salesforce, and Azure Blob Storage. #15333#15330#15332#15188#15190#15462#15466
  • Dataset - Implements a checkpoint and resume feature for community extraction and entity resolution, the most expensive and time-consuming parts of the GraphRAG indexing pipeline. #15518#15523

Improvements

  • Removes <think> text buffering to ensure reasoning-capable models feel faster and more transparent during interactions. #15891
  • Marks MySQL migrations as applied. #15504

Model Support

i18n

  • Completes Korean translation. #15863
  • Completes Italian translation. #15729

Bug fixes

  • The thinking mode of MiniMax models was not correctly enabled. #15496
  • Infinite loops were triggered when the thinking mode was enabled for Qwen3.5 and Qwen3.6 models. #15101
  • Streamed answers were being duplicated when using the OpenAI-compatible chat completions API endpoint. #15286#15443
  • Serialization errors were caused during chat completions when invalid numeric scores like NaN (Not-a-Number) or Inf (Infinity) were passed to the JSON encoder. #15245#15266
  • Chat completions using LiteLLM providers were failing because unrecognized internal configuration parameters were not being filtered out before reaching the external APIs. #15427#15432
  • The OpenAI-compatible chat completions API was defaulting to streamed responses. #15356#15394
  • Empty AND results were incorrectly dropped during metadata filtering. #15477
  • Repetitive page chrome, such as headers and footers, was incorrectly extracted as main text by the MinerU parser. #15335#15387
  • English chart titles were missing during document extraction in the DeepDoc module. #15481
  • Empty outputs were returned by the TitleChunker for json and chunks upstream formats #14247#15396
  • An error message was missing when a .tsv file upload attempt failed. #15284
  • API tokens missing beta values caused token retrieval errors. #15405
  • Caps the maximum page size to fix system crashes or slowdowns from large queries. #15292
  • Client errors were caused by the OpenAI-compatible chat completion API incorrectly defaulting to streamed responses. #15356#15394
  • HTTP 500 internal server errors were triggered instead of standard 4xx client errors when users attempted to download missing files from the storage backend. #15369#15371
  • GraphRAG entity ranking was broken. #15795#15797

What's Changed

Read more

v0.25.6

27 May 01:50
8f0632c

Choose a tag to compare

Summary

Released on May 26, 2026.

New features

  • Agent: Adds a Browser component that enables AI to autonomously navigate and interact with web pages. #14888

Improvements

  • RAG: RAPTOR construction now introduces AHC mode (Ψ-RAG), which expands semantics from the document level to the dataset level. Not only is index construction performance significantly higher than the previous RAPTOR, but it also outperforms the old RAPTOR on Recall@5 and average F1. Users can switch between AHC and GMM modes. This feature was introduced in v0.25.3, and this release fixes several bugs. #14674#14679
  • Agent: Introduces lightweight @tool decorator to streamline Python function registration process for chat models. #15047
  • Agent: Enables agent messages to display base64-encoded images. #15212
  • Agent: Exposes Doc Generator component's file metadata as discrete variables. #15080
  • Agent: Allows developers to pass chat_template_kwargs to agent chat completion endpoint. #14182#14542

Bug fixes

  • Fixes /chat/completions to allow sending only latest message in API payload and removes requirement to transmit full conversation history. #15197 See also Converse with chat assistant.
  • Weight assigned to vector similarity was not applied during the retrieval phase. #15108
  • Fixes parser configs not saving on dataset configuration page.
  • Logs wer not fully displayed on data source details page.
  • Fixes document status filtering failure.
  • Fixes crash guard for empty LLM choices responses.
  • RAG: RAPTOR construction process halted when using the Infinity document engine. #14998
  • Fixes streaming response parsing for Mistral/Upstage reasoning models.
  • Fixes HTML tags in ingestion pipeline parser output.
  • Fixes table parser metadata.
  • Fixes asyncio event loop nesting and fire-and-forget task issues.
  • Fixes asyncio.Semaphore bound to different event loop error.
  • Agent: Fixes Agent component prompt variable disappearing and search vector_similarity_weight issues.

What's Changed

Read more

v0.25.5

20 May 12:08
90c76e7

Choose a tag to compare

Summary

Released on May 20, 2026.

New features

  • Adds local & SSH providers in admin panel. #15039

Improvements

  • Accelerated dataset search path, reducing latency by 50–100% by removing expensive vector fetch and rerank similarity computation steps. #14970
  • Pushes metadata filters down to Infinity, significantly speeding up metadata filtering. #14974
  • added Redis caching for TTS. #14851
  • atomic document counter updates #14867
  • Improved server startup speed and memory usage #14973
  • Agent: structured output aggregation #13384 #14848
  • Agent: metadata filter reuse. #14849
  • Optimizes connector dashboard. #14979

What's Changed

Read more

v0.25.4

14 May 03:26
851b16b

Choose a tag to compare

Summary

New features

  • Adds a generic, configuration-driven RESTful API data source connector.

Improvements

  • Agent tag management with filtering and sorting.
  • Widget customization and persistence.

Model support

  • Adds gpt-5.4-mini and gpt-5.4-nano to the OpenAI model list

Bug fixes

Fixed dataset document download route.

What's Changed

New Contributors

Full Changelog: v0.25.3...v0.25.4

v0.25.3

13 May 11:50
30d1c1d

Choose a tag to compare

Summary

New features

  • Data source and parsing: Added column-level semantic/metadata control for the spreadsheet file parser; introduced ETag optimization for incremental synchronization of S3 data sources to avoid unnecessary file transfers.
  • Enables assigning specific roles like content, metadata, and primary key, to table columns. #13710

Improvements

  • API refactoring and security
    • Continues the transition of web APIs to RESTful conventions, ensuring backward compatibility for all legacy endpoints.
    • Binds the user_id in POST /api/v1/messages to the authenticated JWT principal. #14745
    • Secures the sandbox executor against dynamic and indirect code execution bypasses. #14690
    • LLM request timeout control
    • High concurrency blocking call thread pool
  • Reduces ingestion server boot time. #14894

Bug fixes

  • Images in multi-sheet Excel workbooks were not scoped by sheet, causing images to be incorrectly attributed across different worksheets. #14120
  • Iteration item alias passing
  • Tool parameter template parsing
  • Code execution attachment output
  • Volcano model addition fix

What's Changed

Read more

v0.25.2

09 May 11:07
57b24be

Choose a tag to compare

Summary

Improvements

  • API refactoring and unification: Continues the transition of web APIs to RESTful conventions, ensuring backward compatibility for all legacy endpoints.

Data source

  • Introduces a lightweight snapshot mechanism for synchronizing deleted files across eight data sources—including Moodle, DingTalk AI Table, and RSS—ensuring a faithful reflection of all remote data sources. #14362#14499

Bug fixes

  • Metadata visibility issues during v0.24.0 to v0.25.0 upgrades.
  • Duplicate chat output.
  • Metadata filtering was handled in-memory instead of leveraging Elasticsearch, incurring performance bottlenecks. #14576

What's Changed

Read more

v0.25.1

30 Apr 02:54
ce4c782

Choose a tag to compare

Summary

Improvements

  • API refactoring and unification: Standardizes web APIs to RESTful conventions across all endpoints, unifying document creation and indexing flows while maintaining backward compatibility.
  • Parsing optimizations: Adds OpenDataLoader PDF backend. #14097
  • Introduces lazy loading and chunked parsing for large PDFs (>50 pages), significantly reducing memory footprint. #14385

Data source

Enables synchronizing deleted files in Bitbucket, Gmail, Google Drive, and Airtable.

Model support

  • DeepSeek v4

Model providers

  • UCloud

Bug fixes

  • Metadata visibility issues during v0.24.0 to v0.25.0 upgrades.
  • Duplicate chat output.

What's Changed

Read more

v0.25.0

21 Apr 09:32
a33d073

Choose a tag to compare

Release note

  • Ingestion Pipeline

    • Added 7 built-in pipeline templates aligned with RAGFlow’s native document parsers.
    • Several new templates bring significant parsing improvements.
  • Data Sources

    • New data sources: Seafile, RSS, DingTalk AI Sheet.
    • Added synchronization for file deletions from data sources.
  • Improved DOCX parsing strategy with lazy-load support for images, reducing memory consumption.

  • Agent

    • Introduced agent publishing capability
    • Sandboxed code execution and chart generation
    • New template: Data Analysis Agent
  • Memory: ddded user-level memory storage and retrieval.

  • Language & UI

    • New language support: Arabic, Bulgarian, Turkish
    • UI now supports Right-to-Left (RTL) layout
  • DocEngine: supports upgrade to Elasticsearch 9.x.

  • New Model Support:

    • MiniMax-M2.7 series models
    • Perplexity embedding model: pplx-embed
    • Tongyi rerank model
  • New Model Providers

  • Mobile Compatibility: embedded chat pages in Chat / Agent / Search are now compatible with mobile devices.

  • Due to Minio’s official image no longer being maintained, the default object storage container has been changed to pgsty/minio.

  • Database Upgrade: added database upgrade scripts. See documentation for details.

  • Ecosystem Integration: RAGFlow can now be accessed via OpenClaw: https://clawhub.ai/yingfeng/ragflow-skill

Release v0.25.0 created from a33d073 at 2026-04-21 17:32:49+08:00

What's Changed

Read more

v0.24.0

10 Feb 09:27
392ec99

Choose a tag to compare

Summary

New features

  • Memory
    • Introduces APIs and an SDK for developer integration.
    • Adds Memory extraction log display in the console for improved debugging and tracing.
  • Dataset
    • Added support for batch management of Metadata.
    • Renamed "ToC (Table of Contents)" to "PageIndex".
  • Agent
    • Launches a new Chat-like Agent conversation management interface that retains Sessions and dialogue history.
    • Introduces a multi-Sandbox mechanism, currently supporting local gVisor and Alibaba Cloud, with compatibility for mainstream Sandbox APIs (configurable in the Admin page).
  • Chat
    • Adds a new "Thinking" mode and removed the previous "Reasoning" configuration option.
    • Optimizes retrieval strategies for deep-research scenarios, enhancing recall accuracy.
  • Admin
    • Adds support for configuring multiple Admin accounts.
  • Model configuration center
    • Adds a model connection test feature when adding new models.
  • Ecosystem
    • Adds support for OceanBase as a database alternative to MySQL.
    • Adds support for PaddleOCR-VL.
  • Model
    • Adds new model support for Kimi 2.5, Stepfun 3, and doubao-embedding-vision, among others.
  • Data sources
    • Adds new data source integrations for Zendesk, Bitbucket, and others.

What's Changed

Read more