Skip to content

Release: v0.14.0

Choose a tag to compare

@shinpr shinpr released this 17 May 03:05
· 100 commits to main since this release
f7412a8

Changes

  • Visual ingest for PDFs (opt-in). visual: true (MCP) or --visual (CLI) inlines a short caption from a local VLM (HuggingFaceTB/SmolVLM-256M-Instruct) into the text chunks for each page with figures, tables, or diagrams. Captions are auxiliary text — not image search, not OCR, and not a faithful transcription of the figure. Default ingest is unchanged. See the README's Ingesting PDFs with figures section for usage and the security note.
  • GPU acceleration. Embedding runs on WebGPU when available, with a CPU fallback. Override with RAG_DEVICE=cpu.
  • Bug fix (Windows paths). File metadata fields (file name, extension) now extract correctly from backslash-style paths. Previously these fields could be empty or contain the full path when ingesting from a Windows path.
  • MCP server is env-only. Configuration comes from environment variables only; passing CLI flags to npx mcp-local-rag now fails fast with a clear error instead of being silently ignored.

Contributors

  • Thanks to @mickey-mikey for GPU support (#128), the env-only MCP server fix (#123), and the Windows path-handling fix (#118).