You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Changes
Visual ingest for PDFs (opt-in).visual: true (MCP) or --visual (CLI) inlines a short caption from a local VLM (HuggingFaceTB/SmolVLM-256M-Instruct) into the text chunks for each page with figures, tables, or diagrams. Captions are auxiliary text — not image search, not OCR, and not a faithful transcription of the figure. Default ingest is unchanged. See the README's Ingesting PDFs with figures section for usage and the security note.
GPU acceleration. Embedding runs on WebGPU when available, with a CPU fallback. Override with RAG_DEVICE=cpu.
Bug fix (Windows paths). File metadata fields (file name, extension) now extract correctly from backslash-style paths. Previously these fields could be empty or contain the full path when ingesting from a Windows path.
MCP server is env-only. Configuration comes from environment variables only; passing CLI flags to npx mcp-local-rag now fails fast with a clear error instead of being silently ignored.
Contributors
Thanks to @mickey-mikey for GPU support (#128), the env-only MCP server fix (#123), and the Windows path-handling fix (#118).