Summary
Langflow is vulnerable to Path Traversal in the Knowledge Bases API (POST /api/v1/knowledge_bases). This occurs because user-supplied knowledge base names are used directly to create file paths without proper sanitization or containment checks. An authenticated attacker can exploit this flaw to create directories and write files anywhere on the server's filesystem.
Details
The vulnerability exists in the create_knowledge_base function within src/backend/base/langflow/api/v1/knowledge_bases.py.
This function constructs file paths directly from the user-supplied name field without sanitization. The value is concatenated with the user's base directory and passed directly to kb_path.mkdir(). Immediately following the directory creation, the application writes embedding_metadata.json and schema.json into this attacker-controlled path.
PoC (Proof of Concept)
For the Create endpoint, an attacker can supply traversal sequences or absolute paths in the name field:
../victim_user/evil_kb
or
/tmp/pwned
This forces kb_path.mkdir() to create directories and write specific application files (embedding_metadata.json and schema.json) at any reachable path on the server.
Impact
Any Langflow instance exposing this endpoint to authenticated users is vulnerable. This exposes the server to:
- Cross-user data compromise: Creation of directories and files within another tenant's knowledge base space.
- Arbitrary filesystem manipulation: Directory creation at any path on the server where the application has write permissions (e.g.,
/app/data).
- Data overwrite: Overwriting existing
embedding_metadata.json and schema.json files in attacker-targeted paths, potentially corrupting existing knowledge bases.
Fixes
The issue was addressed in PR #12337. The fix introduces the _validate_kb_path_containment() helper function, which uses Path.is_relative_to() instead of startswith() to enforce strict path boundaries and prevent prefix-ambiguity bugs. This helper is applied before any filesystem operations. Regression tests were added to verify that traversal payloads return a 403 Forbidden.
Acknowledgements
Thanks to the security researchers who responsibly disclosed this vulnerability:
Summary
Langflow is vulnerable to Path Traversal in the Knowledge Bases API (
POST /api/v1/knowledge_bases). This occurs because user-supplied knowledge base names are used directly to create file paths without proper sanitization or containment checks. An authenticated attacker can exploit this flaw to create directories and write files anywhere on the server's filesystem.Details
The vulnerability exists in the
create_knowledge_basefunction withinsrc/backend/base/langflow/api/v1/knowledge_bases.py.This function constructs file paths directly from the user-supplied
namefield without sanitization. The value is concatenated with the user's base directory and passed directly tokb_path.mkdir(). Immediately following the directory creation, the application writesembedding_metadata.jsonandschema.jsoninto this attacker-controlled path.PoC (Proof of Concept)
For the Create endpoint, an attacker can supply traversal sequences or absolute paths in the
namefield:../victim_user/evil_kbor
/tmp/pwnedThis forces
kb_path.mkdir()to create directories and write specific application files (embedding_metadata.jsonandschema.json) at any reachable path on the server.Impact
Any Langflow instance exposing this endpoint to authenticated users is vulnerable. This exposes the server to:
/app/data).embedding_metadata.jsonandschema.jsonfiles in attacker-targeted paths, potentially corrupting existing knowledge bases.Fixes
The issue was addressed in PR #12337. The fix introduces the
_validate_kb_path_containment()helper function, which usesPath.is_relative_to()instead ofstartswith()to enforce strict path boundaries and prevent prefix-ambiguity bugs. This helper is applied before any filesystem operations. Regression tests were added to verify that traversal payloads return a403 Forbidden.Acknowledgements
Thanks to the security researchers who responsibly disclosed this vulnerability: