Name	Name	Last commit message	Last commit date
parent directory ..
examples	examples
src/everalgo/knowledge	src/everalgo/knowledge
tests	tests
CHANGELOG.md	CHANGELOG.md
LICENSE	LICENSE
README.md	README.md
pyproject.toml	pyproject.toml

Name

Last commit message

Last commit date

examples

src/everalgo/knowledge

everalgo-knowledge

Extracts a flattened topic tree (list[KnowledgeMemory]) from parsed content (ParsedContent) via LLM.

Use

from everalgo.knowledge import KnowledgeExtractor
from everalgo.types import ParsedContent

parsed = ParsedContent(text="...", mime="text/markdown")
memories = await KnowledgeExtractor(llm=client).aextract(parsed, doc_id="doc1", title="My doc")
# memories[0] = synthetic doc root; rest = DFS-ordered topic nodes.

llm: any everalgo.llm.LLMClient (OpenAI-compatible), passed at construction. Sync bridge: extract(...).

Pipeline at a glance

ParsedContent
  → preprocess + atomize           (_block_split)
  → topic-tree LLM (per batch)     (TOPIC_TREE_EXTRACT_PROMPT_EN)
  → cross-batch merge (>1 batch)   (CONTENT_MERGE + TOPIC_MERGE)
  → postprocess: split unsplit leaves + assign uncovered blocks
  → tree assembly + DFS flatten    → list[KnowledgeMemory]

Install

pip install everalgo-knowledge    # pulls everalgo-core + everalgo-parser

For development within the monorepo workspace:

uv sync --package everalgo-knowledge

Live-LLM scripts

Run from the repository root. Put LLM_API_KEY, LLM_BASE_URL, LLM_MODEL in a .env file there first; each command block below sources it inline and is paste-and-run.

Smoke test — 2 fixtures, structural assertions only, no artifacts written:

set -a; source .env; set +a
uv run pytest -m integration packages/everalgo-knowledge/tests/functional/test_extraction_smoke.py

Run one doc — dumps result (JSON + optional HTML viz) under tests/functional/outputs/:

set -a; source .env; set +a
uv run python packages/everalgo-knowledge/examples/run_one_doc.py \
  packages/everalgo-knowledge/tests/functional/fixtures/idx_multi_topic.json --html

Related distributions

everalgo-parser — produces ParsedContent from raw files (fully implemented except video, which is deferred pending ADR)
everalgo-core — KnowledgeMemory type is defined here

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

everalgo-knowledge

Use

Pipeline at a glance

Install

Live-LLM scripts

Related distributions

FilesExpand file tree

everalgo-knowledge

Directory actions

More options

Directory actions

More options

Latest commit

History

everalgo-knowledge

Folders and files

parent directory

README.md

everalgo-knowledge

Use

Pipeline at a glance

Install

Live-LLM scripts

Related distributions