SumLens

Explainability dashboard for AI-generated summaries. Upload a document, get an abstractive summary, and see — sentence by sentence — how strongly each summary sentence is grounded in the source, with low-confidence sentences flagged as potential hallucinations.

Course project · MICS · Principles of Software Development (S2 2025–26).

Demo

▶️ Watch the demo on YouTube — a faithful summary scored all-green, a hallucinated sentence flagged red with its source evidence, adjustable thresholds, and JSON/PDF export.

What SumLens does

SumLens takes a text document (pasted or PDF) and:

1. Summarises it locally using BART (facebook/bart-large-cnn), with no external API.
2. Scores each summary sentence against the source using three signals:
   - Signal A (Classifier): LettuceDetect flags hallucinated tokens.
   - Signal B (NLI): DeBERTa-v3 checks whether atomic claims are entailed by the source.
   - Signal C (Attribution): Inseq integrated gradients measure how much each source span influenced each summary sentence.
3. Fuses the signals into a single grounding score (0 = hallucinated, 1 = grounded).
4. Labels each sentence: grounded, weakly grounded, or hallucinated.
5. Displays the result as a colour-coded summary with click-to-highlight source spans.
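
The fusion step can be pictured as a logistic combination of the three signals. The sketch below is only an illustration: the weights and bias are made up, the function name `fuse` is hypothetical, and each signal is assumed to be already expressed as a support score in [0, 1] (higher = better grounded). The real coefficients and calibration live in sumlens/fuse.py and are not reproduced here.

```python
import math

# Hypothetical weights and bias, chosen only for illustration.
WEIGHTS = {"classifier": 2.0, "nli": 2.5, "attribution": 1.5}
BIAS = -3.0


def fuse(signals: dict[str, float]) -> float:
    """Combine per-signal support scores in [0, 1] into one grounding score."""
    # Weighted sum of the signals, then a sigmoid to squash into (0, 1).
    z = BIAS + sum(WEIGHTS[name] * signals[name] for name in WEIGHTS)
    return 1.0 / (1.0 + math.exp(-z))


well_supported = fuse({"classifier": 1.0, "nli": 1.0, "attribution": 1.0})
unsupported = fuse({"classifier": 0.0, "nli": 0.0, "attribution": 0.0})
```

With these illustrative numbers, a sentence supported by all three signals scores close to 1 and an unsupported one close to 0; mixed evidence lands somewhere in between.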

Hardware requirements

The pipeline loads three large transformer models. Running on CPU is supported but can take several minutes per document.

| Setup | RAM | VRAM | Expected time |
| --- | --- | --- | --- |
| GPU (recommended) | 16 GB | 8 GB+ | ~30–60 s |
| CPU-only | 16 GB | n/a | 3–10 min |

Models are downloaded automatically from Hugging Face on first run (~4 GB total). No paid API key is required.
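
To see which row of the table applies to a given machine, a quick check along these lines may help. It is a minimal sketch that assumes PyTorch is the backend in the active environment; `pick_device` is a hypothetical helper, not part of SumLens, and how SumLens itself selects a device is not stated here.

```python
import importlib.util


def pick_device() -> str:
    """Return "cuda" when PyTorch can see a GPU, otherwise "cpu"."""
    if importlib.util.find_spec("torch") is None:
        return "cpu"  # PyTorch is not installed in this environment
    import torch

    return "cuda" if torch.cuda.is_available() else "cpu"


device = pick_device()
```

If `device` is "cpu", expect the longer CPU-only run times.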

Installation

Requires Python 3.11+ and Git.

git clone https://github.com/bacemtayeb/SumLens.git
cd SumLens
python3.11 -m venv .venv

# Windows
.venv\Scripts\activate
# macOS / Linux
source .venv/bin/activate

pip install -e ".[dev]"
python -m nltk.downloader punkt punkt_tab

Running the app

python app.py

Gradio prints a local URL, e.g.:

Running on local URL: http://127.0.0.1:7860

Open that URL in your browser. The app runs entirely on your machine — no data leaves your computer.

Usage walkthrough

Step 1 — Load a document

You have two options:

Paste text: click the Paste text box and type or paste your document (up to 10 000 words).
Upload a PDF: click the PDF upload box and select a file (up to 5 MB).

If you provide both, the PDF takes priority.
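
The input rules above (size limits and PDF priority) can be sketched as a small validation function. The names `choose_input`, `MAX_WORDS`, and `MAX_PDF_BYTES` are hypothetical, and reading "5 MB" as 5 × 1024 × 1024 bytes is an assumption; the actual checks in sumlens/ingest.py may differ.

```python
from typing import Optional

MAX_WORDS = 10_000                # pasted-text limit stated above
MAX_PDF_BYTES = 5 * 1024 * 1024   # 5 MB, assuming binary megabytes


def choose_input(text: Optional[str], pdf_bytes: Optional[bytes]) -> str:
    """Return "pdf" or "text"; raise ValueError on empty or oversized input."""
    if pdf_bytes:  # the PDF takes priority when both are provided
        if len(pdf_bytes) > MAX_PDF_BYTES:
            raise ValueError("PDF exceeds 5 MB")
        return "pdf"
    if text and text.strip():
        if len(text.split()) > MAX_WORDS:
            raise ValueError("text exceeds 10 000 words")
        return "text"
    raise ValueError("no document provided")
```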

Step 2 — Analyse

Click the Analyse button. The button is disabled while the pipeline runs (typically 30–60 s on GPU, longer on CPU). Both export buttons appear once analysis is complete.

Step 3 — Read the summary

The right panel shows the summary with each sentence colour-coded:

| Colour | Label | Meaning |
| --- | --- | --- |
| Green | Grounded | Well-supported by the source |
| Orange | Weakly grounded | Partial support; treat with caution |
| Red | Hallucinated | Low support; likely fabricated or distorted |

Step 4 — Trace a sentence to the source

Click any summary sentence. The left panel highlights (in yellow) the source sentences most strongly attributed to it by the model. Click a different summary sentence to switch the highlight.
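
One way to picture the highlighting: given one attribution score per source sentence for the clicked summary sentence, pick the strongest few. This is a sketch under assumptions; `top_source_sentences` and the cut-off `k` are hypothetical, and the app may instead use a score threshold or a different number of highlights.

```python
def top_source_sentences(scores: list[float], k: int = 2) -> list[int]:
    """Return indices of the k highest-scoring source sentences, in document order."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


# Illustrative attribution scores for four source sentences.
highlighted = top_source_sentences([0.10, 0.70, 0.05, 0.40], k=2)
```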

Step 5 — Adjust the thresholds (optional)

Two sliders let you change the decision boundaries without re-running the model:

τ hallucinated (default 0.30) — sentences with a grounding score below this value are labelled hallucinated.
τ grounded (default 0.70) — sentences with a grounding score above this value are labelled grounded. Anything in between is weakly grounded.

Move either slider and the summary colours update instantly.
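
The labelling rule is cheap to recompute, which is why the colours can update without re-running the models. A minimal sketch of it, with a hypothetical function name and the assumption that a score exactly on a boundary counts as weakly grounded (following the "below" / "above" wording):

```python
def label(score: float, tau_hallucinated: float = 0.30, tau_grounded: float = 0.70) -> str:
    """Map a grounding score in [0, 1] to one of the three labels."""
    if score < tau_hallucinated:
        return "hallucinated"
    if score > tau_grounded:
        return "grounded"
    return "weakly grounded"


# Re-labelling stored scores after a slider moves needs no model call.
scores = [0.12, 0.55, 0.91]
default_labels = [label(s) for s in scores]
stricter_labels = [label(s, tau_grounded=0.95) for s in scores]
```

Raising τ grounded to 0.95 in this example moves the third sentence from grounded to weakly grounded while the scores themselves stay unchanged.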

Interpreting the results

The grounding score (0–1) represents how strongly the model believes a summary sentence is supported by the source. It is not a probability in a strict statistical sense — treat it as a relative risk indicator.
A hallucinated label does not guarantee the sentence is wrong; it means the model could not find sufficient evidence in the source text. Always cross-check flagged sentences manually.
The signal breakdown (JSON export) shows the individual classifier, NLI, and attribution scores for each sentence, which can help diagnose why a sentence was flagged.
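
As an illustration of using the signal breakdown, the sketch below lists every non-grounded sentence together with its weakest signal. The JSON structure and field names (`sentences`, `label`, `signals`) and the example values are invented for this example; the real schema is defined in sumlens/types.py.

```python
import json

# Hypothetical export; the real field names are defined in sumlens/types.py.
export = json.loads("""
{"sentences": [
  {"text": "The trial enrolled 120 patients.", "label": "grounded",
   "signals": {"classifier": 0.94, "nli": 0.91, "attribution": 0.80}},
  {"text": "Results were published in 2019.", "label": "hallucinated",
   "signals": {"classifier": 0.35, "nli": 0.08, "attribution": 0.22}}
]}
""")


def weakest_signal(sentence: dict) -> tuple[str, float]:
    """Return the name and value of the lowest-scoring signal for a sentence."""
    name = min(sentence["signals"], key=sentence["signals"].get)
    return name, sentence["signals"][name]


flagged = [s for s in export["sentences"] if s["label"] != "grounded"]
for s in flagged:
    name, value = weakest_signal(s)
    print(f"{s['label']}: {s['text']!r} (weakest signal: {name} = {value})")
```

A very low NLI score next to a moderate classifier score, as in the invented second sentence, would suggest the claim is not entailed by the source rather than merely phrased differently.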

Exporting results

Two download buttons appear after analysis:

Export JSON — downloads the full AnalysisResult as a JSON file. The schema matches sumlens/types.py and round-trips via AnalysisResult.model_validate().
Export PDF — downloads a human-readable PDF containing the colour-annotated summary, a legend, and a per-sentence signal-scores table.

Development

Quality gate (CI enforces on every PR)

ruff check . && mypy sumlens tests app.py && pytest -q --cov=sumlens --cov-fail-under=70

Lint (ruff), strict type-check (mypy), and tests with a ≥ 70 % coverage gate must pass before any PR is merged.

Project layout

sumlens/
  types.py          # canonical data model (AnalysisResult, etc.)
  ingest.py         # PDF / text → Document
  summarise.py      # BART summarisation
  signals/
    classifier.py   # Signal A — LettuceDetect
    nli.py          # Signal B — DeBERTa NLI
    attribution.py  # Signal C — Inseq attribution
  fuse.py           # logistic-regression fusion + Platt calibration
  pipeline.py       # orchestrates ingest → summarise → signals → fuse
app.py              # Gradio UI entry point
tests/              # pytest suite (all models mocked)
docs/               # requirements, data model, research plan

Documentation

docs/requirements.md — functional / non-functional requirements, MoSCoW, user stories, traceability.
docs/data-model.md — canonical data types and JSON schema.
docs/research-plan.md — signals, fusion, evaluation methodology.
docs/mockup.html — static HTML wireframe of the two-panel dashboard UI (open in any browser).
docs/use-case.puml — PlantUML use-case diagram (UC-01 "Verify a Summary").

License

Released under the MIT License.
