Tana Semantic Search (Local BGE + LanceDB)

A high-performance semantic search engine for Tana that syncs nodes to a local vector store for instant, conceptual retrieval. Built to bypass the limitations of keyword-only search by providing deep, semantic understanding of your knowledge graph.

🚀 Features

Local Semantic Sync: Automatically pulls nodes from your Tana workspace and indexes them into a local vector store.
FastEmbed BGE Model: Utilises the BAAI/bge-small-en-v1.5 model for high-quality embeddings without requiring an external API (runs entirely on your machine).
LanceDB Vector Store: Uses the high-performance LanceDB for serverless, disk-based vector storage.
Hybrid Search Logic: Designed to work alongside Tana's native keyword search for the ultimate "Second Brain" retrieval experience.
Background Automation: Includes scripts for scheduled syncs via launchd or cron.

🛠️ Technical Stack

Python 3.10+
LanceDB: Disk-persistent vector database.
FastEmbed: Lightweight CPU-optimised embedding generation.
Tana MCP: Interfaces with Tana via the Model Context Protocol.

📋 Prerequisites

A Tana account and an active Tana Token.
Python installed on your system.
Tana Desktop app (recommended for local MCP bridge).

⚙️ Setup

Clone the repository:

git clone https://github.com/krshirkoohi/Tana-Embeddings-Public.git
cd Tana-Embeddings-Public

Install dependencies:
```
pip install -r requirements.txt
```
Configure Environment: Create a .env file in the root directory (this is ignored by Git):
```
TANA_TOKEN=your_tana_token_here
```

🔍 Usage

Syncing Nodes

To pull your latest Tana nodes and update the local vector store:

python3 sync_tana.py

Performing Semantic Search

To search your Tana workspace conceptually:

python3 search_tana.py "How do I handle project management in Tana?"

🛡️ Security & Privacy

This tool is designed with privacy in mind. Your embeddings are generated locally and stored on your own disk. No data is sent to external embedding providers (unless you explicitly configure the Google AI Gemini model).

Created as part of the APOPHIS / Gemini CLI workspace for Kavia.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
GEMINI.md		GEMINI.md
README.md		README.md
aggressive_sync.py		aggressive_sync.py
force_index.py		force_index.py
hyper_sync.py		hyper_sync.py
json_smasher.py		json_smasher.py
requirements.txt		requirements.txt
safe_tana_search.sh		safe_tana_search.sh
search_tana.py		search_tana.py
sync_tana.py		sync_tana.py
test_req.py		test_req.py
turbo_index.py		turbo_index.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tana Semantic Search (Local BGE + LanceDB)

🚀 Features

🛠️ Technical Stack

📋 Prerequisites

⚙️ Setup

🔍 Usage

Syncing Nodes

Performing Semantic Search

🛡️ Security & Privacy

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Tana Semantic Search (Local BGE + LanceDB)

🚀 Features

🛠️ Technical Stack

📋 Prerequisites

⚙️ Setup

🔍 Usage

Syncing Nodes

Performing Semantic Search

🛡️ Security & Privacy

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages