Skip to main content

BATHO

Reduce token spend and hallucinations. Index your codebase into a graph your AI agents can navigate.

BATHO β€” zsh
typing…
# Build full hypergraph (baseline)
➜~β–‹

Why Batho?

Companies are reducing AI usage because token spend is getting pricey. Batho fixes cost and quality.

Lower Token Costs

Your agent traverses a graph instead of reading entire files. Feed the LLM only what it needs.

Fewer Hallucinations

Deterministic, tree-sitter-parsed relationships. No guessing, no embeddings β€” just facts.

More Use Cases

When cost and accuracy are solved, automation possibilities widen with imagination.

Works With Your Stack

Python, TypeScript, Rust, Go, Java, and more. Batho parses 40+ languages so your agent understands your entire codebase.

Learn more β†’

Spend Less on Tokens

Batho compresses entire codebases into a graph your agent traverses β€” using a fraction of the tokens.

Learn more β†’

Track Codebase Evolution

Versioned snapshots and incremental diffing let your agent understand what changed and why β€” across every commit.

Learn more β†’

Zero-Copy Performance

Apache Arrow IPC storage means zero-copy, memory-mapped reads. Your agent queries the graph instantly.

Learn more β†’

Reconstruct Anything

Reconstruct any file byte-for-byte from the graph. Cryptographic hash integrity means nothing is lost.

Learn more β†’

Zero Config to Start

Batho runs with zero config. Customize with a single YAML file when you need control.

Learn more β†’

How It Works

From source code to AI-ready graph in four steps

01

Build

Parse your entire codebase into a structured hypergraph with cross-file symbol resolution.

batho build --root .
02

Patch

Apply incremental updates 10-100x faster than full re-indexing using native content hashing.

batho patch --root .
03

Export

Export a transportable ZIP artifact by default, or use --json for LLM-optimized views.

batho export --root .
04

Verify

Execute integrity checks and diagnostic routines to repair database corruption or inconsistencies.

batho fix --dry-run
10x
Less Tokens
vs. raw file injection
0
Hallucinations
deterministic, not guessed
40+
Languages
tree-sitter powered
>95%
Cache Hit
typical PR changes

Get Started in Seconds

One command to give your AI agent a map of your entire codebase.

$pip install batho

Stop paying to dump your repo into an LLM.

Index your codebase in one command. Your AI agent gets the map β€” not the whole territory.