Comma for either/or — dharma, courage. Spelling forgiving — corage finds courage.

    Falsafa by the numbers

    Every claim in the launch writeup deep-links to its source. Audit anything.

    Corpus

    949

    total works

    deep-links to the catalogue

    36,072,428

    total words across all variants

    view raw manifest

    Eras & languages

    9

    English, French, German, Greek, Kawi, Latin, Old English, Sanskrit, Urdu

    view languages

    10

    eras spanned

    view era index

    Eval

    146

    cases run, paper-grade post-patch

    view raw eval JSON

    94%

    mechanical pass rate

    view raw eval JSON

    917

    paragraph citations resolved against the corpus

    audit each one

    4

    verse-marker hallucinations (target: 0)

    why this matters

    Sample size is small (post-anti-cheat-patch). Scoring is deterministic — citation match against expected_works, no LLM judge.

    Benchmark vs hybrid RAG

    Citation-validity rate

    Falsafa forthcoming
    Hybrid forthcoming

    p95 latency

    Falsafa forthcoming
    Hybrid forthcoming

    One-time embedding cost

    Falsafa $0
    Hybrid forthcoming

    Hybrid RAG baseline (apps/baseline/) is in development. Falsafa's approach reads markdown directly through MCP tools, so there are no embeddings to compute. Hybrid figures land when the baseline runs against the same 1,000-question pool.

    Build

    9,124

    logical chapters

    view manifest

    16,433

    variant entries (translation, transliteration, original)

    view manifest

    767,729

    cited paragraphs with stable hash IDs

    view corpus tree