Knowledge compiler roadmap

Org2 is a local-first knowledge compiler. Plain-text notes are the source corpus; the product value is the structured, queryable, maintainable system compiled from that corpus.

This page is the product/technical direction behind the current implementation work.

Product frame

Org2 should compete on the things plain prose tools leave underspecified: planning semantics, graph health, provenance, publishing, and agent collaboration over local files.

Org2 earns its complexity when it makes a personal knowledge base operational:

  • tasks and planning become agenda data

  • links, IDs, aliases, and properties become graph data

  • raw notes and captured material become curated notes

  • curated notes compile into views, reports, lint results, and published sites

  • provenance gives generated artifacts clear origins

Compiler model

The intended corpus flow is:

raw/ -> notes/ -> compiled/ -> views/ -> publish/
  • raw/ holds inboxes, imports, transcripts, scraped material, and other source inputs.

  • notes/ holds the human-edited canonical corpus.

  • compiled/ holds generated machine-readable artifacts.

  • views/ holds generated human-facing local reports and review surfaces.

  • publish/ holds deployable/public artifacts.

Generated zones should be marked, lintable, and reproducible from source notes where possible.

First-class semantics

The compiler layer should keep moving more behavior out of editor-specific conventions and into portable Org2 semantics:

  • tasks, TODO states, planning timestamps, priorities, and tags

  • IDs, aliases, file links, wiki links, backlinks, and graph edges

  • properties, provenance, generated status, and artifact roles

  • source ranges that let tools point back to the original file and heading

  • health checks that detect broken links, duplicate IDs, stale generated files, and unsafe corpus boundaries

Implementation direction

Near-term work should bias toward small compiler passes and CLI-backed workflows that editor integrations can reuse.

Good examples:

  • richer org2 lint checks for corpus structure and generated artifacts

  • explicit provenance fields for generated outputs

  • query/search commands that include citations back to source files

  • safer roam/linkify passes that can preview changes and exclude generated or archived paths

  • publishing docs that explain the corpus model and its user-facing workflows

This keeps Org2 useful today while preventing the project from collapsing into editor-specific glue.

Non-goals

  • No Emacs runtime dependency.

  • No need to win as a generic prose editor.

  • No byte-for-byte clone of every Org Mode behavior before shipping useful workflows.

  • No opaque generated artifacts that cannot be traced back to source notes.

Done means compiled artifacts matter

A mature Org2 setup should make notes feel like source code for a personal knowledge system: easy to edit, but also compilable into agenda views, graph artifacts, maintenance reports, and published outputs.