10.1 What it does ¶
- Reads the file and splits it into chunks (default 4000 characters,
broken at paragraph then sentence boundaries).
- For each chunk, POSTs to your configured chat-completion endpoint
(
{llm_base_url}/chat/completions) with a system prompt that
instructs the model to extract durable facts as a JSON array.
- Collects all returned facts across chunks.
- Default behavior: dry-run. Prints the JSON to stdout (or
--output FILE) so you can review before committing.
- If you pass
--commit, submits the facts via
remember_batch in 200-at-a-time chunks, with
dedup_policy=skip_if_near by default so re-runs after manuscript
edits don’t double up.