kb/openspec/changes/archive/2026-03-29-kb-title-in-chunks/specs/engine-api/spec.md at e39e00a2c0a74263a6912f07564b2cd5043202c4

steve bbe6a5e909 Add dev-up script and archive kb-title-in-chunks change

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-30 07:25:22 +01:00


MODIFIED Requirements

Requirement: Background ingestion worker

The engine SHALL run a background worker that processes queued jobs. The worker SHALL process one job at a time. For each job, it SHALL: detect document type, run the appropriate chunking pipeline (Docling for PDFs, header-based for Markdown, AST-based for code, whole-text for notes), build enriched text by prepending the document title (and section header when present) to each chunk's text, generate embeddings using the enriched text and the resident model, insert chunks (with both raw text and enriched text) and vectors into the database, and move the original file to persistent storage.
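The enrichment step can be sketched as follows. This is a minimal illustration, not the engine's actual code: the function name `build_enriched_text` and the blank-line separator between title, header, and text are assumptions, since the requirement only says that the title (and section header when present) is prepended.

```python
from typing import Optional


def build_enriched_text(
    title: str,
    text: str,
    section_header: Optional[str] = None,
) -> str:
    """Prepend the document title, and the section header if present, to a chunk.

    The separator ("\n\n") is an assumption made for this sketch.
    """
    parts = [title]
    if section_header:
        parts.append(section_header)
    parts.append(text)
    return "\n\n".join(parts)


# The raw text is stored unchanged; only the enriched text is embedded.
raw = "Run the installer and restart the service."
enriched = build_enriched_text("Deployment Guide", raw, "Installation")
```

Keeping the raw and enriched strings separate matches the requirement that both are inserted into the database.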

Scenario: Successful PDF ingestion

WHEN the background worker picks up a queued PDF job
THEN it SHALL update the job status to processing, run Docling conversion and chunking, build enriched text for each chunk by prepending the document title, embed all chunks using enriched text, insert document and chunks into the database, move the staged file to {data_dir}/documents/{content_hash}.pdf, update documents.stored_path with the permanent path, store the original filename in documents.original_filename, update the job status to done with the resulting document_id and chunk count, and clean up the staging entry
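The file move in this scenario might look like the sketch below. The helper name `move_to_permanent_storage` is hypothetical, and `content_hash` is taken as an input because the scenario does not say how it is computed. Only the destination layout `{data_dir}/documents/{content_hash}.pdf` comes from the text above.

```python
import shutil
from pathlib import Path


def move_to_permanent_storage(staged: Path, data_dir: Path, content_hash: str) -> Path:
    """Move a staged PDF to {data_dir}/documents/{content_hash}.pdf.

    Returns the permanent path, which the caller would write to
    documents.stored_path.
    """
    dest_dir = data_dir / "documents"
    dest_dir.mkdir(parents=True, exist_ok=True)
    dest = dest_dir / f"{content_hash}.pdf"
    shutil.move(str(staged), str(dest))
    return dest
```

Because the stored name is derived from the content hash, the original filename has to be kept separately, which is why the scenario also records `documents.original_filename`.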

Scenario: Ingestion failure

WHEN the background worker encounters an error during processing (e.g., corrupt PDF)
THEN it SHALL update the job status to failed with the error message, delete the staged file, and continue processing the next queued job
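A rough sketch of a worker loop with this failure behaviour is shown below. It uses an in-memory `deque` and plain dicts in place of the real job table, and `process` and `delete_staged` are hypothetical callables standing in for the ingestion pipeline and staging cleanup.

```python
from collections import deque
from typing import Any, Callable, Deque, Dict, Tuple

Job = Dict[str, Any]


def run_worker(
    queue: Deque[Job],
    process: Callable[[Job], Tuple[int, int]],
    delete_staged: Callable[[Job], None],
) -> None:
    """Process queued jobs one at a time; a failed job never stops the loop."""
    while queue:
        job = queue.popleft()
        job["status"] = "processing"
        try:
            # Hypothetical pipeline call: returns (document_id, chunk_count).
            document_id, chunk_count = process(job)
        except Exception as exc:
            # Record the error, drop the staged file, and move on.
            job["status"] = "failed"
            job["error"] = str(exc)
            delete_staged(job)
            continue
        job["status"] = "done"
        job["document_id"] = document_id
        job["chunk_count"] = chunk_count
```

Catching the exception per job, rather than around the whole loop, is what lets the worker continue with the next queued job.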

Scenario: Search during active ingestion

WHEN a search request arrives while the background worker is processing a job
THEN the search SHALL execute without blocking on the worker's writes (the database runs in SQLite WAL mode) and SHALL return results from already-ingested documents
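The non-blocking behaviour can be demonstrated with Python's standard `sqlite3` module. This is an illustrative sketch only: the `chunks` table and the function name are assumptions, and the engine's real schema is not shown in this spec.

```python
import sqlite3


def demo_read_during_write(db_path: str) -> int:
    """Return how many rows a reader sees while a writer holds an open transaction."""
    # isolation_level=None puts the connection in autocommit mode so that
    # transactions are controlled explicitly with BEGIN/COMMIT.
    writer = sqlite3.connect(db_path, isolation_level=None)
    writer.execute("PRAGMA journal_mode=WAL")  # persistent: stored in the database file
    writer.execute("CREATE TABLE IF NOT EXISTS chunks (id INTEGER PRIMARY KEY, text TEXT)")
    writer.execute("INSERT INTO chunks (text) VALUES ('already ingested')")

    # Simulate the worker in the middle of an ingestion job.
    writer.execute("BEGIN IMMEDIATE")
    writer.execute("INSERT INTO chunks (text) VALUES ('in flight')")

    # A separate connection plays the role of a search request.
    reader = sqlite3.connect(db_path)
    visible = reader.execute("SELECT COUNT(*) FROM chunks").fetchone()[0]

    writer.execute("COMMIT")
    reader.close()
    writer.close()
    return visible
```

Called on a fresh file-backed database, the reader returns immediately and sees only the committed row. WAL mode requires a real file, so an in-memory database would not show this behaviour.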

Requirement: Engine status and reindex

The engine SHALL provide status information and support re-embedding all chunks. The version field in the status response SHALL always be present and SHALL reflect the engine's release version as read from the VERSION file. This field is the contract used by clients for compatibility checking.

Scenario: Get engine status

WHEN a client sends GET /api/v1/status
THEN the engine SHALL return JSON with version (string, from the VERSION file), model_name, embedding_dim, GPU device info, database stats (document count by type, total chunks, DB size), and queue stats (counts of queued and processing jobs)
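One possible shape for the status payload is sketched below. Only `version`, `model_name`, and `embedding_dim` are field names taken from the scenario. The nested keys (`gpu`, `database`, `queue`, and their children) and the function names are assumptions made for illustration.

```python
from pathlib import Path
from typing import Any, Dict


def read_version(version_file: Path) -> str:
    """Read the release version from the VERSION file, stripping whitespace."""
    return version_file.read_text(encoding="utf-8").strip()


def build_status(
    version_file: Path,
    model_name: str,
    embedding_dim: int,
    gpu_device: str,
    documents_by_type: Dict[str, int],
    total_chunks: int,
    db_size_bytes: int,
    queued: int,
    processing: int,
) -> Dict[str, Any]:
    """Assemble a JSON-serialisable status dict; nested key names are hypothetical."""
    return {
        "version": read_version(version_file),  # always present
        "model_name": model_name,
        "embedding_dim": embedding_dim,
        "gpu": {"device": gpu_device},
        "database": {
            "documents_by_type": documents_by_type,
            "total_chunks": total_chunks,
            "size_bytes": db_size_bytes,
        },
        "queue": {"queued": queued, "processing": processing},
    }
```

Reading the version from the file on each call keeps the response tied to the VERSION file, which is the contract clients use for compatibility checks.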

Scenario: Trigger reindex

WHEN a client sends POST /api/v1/reindex
THEN the engine SHALL re-embed all existing chunks using the enriched_text column and the currently loaded model, and return progress information. This operation SHALL NOT block search queries.
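A batched reindex along these lines could be sketched as follows. The `enriched_text` column is named in the scenario. The `chunks` and `vectors` table layouts, the JSON encoding of vectors, the batch size, and the `embed` callable are assumptions made for this sketch.

```python
import json
import sqlite3
from typing import Callable, Dict, Iterator, List, Sequence


def reindex(
    conn: sqlite3.Connection,
    embed: Callable[[Sequence[str]], List[List[float]]],
    batch_size: int = 64,
) -> Iterator[Dict[str, int]]:
    """Re-embed every chunk from enriched_text, yielding progress after each batch."""
    total = conn.execute("SELECT COUNT(*) FROM chunks").fetchone()[0]
    done = 0
    last_id = 0
    while True:
        rows = conn.execute(
            "SELECT id, enriched_text FROM chunks WHERE id > ? ORDER BY id LIMIT ?",
            (last_id, batch_size),
        ).fetchall()
        if not rows:
            break
        vectors = embed([text for _, text in rows])
        # One short transaction per batch keeps write locks brief.
        with conn:
            conn.executemany(
                "INSERT OR REPLACE INTO vectors (chunk_id, embedding) VALUES (?, ?)",
                [(chunk_id, json.dumps(vec)) for (chunk_id, _), vec in zip(rows, vectors)],
            )
        done += len(rows)
        last_id = rows[-1][0]
        yield {"done": done, "total": total}
```

Committing per batch rather than in one large transaction is one way to produce progress information while keeping each write short, so concurrent searches are not held up.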
