M13: Multi-Translation & Schema Evolution — ingest/v0.2.0-alpha, content/v0.2.0-alpha
Version tag: ingest/v0.2.0-alpha, content/v0.2.0-alpha
Phase: P0: Foundation (Phases 1--2), P2: Enhancement (Phases 3--4)
Target: Weeks 5--26
Sprints: S2--S12
Phase Context
Goal: Establish symmetric Translation-node schema for multi-translation support, fix critical data corruption (wrong translation text on 31K Bible passages), recover ~350K silently-discarded data points, and progressively enrich the graph with word alignment, lexicon, and citation data.
Key constraint: This work addresses two CRITICAL bugs blocking all downstream features: every Bible passage shows the wrong translation text, and 9 translations have no storage mechanism. Phases 1--2 must be complete before the M03 Scripture Reader can ship correct data.
ZenHub Configuration
| Field | Value |
|---|---|
| Milestone | M13: Multi-Translation & Schema Evolution |
| Due Date | 2026-09-06 |
| Default Pipeline | Product Backlog |
| Primary Epic(s) | Schema Foundation & Critical Fixes, Content Service Translation Support, Word Alignment & Lexicon Enrichment, Citation Extraction |
Prerequisites
- M01: Data Pipeline -- ingest pipeline must exist before modification (Stages 0--6 operational)
- M00: Tech Prep -- FalkorDB local instance, structured logging
Epic: Schema Foundation & Critical Fixes (Phase 1)
Translation schema foundation, ID generators, slug registry, JST versification, pipeline rework.
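As an illustration of how a slug registry and an ID generator could fit together, here is a minimal sketch. Everything in it is an assumption made for the example: the `SlugRegistry` class, the `translation_node_id` function, the slug pattern, and the `passage@slug` ID format are hypothetical and are not taken from the ingest codebase.

```python
import re

# Hypothetical slug rule: lowercase alphanumerics separated by single hyphens.
_SLUG_RE = re.compile(r"^[a-z0-9]+(?:-[a-z0-9]+)*$")


class SlugRegistry:
    """In-memory registry mapping translation slugs to display names."""

    def __init__(self):
        self._slugs = {}

    def register(self, slug, display_name):
        # Reject malformed slugs early so bad IDs never reach the graph.
        if not _SLUG_RE.match(slug):
            raise ValueError(f"invalid slug: {slug!r}")
        existing = self._slugs.get(slug)
        # Re-registering the same pair is a no-op; a conflicting name is an error.
        if existing is not None and existing != display_name:
            raise ValueError(f"slug {slug!r} already registered as {existing!r}")
        self._slugs[slug] = display_name

    def __contains__(self, slug):
        return slug in self._slugs


def translation_node_id(registry, slug, passage_id):
    """Build a deterministic Translation node ID (assumed format: passage@slug)."""
    if slug not in registry:
        raise KeyError(f"unknown translation slug: {slug!r}")
    return f"{passage_id}@{slug}"
```

With this shape, every translation of a passage gets the same kind of node and ID, and an unregistered slug fails loudly instead of being silently discarded.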
Issues
| Issue | Title | Status | Notes |
|---|---|---|---|
| M13-001 | Translation Node Cypher Templates | ✅ Done | PR #959 |
| M13-002 | Translation ID Generators | ✅ Done | PR #959 |
| M13-003 | Translation Slug Registry Module | ✅ Done | PR #959 |
| M13-004 | Translation Slug Discovery | ✅ Done | PR #959 |
| M13-005 | JST Versification Support | ✅ Done | PR #959 |
| M13-006 | Scripture Text Pipeline Translation Support | ✅ Done | PR #968 |
| M13-007 | Dictionary Gloss Resolution | ✅ Done | PR #972 |
| M13-008 | Scholarly Metadata Fix | ✅ Done | PR #972 |
| M13-009 | Missing Secondary Indexes | ✅ Done | PR #972 |
| M13-010 | PendingPassage Monitoring | ✅ Done | PR #972 |
| M13-011 | Phase 1 Unit Tests | ✅ Done | PR #978 |
| M13-012 | Phase 1 Integration Tests | ✅ Done | PR #978 |
Epic: Content Service Translation Support (Phase 2)
Content query migration, translation routes, Book/Chapter hierarchy, note anchor positions.
Issues
| Issue | Title | Status | Notes |
|---|---|---|---|
| M13-013 | Content Service Translation Queries | ✅ Done | PR #982 |
| M13-014 | Translation Registry Routes & Models | ✅ Done | PR #982 |
| M13-015 | Book/Chapter Hierarchy Pipeline | ✅ Done | PR #982 |
| M13-016 | Note Anchor Positions | ✅ Done | PR #982 |
| M13-017 | Phase 2 Unit Tests | ✅ Done | PR #983 |
| M13-018 | Phase 2 Integration Tests | ✅ Done | PR #983 |
Epic: Word Alignment & Lexicon Enrichment (Phase 3)
TranslationWord pipeline, lexicon JSON properties, DefinitionSense and TranslationOccurrence graph nodes.
Issues
| Issue | Title | Status | Notes |
|---|---|---|---|
| M13-019 | TranslationWord Pipeline | ✅ Done | PR #1344 (batch A1.6) |
| M13-020 | Lexicon Enrichment -- JSON Properties | ✅ Done | PR #1344 (batch A1.6) |
| M13-021 | Lexicon Enrichment -- Graph Nodes | ✅ Done | PR #1344 (batch A1.6) |
| M13-022 | Phase 3 Tests | ✅ Done | PR A2.4 |
Epic: Citation Extraction (Phase 4)
Scripture reference parser, BD mentions, section citations extraction.
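To make the parsing step concrete, the following is a minimal sketch of a regex-based scripture reference parser. It is not the M13-023 implementation: the `ScriptureRef` type, the `find_references` function, and the pattern are assumptions for illustration. The sketch handles only single-word book names with an optional leading number (for example `John 3:16` or `1 Nephi 3:7-9`); multi-word book names and chapter-only references would need more work.

```python
import re
from typing import List, NamedTuple


class ScriptureRef(NamedTuple):
    book: str
    chapter: int
    verse_start: int
    verse_end: int


# Optional leading number ("1 "), a capitalized book word with optional
# trailing period, then chapter:verse and an optional verse range.
_REF_RE = re.compile(
    r"\b((?:[1-4]\s)?[A-Z][a-z]+\.?)\s+(\d+):(\d+)(?:[-\u2013](\d+))?"
)


def find_references(text: str) -> List[ScriptureRef]:
    """Return every chapter:verse reference found in free text."""
    refs = []
    for match in _REF_RE.finditer(text):
        book = match.group(1).rstrip(".")
        chapter = int(match.group(2))
        start = int(match.group(3))
        end = int(match.group(4)) if match.group(4) else start
        # Skip malformed ranges such as "3:9-7" rather than guessing intent.
        if end < start:
            continue
        refs.append(ScriptureRef(book, chapter, start, end))
    return refs
```

For example, `find_references("See John 3:16 and 1 Nephi 3:7-9.")` yields one reference for John 3:16 and one for 1 Nephi 3:7 through 3:9. A real pipeline would likely also resolve book abbreviations against a canonical book list before creating citation edges.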
Issues
| Issue | Title | Status | Notes |
|---|---|---|---|
| M13-023 | Scripture Reference Parser | ✅ Done | PR #1337 (batch A0.2) |
| M13-024 | BD Mentions Extraction Pipeline | ✅ Done | PR #1345 (batch A1.5) |
| M13-025 | Section Citations Extraction Pipeline | ✅ Done | PR #1345 (batch A1.5) |
| M13-026 | Phase 4 Tests | ✅ Done | PR A2.4 |
Progress: 26 Done · 0 Partial · 0 To Do (100%)
Summary
| Metric | Count |
|---|---|
| Total Issues | 26 |
| Sub-Issues | 4 |
| Total Estimate (pts) | ~112 |
| Sprints | S2--S12 |
| Dependencies (blocking) | 32 |
| Dependencies (blocked by) | 32 |