Skip to main content

Gap Inventory

This page catalogs the 16 identified data gaps in GospeLib's scripture content layer, ranked by severity.

Gap Table

#GapSeverityRationale
1Morphological tags (interlinear)CriticalExisting interlinear data is 100% missing morphology — the primary scholarly feature
2Source tokens (interlinear)CriticalExisting interlinear data is 100% missing source tokens — renders interlinear incomplete
3Transliteration (interlinear)HighNo transliteration data in existing interlinear — blocks accessibility for non-readers of Hebrew/Greek
4Cross-referencesHighNo cross-reference data ingested — core navigation feature for scripture study
5Septuagint (LXX)HighGreek OT text not available — essential for scholarly OT study
6Versification mappingHighNo mapping between English, Hebrew, Greek, Latin verse numbering systems
7Person names databaseMediumNo structured people data — needed for graph navigation and search
8Place names / geocodingMediumNo geographic data — needed for map features and place-based navigation
9Vulgate (Latin)MediumLatin Bible text not available — relevant for historical/scholarly completeness
10Additional translationsMediumOnly 9 translations — competitors offer dozens to hundreds
11Dead Sea ScrollsLowETCBC/dss provides word-level transcriptions under MIT license — viable for Phase 3 scholarly features
12Aramaic LexiconLowComposite approach via SEDRA IV (Apache 2.0) + Sefaria Jastrow (CC-BY-NC) provides substantial coverage
13Extended CommentaryLowCrossWire SWORD modules provide ~10 public-domain verse-aligned commentaries via SWORD→OSIS→JSON pipeline
14Syntax / discourse analysisMediumNo syntactic structure data — needed for advanced Greek/Hebrew study features
15Synoptic Gospel parallelsLowNo structured pericope-level parallel mapping across Synoptic Gospels
16OT quotations in NTLowNo structured mapping of Old Testament passages quoted in the New Testament

BLB Translation Inventory (Commercial Reference)

The following translations are available on BLB but not accessible for ingestion due to BLB's no-scraping policy. This table documents the scope of what exists commercially, informing our open-source sourcing priorities for Gap #10.

TranslationLicense Status
KJV — King James VersionPublic domain
ASV — American Standard VersionPublic domain
YLT — Young's Literal TranslationPublic domain
DBY — Darby TranslationPublic domain
WEB — World English BiblePublic domain
HNV — Hebrew Names VersionPublic domain
WLC — Westminster Leningrad CodexPublic domain
TR — Textus ReceptusPublic domain
BES — Brenton's English SeptuagintPublic domain
VUL — Latin VulgatePublic domain
SVD — Smith & Van Dyck ArabicPublic domain
LXX — Septuagint (Rahlfs)Non-commercial only
mGNT — Morphological Greek NTRestricted
NKJV — New King James VersionCopyrighted (500-verse limit)
NLT — New Living TranslationCopyrighted (500-verse limit)
NIV — New International VersionCopyrighted (500-verse limit)
ESV — English Standard VersionCopyrighted (500-verse limit)
CSB — Christian Standard BibleCopyrighted
NASB20 — NASB 2020Copyrighted (500-verse limit)
NASB95 — NASB 1995Copyrighted (500-verse limit)
LSB — Legacy Standard BibleCopyrighted
AMP — Amplified BibleCopyrighted (500-verse limit)
NET — New English TranslationCopyrighted
RSV — Revised Standard VersionCopyrighted (500-verse limit)
RVR60 — Reina-Valera 1960 (Spanish)Copyrighted
NAV — Arabic New Arabic VersionCopyrighted

Takeaway: Of BLB's 26+ translations, ~11 are public domain and available from open-source repositories (scrollmapper, ebible.org). The copyrighted translations require direct publisher licensing.

BLB Commentary Inventory (Commercial Reference)

BLB hosts 50+ commentary authors — most are individually copyrighted. Public-domain authors marked with ✅ may be obtainable from CCEL, Project Gutenberg, or similar archives.

  • ✅ Matthew Henry (public domain)
  • ✅ Jamieson, Fausset & Brown (public domain)
  • ✅ John Calvin (public domain)
  • ✅ John Wesley (public domain)
  • ✅ C.H. Spurgeon (public domain)
  • ✅ Martin Luther (public domain)
  • ✅ Jonathan Edwards (public domain)
  • ✅ John Trapp (public domain)
  • ✅ John Bunyan (public domain)
  • ✅ Alexander Maclaren (public domain)
  • ✅ R.A. Torrey (public domain)
  • ✅ Scofield Reference Bible Notes (public domain)
  • ❌ David Guzik (copyrighted)
  • ❌ Chuck Smith (copyrighted)
  • ❌ John MacArthur (copyrighted)
  • ❌ J. Vernon McGee (copyrighted)
  • ❌ John Walvoord (copyrighted)
  • ❌ …and 35+ additional copyrighted authors

Takeaway: ~12 commentary authors are public domain and could be sourced from open archives. Creating structured, verse-aligned datasets from these would require significant curation effort (see Gap #13).