BibleData Dataset
The BibleData dataset provides a comprehensive collection of structured biblical texts and metadata, covering various translations, alongside historical texts.
The whole dataset contains 18 CSV files. To maintain readability of the inferred Schema Category and overall clarity we have worked with these 9 files:
AlamoPolyglot.csv
with plain-text data as well as metadata entries for the Bible manuscriptsBook.csv
with books of the Bible and their detailsCommandments.csv
with basic information on the biblical commandmentsEvent.csv
with specifics of the events describedHitchcocksBibleNamesDictionary.csv
with Roswell D. Hitchcock’s Bible Names DictionaryPerson.csv
with information about each named individual in the BiblePersonLabel.csv
with detailed information about individual’s labelsPersonRelationship.csv
with details on the relationships among individualsReference.csv
- with unique identifiers for each book, chapter and verse in the Bible
Entity | Data Link | Mapping |
---|---|---|
Alamo Polyglot | ||
Book | ||
Commandments | ||
Event | ||
Hitchcock's Dictionary | ||
Person | ||
Person Label | ||
Person Relationship | ||
Reference |
The Person, PersonLabel, and PersonRelationship data are transformed into a graph structure and stored in Neo4j, a graph database. The original data contains relational information about biblical figures, and their relationships with one another. This inherent graph-like structure makes Neo4j an ideal choice for storing and analyzing this data.
The relationships in the dataset form a natural graph structure, with people as nodes and their connections as edges. Neo4j is specifically designed to store, query, and analyze such graph data efficiently.
Entity | Output Mapping |
---|---|
Person | |
PersonLabel | |
PersonRelationship |
Generated Data Manipulation Language (DML) Commands: