BibleData Dataset
The BibleData dataset provides a comprehensive collection of structured biblical texts and metadata, covering various translations, alongside historical texts.
The full dataset contains 18 CSV files. To keep the inferred Schema Category readable and the overall picture clear, we worked with the following 9 files:
- AlamoPolyglot.csv with plain-text data as well as metadata entries for the Bible manuscripts
- Book.csv with books of the Bible and their details
- Commandments.csv with basic information on the biblical commandments
- Event.csv with specifics of the events described
- HitchcocksBibleNamesDictionary.csv with Roswell D. Hitchcock's Bible Names Dictionary
- Person.csv with information about each named individual in the Bible
- PersonLabel.csv with detailed information about individuals' labels
- PersonRelationship.csv with details on the relationships among individuals
- Reference.csv with unique identifiers for each book, chapter and verse in the Bible

| Entity | Data Link | Mapping |
|---|---|---|
| Alamo Polyglot | ||
| Book | ||
| Commandments | ||
| Event | ||
| Hitchcock's Dictionary | ||
| Person | ||
| Person Label | ||
| Person Relationship | ||
| Reference | ||

The Person, PersonLabel, and PersonRelationship data are transformed into a graph structure and stored in Neo4j, a graph database. The original data describes biblical figures and the relationships among them, so it forms a natural graph with people as nodes and their connections as edges. Neo4j is designed to store, query, and analyze such graph data efficiently, which makes it a good fit for this part of the dataset.

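
To illustrate the nodes-and-edges idea, the following minimal Python sketch turns a few person and relationship rows into Cypher statements. It is not the mapping used for this dataset: the column names (`person_id`, `person_name`, `person_id_1`, `person_id_2`, `relationship_type`), the sample rows, the `Person` label, and the `RELATED_TO` relationship type are all assumptions made for the example, and the real CSV headers may differ.

```python
import csv
import io
from typing import List

# Hypothetical sample data. The actual headers of Person.csv and
# PersonRelationship.csv are not shown in this document and may differ.
PERSON_CSV = """person_id,person_name
p1,Abraham
p2,Isaac
"""

RELATIONSHIP_CSV = """person_id_1,person_id_2,relationship_type
p1,p2,father
"""


def cypher_literal(value: str) -> str:
    """Quote a string as a Cypher literal, escaping backslashes and single quotes."""
    return "'" + value.replace("\\", "\\\\").replace("'", "\\'") + "'"


def person_statements(text: str) -> List[str]:
    """Build one MERGE statement per person row (people become nodes)."""
    rows = csv.DictReader(io.StringIO(text))
    return [
        f"MERGE (p:Person {{id: {cypher_literal(r['person_id'])}}}) "
        f"SET p.name = {cypher_literal(r['person_name'])};"
        for r in rows
    ]


def relationship_statements(text: str) -> List[str]:
    """Build one MATCH/MERGE statement per relationship row (connections become edges)."""
    rows = csv.DictReader(io.StringIO(text))
    return [
        f"MATCH (a:Person {{id: {cypher_literal(r['person_id_1'])}}}), "
        f"(b:Person {{id: {cypher_literal(r['person_id_2'])}}}) "
        f"MERGE (a)-[:RELATED_TO {{type: {cypher_literal(r['relationship_type'])}}}]->(b);"
        for r in rows
    ]


if __name__ == "__main__":
    for statement in person_statements(PERSON_CSV) + relationship_statements(RELATIONSHIP_CSV):
        print(statement)
```

The sketch builds statements as strings only to make the node and edge shapes visible. When loading real data through a Neo4j driver, query parameters are generally preferable to string interpolation.
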
| Entity | Output Mapping |
|---|---|
| Person | |
| PersonLabel | |
| PersonRelationship | |

Generated Data Manipulation Language (DML) Commands: