...
Source Files
Data Model Design
Process Flow (by example)
- Source/batch files are loaded into raw (R) tables
- The data is normalized and moved into standardized (S) tables, eg
prs_sor_role_records
and associated tables - Where a person has multiple records from a given SOR, the "best" biodem data is elected into
prs_sor_persons
- This covers (eg) correction of typos and name changes
- Where a person has multiple SORs, the "best" biodem data is elected into
prc_persons
- Note: The current table definition implies same SOR for best name & biodem
Guidelines
- The database is a "black box", so nothing sees it except for core Registry code. All manipulation is done via APIs.
- Where possible, tables should be consolidated to keep the number of tables down and simplify administering them. As a general rule of thumb, if two tables have the same structure and vary by only one column name, the tables should be consolidated.
...