Extract Tracker¶
These are the tables associated with extract tracking/handling. Links to other subjects will be provided as noted.
Extract Compression Type¶
This table tracks the valid compression types available.
Column Name | Column Type | Column Description |
---|---|---|
extract_compression_type_id | Integer | Auto incrementing integer sequence |
extract_compression_type | Integer | Unique compression type name |
Some default extract compression types are provided on initialization.
Extract Compression Type | Description |
---|---|
zip | Zip Compression |
Extract Dataset Type¶
This table tracks the relationship between extract files and dataset types.
Column Name | Column Type | Column Description |
---|---|---|
extract_id | Integer | Foreign key to the Extract Tracking table. |
dataset_type_id | Integer | Foreign key to the Dataset Type table. |
Extract Dependency¶
This table tracks the interdependencies between extract files.
Column Name | Column Type | Column Description |
---|---|---|
parent_extract_id | Integer | Foreign key to the Extract Tracking table. The parent extract of the relationship. |
child_extract_id | Integer | Foreign key to the Extract Tracking table. The child extract of the relationship. |
Extract File Type¶
This table tracks the valid types of files and their formats that available.
Column Name | Column Type | Column Description |
---|---|---|
extract_filetype_id | Integer | Auto incrementing unique sequence |
extract_filetype_code | String(25) | The file extension used by the filetype (i.e. csv) |
extract_filetype | String(75) | The unique name of the filetype. |
delimiter_char | String(1) | For filetypes like csv, the character used to delimit fields. |
quote_char | String(1) | For filetypes like csv, the character used to quote fields. |
escape_char | String(1) | For filetypes like csv, the character used to escape fields. |
Some default extract file types are provided on initialization.
Extract File Type | Description |
---|---|
csv | Comma Separated Values |
Extract Process Tracking¶
This table tracks the association between extract files and process runs.
Column Name | Column Type | Column Description |
---|---|---|
extract_tracking_id | Integer | Foreign key to the Extract Tracking table. |
process_tracking_id | Integer | Foreign key to the Process Tracking table. |
extract_process_status_id | Integer | Status of the extract from the process run. Foreign key to the Extract Status table. |
extract_process_event_date_time | Datetime/timestamp | The date/time of the status change for the extract. |
Extract Source¶
This table tracks the relationship between extracts and their sources.
Column Name | Column Type | Column Description |
---|---|---|
extract_id | Integer | Foreign key to Extract Tracking |
source_id | Integer | Foreign key to Source |
Extract Source Object¶
This table tracks the relationship between extracts and their source objects.
Column Name | Column Type | Column Description |
---|---|---|
extract_id | Integer | Foreign key to Extract Tracking |
source_id | Integer | Foreign key to Source Object |
Extract Status¶
This table is a lookup of system and user provided extract statuses.
Column Name | Column Type | Column Description |
---|---|---|
extract_status_id | Auto incrementing integer sequence | System key for the extract status |
extract_status_name | String(75) | Unique name of the extract status type |
Some default extract status types are provided on initialization.
Extract Status Type | Description |
---|---|
initializing | The extract file is being written to and/or is not ready for use. |
ready | The extract file is ready to be used. |
loading | The extract file is being used/loaded by a process run. |
loaded | The extract file has successfully been loaded by a process run. |
archived | The extract file has successfully been archived and can only be reprocessed if moved back out of archive location. |
deleted | The extract file has successfully been removed from the archive and can no longer be retrieved. |
error | Something went wrong in the writing/processing of the extract file. Until resolved, file is unusable. |
Custom extract status types can be added, but can not currently be utilized by the ProcessTracker framework.
Extract Tracking¶
This table is the core of the extract tracking subsystem.
Column Name | Column Type | Column Description |
---|---|---|
extract_id | Auto incrementing integer sequence | System key for the extract file |
extract_filename | String(750) | The unique filename of the extract file |
extract_location_id | Integer | Where the extract file can be located. Foreign key to Location |
extract_status_id | Integer | The current status of the extract file. Foreign key to Extract Status |
extract_registration_date_time | Datetime/timestamp | The date/time that the extract was initially registered into the system. |
extract_write_low_date_time | Datetime/timestamp | The earliest derived datetime for data processed in this extract at write. Optional audit field. |
extract_write_high_date_time | Datetime/timestamp | The latest derived datetime for data processed in this extract at write. Optional audit field. |
extract_write_record_count | Integer | For the given extract file at write, the total number of records processed. Optional audit field. |
extract_read_low_date_time | Datetime/timestamp | The earliest derived datetime for data processed in this extract at read. Optional audit field. |
extract_read_high_date_time | Datetime/timestamp | The latest derived datetime for data processed in this extract at read. Optional audit field. |
extract_read_record_count | Integer | For the given extract file at read, the total number of records processed. Optional audit field. |
extract_compression_type_id | Integer | Optional compression type used on the extract. Foreign key to Extract Compression Type |
extract_filetype_id | Integer | File type/format used by the extract. Foreign key to Extract File Type |
extract_filesize | Numeric | The size of the extract |
extract_filesize_type_id | Integer | The measure of the extract filesize. Foreign key to File size Type |
File size Type¶
This table provides file sizes for extracts.
Column Name | Column Type | Column Description |
---|---|---|
filesize_type_id | Integer | Primary key for the file size type |
filesize_type_name | String(75) | Full name of the file size type |
filesize_type_code | String(2) | Code used by the file size type |
There are defaults provided for file sizes on initialization:
System Key | System Value | System Code |
---|---|---|
1 | kilobytes | KB |
2 | megabytes | MB |
3 | gigabytes | GB |
4 | bytes | B |
Location¶
This table tracks extract file locations.
Column Name | Column Type | Column Description |
---|---|---|
location_id | Auto incrementing integer sequence | System key for the file location |
location_name | String(750) | Unique optional name of the location. Will be derived from the filepath if not provided. |
location_path | String(750) | Unique filepath. |
location_type_id | Integer | The type of location for given filepath. Foreign key to Location Type. |
location_file_count | The number of files currently in the given location. | Integer |
Location Type¶
This table tracks extract file location types.
Column Name | Column Type | Column Description |
---|---|---|
location_type_id | Auto incrementing integer sequence | System key for the location type |
location_type_name | String(25) | The unique name of the type of location. |
Some default location types are provided on initialization.
Location Type | Description |
---|---|
S3 | S3 bucket location |
Local Filesystem | Local filesystem location |