dqcheckr 0.1.1
- flag_new_columns, flag_dropped_columns, flag_type_changes (in CP-02)
and flag_column_order_change (CP-08) are now honoured. Setting a flag
to false suppresses the corresponding check in the report. Schema
changes are still recorded in the SQLite snapshot regardless of these
flags.
- type_inference_threshold is now configurable per dataset via
rule_overrides in the dataset YAML, or globally via default_rules in
the global config. It was previously fixed at 90%; that value remains
the default when the setting is absent. Affects QC-06, QC-07, QC-08,
QC-11, CP-02, CP-04, CP-05, CP-06, and CP-07.
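
As an illustration of the two changes in this release, a per-dataset
YAML file might look like the sketch below. Only the key names
rule_overrides, type_inference_threshold and the flag_* settings come
from these notes. The nesting of the flags under rule_overrides, and
writing the threshold as a fraction rather than a percentage, are
assumptions to check against the package documentation.

```yaml
# Hypothetical per-dataset config; the layout is an assumption.
rule_overrides:
  # Assumed to be a fraction (0.95 = 95%); defaults to 90% when omitted.
  type_inference_threshold: 0.95
  # Assumed placement: suppress the column-order check (CP-08) in the report.
  flag_column_order_change: false
  # Keep reporting type changes (CP-02).
  flag_type_changes: true
```

Placing type_inference_threshold under default_rules in the global
config instead would set it for every dataset that has no override of
its own.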
dqcheckr 0.1.0
Initial release.
- Single-snapshot quality checks: QC-01 to QC-14 (missing rate, empty
columns, duplicate rows, row/column counts, inferred types, numeric
stats, distinct counts, allowed values, numeric bounds, non-numeric
rate, key uniqueness, regex pattern, minimum row count) and SC-01/SC-02
(schema contract).
- Version comparison checks: CP-01 to CP-08 (row count change, schema
diff, missing rate change, numeric mean shift, new/dropped distinct
values, non-numeric rate change, column order).
- Custom organisation-specific checks via a plain R file.
- Self-contained HTML report with check tables, historical trend
charts, and column statistics appendix.
- SQLite snapshot database for long-term trend tracking.
- Supports CSV and fixed-width (FWF) file formats.
- Configuration via a global dqcheckr.yml and per-dataset YAML files.