validatetools - Checking and Simplifying Validation Rule Sets
Rule sets with validation rules may contain redundancies or contradictions. Functions for finding redundancies and problematic rules are provided, given a set a rules formulated with 'validate'.
Last updated 22 days ago
data-cleaningrulesvalidation
16 stars 1.97 score 4 dependenciesvalidate - Data Validation Infrastructure
Declare data validation rules and data quality indicators; confront data with them and analyze or visualize the results. The package supports rules that are per-field, in-record, cross-record or cross-dataset. Rules can be automatically analyzed for rule type and connectivity. Supports checks implied by an SDMX DSD file as well. See also Van der Loo and De Jonge (2018) <doi:10.1002/9781118897126>, Chapter 6 and the JSS paper (2021) <doi:10.18637/jss.v097.i10>.
Last updated 22 days ago
data-cleaningvalidation
402 stars 7.05 score 2 dependencies 8 dependentslintools - Manipulation of Linear Systems of (in)Equalities
Variable elimination (Gaussian elimination, Fourier-Motzkin elimination), Moore-Penrose pseudoinverse, reduction to reduced row echelon form, value substitution, projecting a vector on the convex polytope described by a system of (in)equations, simplify systems by removing spurious columns and rows and collapse implied equalities, test if a matrix is totally unimodular, compute variable ranges implied by linear (in)equalities.
Last updated 22 days ago
3 stars 1.54 score 0 dependencies 2 dependentsdeductive - Data Correction and Imputation Using Deductive Methods
Attempt to repair inconsistencies and missing values in data records by using information from valid values and validation rules restricting the data.
Last updated 22 days ago
data-cleaning
11 stars 1.65 score 5 dependenciesdeducorrect - Deductive Correction, Deductive Imputation, and Deterministic Correction
A collection of methods for automated data cleaning where all actions are logged. NOTE: active development has moved to the 'deductive' package.
Last updated 22 days ago
7 stars 1.41 score 13 dependenciesdcmodify - Modify Data Using Externally Defined Modification Rules
Data cleaning scripts typically contain a lot of 'if this change that' type of statements. Such statements are typically condensed expert knowledge. With this package, such 'data modifying rules' are taken out of the code and become in stead parameters to the work flow. This allows one to maintain, document, and reason about data modification rules as separate entities.
Last updated 22 days ago
10 stars 1.82 score 5 dependencieserrorlocate - Locate Errors with Validation Rules
Errors in data can be located and removed using validation rules from package 'validate'. See also Van der Loo and De Jonge (2018) <doi:10.1002/9781118897126>, chapter 7.
Last updated 27 days ago
data-cleaningerrorsinvalidation
21 stars 2.20 score 4 dependenciesvalidatesuggest - Generate Suggestions for Validation Rules
Generate suggestions for validation rules from a reference data set, which can be used as a starting point for domain specific rules to be checked with package 'validate'.
Last updated 9 months ago
data-cleaningvalidation
5 stars 1.25 score 5 dependencies