audit {rattle}R Documentation

Sample dataset for illustration Rattle functionality.

Description

The audit dataset is an artificially constructed dataset that has some of the characteristics of a true audit dataset for modelling productive and non-productive audits. It is used to illustrate binary classification.

The target variable is Adjusted, an integer which is either 0 (for non-producitve audits) or 1 (for productive audits). Productive audits are those that result in an adjustment being made to a client's claims. The dollar value of those adjustments is also recorded (as the Adjusted column).

The independent variables include Age, type of Employment, level of Education, Marital status, Occupation, level of Income, Sex, amount of Deductions being claimed, Hours worked per week, and country in which they have a bank Account.

An identifier is included as the ID variable.

The dataset is quite small, consisting of just 2000 entities. It primary purpose is to illustrate modelling in Rattle, so a minimally sized dataset is suitable.

Format

A data frame.


[Package rattle version 2.2.91 Index]