Generate a summary table of the data, including variable types, missing values, and basic statistics. The summary is displayed in the browser for easy inspection.
Usage
eda_summary(
data,
variables = NULL,
console_output = TRUE,
browser_output = TRUE,
describe_distribution_args = list(iqr = FALSE, quartiles = TRUE, by = NULL)
)Arguments
- data
A data frame to be inspected.
- variables
An optional vector of variable names to include in the inspection. If NULL, all variables are included.
- console_output
A logical value indicating whether to print descriptive statistics in the console. Defaults to TRUE.
- browser_output
A logical value indicating whether to display the summary table in the browser. Defaults to TRUE.
- describe_distribution_args
A list of additional arguments passed to
datawizard::describe_distribution().- dfSummary_args
A list of additional arguments passed to
summarytools::dfSummary().
Examples
# Inspect a subset of variables in the data
eda_summary(data = bkw_missings, variables = c("F600", "F800_1", "F800_2"), console_output = TRUE, browser_output = FALSE)
#> Warning: no DISPLAY variable so Tk is not available
#> Warning: unable to open connection to X11 display ''
#> Warning: unable to open connection to X11 display ''
#> Warning: unable to open connection to X11 display ''
#>
#> ── Data Frame Summary
#> Data Frame Summary
#> data
#> Dimensions: 1216 x 3
#> Duplicates: 1049
#>
#> ------------------------------------------------------------------------------------------------------------------------------------------------
#> Variable Label Stats / Values Freqs (% of Valid) Graph Valid Missing
#> ------------------ ----------------------------------------- ------------------------------ -------------------- ----------- --------- ---------
#> F600 Wie attraktiv finden Sie die BKW als 1. [1] 1 - Überhaupt nicht a 44 ( 3.6%) 1214 2
#> [haven_labelled, Arbeitgeberin? 2. [2] 2 50 ( 4.1%) (99.8%) (0.2%)
#> vctrs_vctr, 3. [3] 3 132 (10.9%) II
#> double] 4. [4] 4 474 (39.0%) IIIIIII
#> 5. [5] 5 313 (25.8%) IIIII
#> 6. [6] 6 117 ( 9.6%) I
#> 7. [7] 7 - Sehr attraktiv 84 ( 6.9%) I
#>
#> F800_1 Sicherheit und langfristige Stabilität 1. [1] Überhaupt nicht gut 11 ( 0.9%) 1215 1
#> [haven_labelled, des Arbeitgebers 2. [2] 2 21 ( 1.7%) (99.9%) (0.1%)
#> vctrs_vctr, 3. [3] 3 67 ( 5.5%) I
#> double] 4. [4] 4 304 (25.0%) IIIII
#> 5. [5] 5 277 (22.8%) IIII
#> 6. [6] 6 340 (28.0%) IIIII
#> 7. [7] Sehr gut 7 195 (16.0%) III
#>
#> F800_2 Karriere- und Entwicklungsmöglichkeiten 1. [1] Überhaupt nicht gut 11 ( 0.9%) 1211 5
#> [haven_labelled, 2. [2] 2 34 ( 2.8%) (99.6%) (0.4%)
#> vctrs_vctr, 3. [3] 3 70 ( 5.8%) I
#> double] 4. [4] 4 420 (34.7%) IIIIII
#> 5. [5] 5 331 (27.3%) IIIII
#> 6. [6] 6 247 (20.4%) IIII
#> 7. [7] Sehr gut 7 98 ( 8.1%) I
#> ------------------------------------------------------------------------------------------------------------------------------------------------
#>
#> ── Descriptive Statistics
#> Variable | Mean | SD | Range | Quartiles | Skewness | Kurtosis | n | n_Missing
#> -------------------------------------------------------------------------------------------
#> F600 | 4.36 | 1.32 | [1.00, 7.00] | 4.00, 5.00 | -0.19 | 0.44 | 1214 | 2
#> F800_1 | 5.15 | 1.29 | [1.00, 7.00] | 4.00, 6.00 | -0.44 | -0.11 | 1215 | 1
#> F800_2 | 4.78 | 1.21 | [1.00, 7.00] | 4.00, 6.00 | -0.19 | 0.15 | 1211 | 5
