The sumvar package quickly explores continuous, date and categorical
variables. sumvar brings the ease and simplicity of the sum
and tab commands from Stata to R.
Install from CRAN:
install.packages("sumvar")

Or install the development version from GitHub:

# install.packages("pak")
pak::pak("alstockdale/sumvar")

- dist_sum(): explore a continuous variable; stratify by a grouping variable with df %>% dist_sum(var, group).
- dist_date(): summarise date variables; usage is the same as dist_sum().
- tab1(): frequency table for a single categorical variable, e.g. df %>% tab1(var).
- tab(): two-way cross-tabulation, e.g. df %>% tab(var1, var2).
- dup(): explore duplicates and missing values across a single variable or an entire data frame.
- explorer(): generate an HTML or PDF exploratory data analysis report for a whole data frame.

Both dist_sum() and tab() include options for frequentist hypothesis tests. See the function help pages for details.

All functions accept the %>% pipe from a tibble or
data frame and return results as a tibble, so outputs can be saved and
further manipulated:
summary <- df %>% dist_sum(var)
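As a rough illustration of this workflow, the sketch below runs the calls described above on the built-in mtcars dataset. It assumes sumvar and dplyr are installed. The choice of mtcars, the object names, and the conversion of the grouping columns to factors are assumptions made for this example rather than documented requirements. No output is shown because the exact columns returned are not described here.

```r
library(dplyr)   # provides the %>% pipe, mutate() and glimpse()
library(sumvar)

# mtcars ships with R. Converting the grouping columns to factors is an
# assumption for illustration; the package may also accept numeric groups.
cars <- mtcars %>%
  mutate(
    cyl = factor(cyl),
    am  = factor(am, labels = c("automatic", "manual"))
  )

# Summarise a continuous variable, overall and stratified by a group.
overall <- cars %>% dist_sum(mpg)
by_cyl  <- cars %>% dist_sum(mpg, cyl)

# Frequency table and two-way cross-tabulation.
cyl_freq  <- cars %>% tab1(cyl)
cyl_by_am <- cars %>% tab(cyl, am)

# The results are described as tibbles, so they can be inspected
# or passed on to further dplyr verbs.
glimpse(by_cyl)
```

Naming the result something other than summary (for example by_cyl) avoids masking the base R summary() function in the same session.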