Provides a useful way to sort the variables(columns) according to their missingness.
sort_by_missingness(df, sort_by = "counts", descending = FALSE, ...)
A data.frame object
One of counts or percents. This determines whether the results are sorted by counts or percentages.
Logical. Should missing values be sorted in decreasing order ie largest to smallest? Defaults to FALSE.
Other arguments to specific functions. See "See also below"
A `data.frame` object sorted by number/percentage of missing values
sort_by_missingness(airquality, sort_by = "counts")
#> variable percent
#> 1 Wind 0
#> 2 Temp 0
#> 3 Month 0
#> 4 Day 0
#> 5 Solar.R 7
#> 6 Ozone 37
# sort by percents
sort_by_missingness(airquality, sort_by="percents")
#> variable percent
#> 1 Wind 0.000000
#> 2 Temp 0.000000
#> 3 Month 0.000000
#> 4 Day 0.000000
#> 5 Solar.R 4.575163
#> 6 Ozone 24.183007
# descending order
sort_by_missingness(airquality, descend = TRUE)
#> variable percent
#> 1 Ozone 37
#> 2 Solar.R 7
#> 3 Wind 0
#> 4 Temp 0
#> 5 Month 0
#> 6 Day 0