NEWS.md
There is now a function recode_as_value
for general recoding. See #41.
Fixed all_of
warnings in dplyr
select
and across
verbs.
Tests now use edition 3 of testthat. See https://github.com/r-lib/testthat/issues/1471 and https://github.com/Nelson-Gon/manymodelr/pull/22
Vignettes no longer include URLs to avoid future issues with broken links on CRAN.
Exclusion or inclusion in na_summary
via regular expressions is now possible via regex_kind
and providing a pattern_type
and pattern
.
exclusion
is now done entirely in base
R instead of using dplyr::select
.
get_na_means
and percent_missing
now support POSIXct
.
na_counts
and percent_na
are new vector focused functions to allow get
ting na_counts
and percent missingness for objects of classes like POSIXct
.
include a new argument to reset rownames in na_summary
. Fixes
#33.
Fixed a bug that made exclude_cols
result in non-exclusion in grouped na.summary
outputs.
na_summary
now includes a round_to
parameter to control the number of digits to round to. Defaults to options("digits")
in the user’s settings.
drop_na_at
no longer uses a tibble
to avoid warnings with tibble
row.names
.
Tests in dict_recode
were extended to ensure that we get what is expected particularly with respect to the use_func
argument.
dict_recode
now supports the use of different NA
recoding functions. These can be provided via the argument use_func
.
dict_recode
is a new function to allow recoding with a pattern-values pair.
drop_na_if
now supports keep certain columns when grouping. It also now supports dropping only at certain columns.
drop_na_if
now supports dropping groups that meet a set missing data threshold.
recode_as_na_str
updated to convert factors to character.
drop_row_if
arguments were changed. See docs for the new arguments. This was done to allow new functionality that enables dropping rows based on counts.
Sorting strings in na_summary
contained a minor bug that has now been fixed.
recode_as_na_str
is a new function to recode_as_na
based on string matching.
recode_as_na_if
is a new function that allows deliberate addition of NA
s to a column if it meets a pre-defined condition. This is essentially the reverse of drop_na_if
.
drop_row_if
is a new function similar to drop_na_if
but for rows instead of columns.
percent_missing
can now work independently for factors, characters, or numeric data. It previously was limited to data.frame
objects.
Fixed issues with drop_na_at
that resulted in failing data rebinding.
get_na_counts
now includes an optional exclude_cols
argument to exclude columns from the analysis.
na_summary
has been fixed to allow exclusion of columns from the analysis.
all_na
is a new low level function to perform missingness boolean checks for columns(vectors).
Added a new function column_based_recoode
that aims to allow NA
recoding based on other columns.
Patch Fix to skip tests on old R releases.
Only R versions above 3.6.3 are supported.
Additions
na_summary
which provides a very quick overview of missingness. It also supports grouped summaries.
drop_na_if
allows easy dropping of columns where all values are missing.
custom_na_recode
allows replacing missing values with common values such as mean, min, max, sd.
Major changes
In percent_missing
, the argument grouped
was dropped in favour of simply providing a grouping_cols
vector.
In recode_as_na
, subset_df
was dropped. It now simply accepts an optional subset_cols
argument. The argument tidy
was also dropped.One can simply provide an optional pattern_type
and pattern
.
Similar changes were made for recode_na_as
as above.
The argument x
was changed to df
in drop_na_at
.
Available functions
get_na_counts
percent_missing
recode_as_na
sort_by_missingness
recode_na_as
drop_na_if
recode_na_if
drop_na_at
recode_as_na_for
Key changes
Now supports tidy selection and exploration at specific columns
percent_missing
supports grouping and exclusion of certain columns. The use of decimals was dropped.
drop_na_if
allows exclusion of columns.