Bodo 2021.1 Release (Date: 1/26/2021)
This release includes many new features, bug fixes and performance improvements. Overall, 61 code patches were merged since the last release.
New Features and Improvements
Support filter pushdown when reading partitioned parquet datasets: at compile time, Bodo detects if filters are applied to a dataframe after
read_parquet, and generates code that applies those filters at read time so that only the required parquet files are read.
Support for f-strings in Bodo jitted functions
Support passing Bodo distributed JIT functions to other Bodo JIT functions
Support groupby with
Match input row order of pandas in
min_periods in rolling calls
Support passing a dictionary of data types to
Support dataframe setitem of multiple columns. For example:
df[["A", "B"]] = 1.3
ddof argument (delta degrees of freedom) of
Initial support for dictionaries in
Support level argument with all levels in
Several documentation improvements
sklearn.model_selection.train_test_split inside jit functions.
sklearn.preprocessing.MinMaxScaler inside jit functions.
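To illustrate the filter pushdown item at the top of this list, here is a minimal, library-free sketch of the general idea: when a filter on a partition column is known before reading, files in non-matching partitions can be skipped. This is not Bodo's implementation. The file paths, the hive-style key=value directory layout, and the helper names partition_values and prune_partitions are all assumptions made for illustration.

```python
# Conceptual sketch of partition pruning; NOT Bodo's actual implementation.
# Paths, layout (hive-style key=value directories) and helper names are hypothetical.

def partition_values(path):
    """Extract key=value pairs from a path like 'sales/year=2020/part-0.parquet'."""
    values = {}
    for segment in path.split("/"):
        if "=" in segment:
            key, value = segment.split("=", 1)
            values[key] = value
    return values


def prune_partitions(paths, column, predicate):
    """Keep only files whose partition value for `column` satisfies `predicate`."""
    kept = []
    for path in paths:
        values = partition_values(path)
        # A file without the partition column cannot be ruled out, so keep it.
        if column not in values or predicate(values[column]):
            kept.append(path)
    return kept


files = [
    "sales/year=2019/part-0.parquet",
    "sales/year=2020/part-0.parquet",
    "sales/year=2020/part-1.parquet",
    "sales/year=2021/part-0.parquet",
]

# Similar in spirit to writing df[df.year >= 2020] right after read_parquet:
# only the files in matching partitions would need to be opened.
needed = prune_partitions(files, "year", lambda v: int(v) >= 2020)
print(needed)
```

In user code, the pattern described in the release note is simply a dataframe filter applied after read_parquet inside a jitted function; the pruning itself happens in the generated code, so no manual file selection like the above is required.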