bodo.random_shuffle
bodo.random_shuffle(data, seed=None, dests=None, parallel=False)
Manually shuffle data evenly across selected ranks.
Arguments
data
: data to shuffle.
seed
: number used to initialize the random number generator.
dests
: selected ranks to distribute shuffled data to. By default, distribution includes all ranks.
parallel
: flag to indicate whether data is distributed. Default: False. Inside JIT, the default value depends on Bodo's distribution analysis algorithm for the data passed (for more information, see the Data Distribution section below).
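To make the roles of seed and dests concrete, here is a minimal single-process sketch in plain Python. It is not Bodo's implementation: the function name simulate_random_shuffle and the n_ranks parameter are hypothetical, and the assumption that each selected rank receives a contiguous, near-equal slice of the permuted data is only an illustration of "shuffle evenly across selected ranks".

```python
import random


def simulate_random_shuffle(data, seed=None, dests=None, n_ranks=4):
    """Toy model (hypothetical helper, not part of Bodo): permute the items,
    then split them as evenly as possible among the selected ranks."""
    rng = random.Random(seed)  # a fixed seed makes the permutation reproducible
    items = list(data)
    rng.shuffle(items)

    # Assumption: with dests=None, every rank receives a share.
    targets = list(range(n_ranks)) if dests is None else list(dests)

    # Give each target a contiguous slice; the first `extra` targets get one more item.
    base, extra = divmod(len(items), len(targets))
    chunks = {}
    start = 0
    for i, rank in enumerate(targets):
        size = base + (1 if i < extra else 0)
        chunks[rank] = items[start:start + size]
        start += size
    return chunks


chunks = simulate_random_shuffle(range(10), seed=0, dests=[0, 2])
print({rank: len(part) for rank, part in chunks.items()})  # {0: 5, 2: 5}
```

In this sketch, ranks that are not listed in dests receive nothing, and calling the function twice with the same seed yields the same assignment.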
Example Usage
import bodo
import pandas as pd

@bodo.jit
def test_random_shuffle():
    df = pd.DataFrame({"A": range(100)})
    return df

df = test_random_shuffle()
print(df.head())
# Shuffle the distributed DataFrame returned by the JIT function
df = bodo.random_shuffle(df, parallel=True)
print(df.head())
Save the code in a file named test_random_shuffle.py and run it with mpiexec.
Output: