added EM::B (Important) Type::Feature request labels
Does the lazy pyspark execution stand in contrast to the current use of index arrays?
assigned to @fsauerbu
created merge request !66 to address this issue
mentioned in merge request !66
The way forward for hist() would be to add a column with the binary-encoded assignment to and stack/process. The assignment might be overlapping. The total dataframe should then be queried with a groupby of the oversampled bin and assignment column
hist()