
Rick Zamora
@_rjzamora
Parallel-computing enthusiast.
ID: 1312026876156633089
https://rjzamora.github.io/ 02-10-2020 13:49:31
10 Tweets
27 Followers
17 Following




Dask DataFrame's read_parquet performance for remotely stored data has been improved. Reads are now faster with both the "pyarrow" and "fastparquet" engines, thanks to Rick Zamora!



High-cardinality groupby aggregations on Dask DataFrames are much more performant. Thank you, Rick Zamora! They now use a shuffle-based algorithm. Learn more: github.com/dask/dask/pull… /2
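For readers unfamiliar with the idea, here is a rough pure-Python sketch of what a shuffle-based groupby aggregation looks like in general. It is not Dask's implementation: the function name `shuffle_groupby_sum`, the `n_out` parameter, and the sample data are all made up for illustration. The point is only that hashing each key to an output partition puts every record for a key in one place, so each partition can be aggregated on its own.

```python
from collections import defaultdict


def shuffle_groupby_sum(partitions, n_out):
    """Sum values per key across input partitions using a hash shuffle.

    Hypothetical helper for illustration only, not a Dask API.
    """
    # Shuffle step: send each (key, value) record to the output partition
    # selected by the key's hash, so all records for a key end up together.
    shuffled = [[] for _ in range(n_out)]
    for part in partitions:
        for key, value in part:
            shuffled[hash(key) % n_out].append((key, value))

    # Aggregate step: every output partition is reduced independently,
    # because no key is split across partitions any more.
    results = []
    for part in shuffled:
        totals = defaultdict(int)
        for key, value in part:
            totals[key] += value
        results.append(dict(totals))
    return results


# Made-up input: three partitions of (key, value) records.
parts = [[("a", 1), ("b", 2)], [("a", 3), ("c", 4)], [("b", 5), ("c", 6)]]
out = shuffle_groupby_sum(parts, n_out=2)

# Keys never overlap between output partitions, so a plain merge is safe.
merged = {k: v for part in out for k, v in part.items()}
```

With many distinct keys, the result stays spread over `n_out` partitions instead of being funneled into a single one, which is the usual motivation for a shuffle when cardinality is high.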


Moving between CPU and GPU environments just got a whole lot easier. Find out more in this new blog by Rick Zamora and @quasiben: medium.com/rapids-ai/easy…

