
Joris Van den Bossche
@jorisvdbossche
Open source #python developer and teacher. Pandas, GeoPandas and Shapely maintainer. Apache Arrow at Voltron Data Labs
fosstodon.org/@jorisvandenbo…
ID: 1047353646
https://jorisvandenbossche.github.io 30-12-2012 09:31:31
1.1K Tweets
2.2K Followers
126 Following




Live at Community Over Code in just under an hour! Kyle Barron (kylebarron.dev on bsky), Joris Van den Bossche, myself, and many others worked hard to release the initial version of the spec + Python bindings. Read more to find out!

Along with Dewey Dunnington and Joris Van den Bossche we tagged version 0.1 of the GeoArrow spec last week, and have started implementations in C, Rust, and Python! There's so much potential in a near future of sharing geodata across languages without copies.




Big day for us at Voltron Data 🔥 Our CEO Josh Patterson just announced Theseus, an embeddable, accelerator-native data engine!!! Data preprocessing isn't keeping up with the performance of AI training! Theseus accelerates the FULL data system ⏩ Learn more: voltrondata.com/theseus


To everyone who ever asked me what Voltron Data actually does... Here you go! We finally answer the question!



Claypot AI is joining Voltron Data! AI starts from data. By joining forces, we can further help companies leverage both batch and real-time data for AI applications, on top of Voltron Data’s GPU-native distributed engine Theseus. venturebeat.com/data-infrastru… For AI, GPUs are mostly



Listen in for a discussion on what GeoParquet solves and why you should (or shouldn't) consider using it! We cover how GeoParquet is cloud native, how its compression makes reading and writing faster, and how it integrates with GeoArrow for fast in-memory computing.


We are looking for an Xarray community developer to work at Earthmover to focus on building connections, both technical and social, between the Xarray project and the biomedical research community. jobs.gusto.com/postings/earth…

The GeoParquet 1.1 revision is out, adding support for spatial partitioning and native GeoArrow geometries. Both have the potential to massively speed up working with very large geospatial datasets. The next step is ensuring the ecosystem works with this version: github.com/opengeospatial…