Martin Durant
@martin_durant_
ID: 712667425204998146
http://martindurant.github.io/ 23-03-2016 15:48:49
138 Tweets
358 Followers
24 Following
With some help from Martin Durant & support from the Sloan Foundation we're working on distributing PUDL data using Jarrett Scott Intake catalogs. First up: ~1 billion rows of U.S. EPA Continuous Emissions Monitoring System (CEMS) data in Apache Parquet files. github.com/catalyst-coope…
To make versioning and distributing our SQLite DBs easier, we wrote a wrapper around the intake-sql driver called intake-sqlite. It uses fsspec to cache a remote DB file locally, and then hands the local URI off to intake-sql. Simon Willison Martin Durant github.com/catalyst-coope…
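The cache-then-hand-off pattern described in that tweet can be sketched roughly as follows. This is not intake-sqlite's actual code: it swaps fsspec for the Python standard library (urllib) so the sketch runs without extra dependencies, and the helper names cache_remote_db, local_sqlite_uri and demo are hypothetical. The "sqlite:///" prefix assumes the downstream SQL driver accepts SQLAlchemy-style URIs.

```python
import hashlib
import shutil
import sqlite3
import tempfile
import urllib.request
from pathlib import Path


def cache_remote_db(url: str, cache_dir: Path) -> Path:
    """Download `url` into `cache_dir` once and return the local path."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Name the cached copy after a hash of the URL so different sources never collide.
    local_path = cache_dir / (hashlib.sha256(url.encode()).hexdigest() + ".sqlite")
    if not local_path.exists():
        with urllib.request.urlopen(url) as response, open(local_path, "wb") as out:
            shutil.copyfileobj(response, out)
    return local_path


def local_sqlite_uri(url: str, cache_dir: Path) -> str:
    """Return a URI for the cached copy (assumed SQLAlchemy-style) to pass to a SQL driver."""
    return "sqlite:///" + str(cache_remote_db(url, cache_dir))


def demo() -> list:
    """Stand in for a remote DB with a file:// URL, cache it, then query the local copy."""
    with tempfile.TemporaryDirectory() as tmp:
        tmp_path = Path(tmp)
        source = tmp_path / "source.sqlite"
        con = sqlite3.connect(source)
        con.execute("CREATE TABLE plants (id INTEGER, name TEXT)")
        con.execute("INSERT INTO plants VALUES (1, 'example')")
        con.commit()
        con.close()

        # In real use this would be an https:// (or similar) URL.
        local = cache_remote_db(source.as_uri(), tmp_path / "cache")
        con = sqlite3.connect(local)
        rows = con.execute("SELECT id, name FROM plants").fetchall()
        con.close()
        return rows


if __name__ == "__main__":
    print(demo())
```

With fsspec, the download step would presumably be replaced by one of its local caching file systems, which also handle many more remote protocols than urllib does.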
🚨 New blog post by Peter Marsh! Learn about the latest advances enabled by the Kerchunk package. Thanks to Google Open Source #GSOC for sponsoring this work. 🙏 medium.com/pangeo/accessi…