OBIS is providing new products in Parquet, like the speciesgrids and the full export, which also became available in AWS through the Open Data Program. The publication of the full export through AWS came with a total update of the way the data is offered. Now it is offered as a dataset of GeoParquet files, what opens up a world of possibilities in doing spatial analysis with the data, and also make analysis much more efficient and cloud performant. This new version also contains all measurements of the eMoF extension.
To help you use this data more efficiently, we wrote three tutorials on how to use DuckDB with the Parquet datasets!
- Using DuckDB to query the OBIS full export - Part 1 (an introduction to the topic): Using DuckDB to query the OBIS full export - Part 1 • Silas Principe
- Using DuckDB to query the OBIS full export - Part 2 (spatial extension): Using DuckDB to query the OBIS full export - Part 2 (spatial extension) • Silas Principe
- Using DuckDB to query the OBIS full export - Part 3 (
duckplyrpackage): Using DuckDB to query the OBIS full export - Part 3 (`duckplyr` package) • Silas Principe
We hope it will be useful!