Access Overture's Places Data
Overture Maps Foundation released the alpha version of their open data, and I couldn’t wait to get my hands on it. When I went to their download page, I saw that the data is published neither in any of the standard GIS formats nor in OSM’s PBF format, but in Apache Parquet.
The kind folks at Overture have a writeup on how you can access it here, but it assumes familiarity with the Parquet format, which makes it hard for us GIS folk to follow.
So let us go step by step and figure out how to access and download this data.
Before we get started, let's understand a couple of things:
Here are the steps to get the POIs for a small area onto our local system.
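-- Install and load the DuckDB extensions for reading from S3 and for spatial functions
INSTALL httpfs;
INSTALL spatial;
LOAD spatial;
LOAD httpfs;
-- The Overture bucket lives in the us-west-2 region
SET s3_region='us-west-2';
-- Inspect the columns of the places theme
Describe Select * from read_parquet('s3://overturemaps-us-west-2/release/2023-07-26-alpha.0/theme=places/type=*/*', filename=true, hive_partitioning=1);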
-- Preview two rows, converting nested columns to JSON and the WKB geometry to a geometry
Select
id,
JSON(names) as names,
JSON(categories) as categories,
JSON(brand) as brand,
JSON(addresses) as addresses,
ST_GeomFromWKB(geometry) as geom
from read_parquet('s3://overturemaps-us-west-2/release/2023-07-26-alpha.0/theme=places/type=*/*', filename=true, hive_partitioning=1)
LIMIT 2;
-- Same preview, restricted to a bounding box
Select
id,
JSON(names) as names,
JSON(categories) as categories,
JSON(brand) as brand,
JSON(addresses) as addresses,
ST_GeomFromWKB(geometry) as geom
from read_parquet('s3://overturemaps-us-west-2/release/2023-07-26-alpha.0/theme=places/type=*/*', filename=true, hive_partitioning=1)
where
bbox.minX > 73.77 and
bbox.maxX < 73.955 and
bbox.minY > 18.43 and
bbox.maxY < 18.61
LIMIT 2;
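Before exporting, it can help to check how many places fall inside the bounding box, so the size of the output file is not a surprise. The query below is a minimal sketch of that check. It reuses the same path and filter as the query above and assumes the httpfs extension is loaded and s3_region is set as in the earlier steps; the column alias place_count is only illustrative.

```sql
-- Sketch: count the places inside the bounding box before exporting them.
-- Assumes httpfs is loaded and s3_region is set as in the setup steps.
SELECT count(*) AS place_count
FROM read_parquet('s3://overturemaps-us-west-2/release/2023-07-26-alpha.0/theme=places/type=*/*', filename=true, hive_partitioning=1)
WHERE
    bbox.minX > 73.77 AND
    bbox.maxX < 73.955 AND
    bbox.minY > 18.43 AND
    bbox.maxY < 18.61;
```

If the count is much larger than expected, shrinking the bounding box here is cheaper than discovering it after the export.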
-- Export every place inside the bounding box to a GeoJSON file
COPY (
Select
id,
JSON(names) as names,
JSON(categories) as categories,
JSON(brand) as brand,
JSON(addresses) as addresses,
ST_GeomFromWKB(geometry) as geom
from read_parquet('s3://overturemaps-us-west-2/release/2023-07-26-alpha.0/theme=places/type=*/*', filename=true, hive_partitioning=1)
where
bbox.minX > 73.77 and
bbox.maxX < 73.955 and
bbox.minY > 18.43 and
bbox.maxY < 18.61
) TO 'poi_pune.geojson'
WITH (FORMAT GDAL, DRIVER 'GeoJSON');
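Once the export finishes, a quick way to confirm that the file was written and is readable is to load it back with the spatial extension. This is a sketch that assumes the spatial extension is still loaded and that poi_pune.geojson is in the directory DuckDB was started from; the alias exported_features is only illustrative.

```sql
-- Sketch: read the exported GeoJSON back and count its features.
-- Assumes the spatial extension is loaded and the file is in the working directory.
SELECT count(*) AS exported_features
FROM ST_Read('poi_pune.geojson');
```

The file can also be opened directly in a desktop GIS such as QGIS to inspect the points visually.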