OSM and photovoltaic data
About this tutorial
What will you learn ?
With this tutorial, you'll be able to:
- Process multiple data sources (vector and raster data) in a jupyter notebook
- Index the processed buildings solar potential data in Elasticsearch
- Reference the indexed solar data in ARLAS
- Create a view of ARLAS-wui (a dashboard) from a configuration file
What will you need ?
Follow the Get Started guide to configure your environment.
There also are additional requirements:
- Java
- gsutil
- Jupyter
On Linux, they can be installed by doing
apt install default-jdk gsutil
pip install jupyter
What will you get ?
An ARLAS dashboard with map layers and graphs to explore the photovoltaic potential of buildings of the Lozère french department.
Exploration app created in this tutorial
The tutorial data
This tutorial use three different open data sources to explore the buildings solar potential.
OSM
OpenStreetMap (abbreviated OSM) is a website that uses an open geographic database which is updated and maintained by a community of volunteers via open collaboration. Contributors collect data from surveys, trace from aerial photo imagery or satellite imagery, and also import from other freely licensed geodata sources. OpenStreetMap is freely licensed under the Open Database License and as a result commonly used to make electronic maps, inform turn-by-turn navigation, assist in humanitarian aid and data visualisation. OpenStreetMap uses its own topology to store geographical features which can then be exported into other GIS file formats. The OpenStreetMap website itself is an online map, geodata search engine and editor.
OSM data is a very valuable tool when it comes to datascience as it can bring freely licensed building data quite reliably all across the globe. In order to limit the volume of the downloaded data, websites like Geofabrik's are particularly helpful to easily select a chunk of the data, here Languedoc-Roussillon.
ADMIN Express
The National Institute of Geographic and Forest Information (IGN) is a public administrative establishment placed under the joint authority of the Ministries in charge of ecology and forestry. Their goal is to guarantee the availability of geolocalised data and in particular sovereign data for the State, foster the appropriation and use of geographic data as well as maintain a high level of competence in the field of geographic information.
They produce a wide variety of interesting datasets when working with French geospatial data, as they give us information to enrich our own data with, for a better exploration. In this example, we will use their ADMIN Express dataset to add the town info to each OSM building.
Photovoltaic potential
Solargis is a Slovakian company that provides online access to high-resolution solar data and services around the domain of solar energy. They provide regional and by country solar resource maps that can be used as is or, in our case, to enrich data.
Get the data
All the necessary data is stored in the Gisaïa bucket, and can be downloaded with:
export ARLAS_DEMO_REMOTE_DATA_PATH="gs://gisaia-public/demo"
export ARLAS_DEMO_LOCAL_DATA_PATH="tutorials/sunny_osm/data"
python3 tutorials/sunny_osm/download_raw_data.py
The OSM and IGN data are a section of the original data, with only the buildings with a type and the towns of Occitanie kept to limit the size of the downloads.
The path ARLAS_DEMO_LOCAL_DATA_PATH
can be changed to any path of your liking.
If the script does not work, the files can directly be retrieved on Google Cloud.
Transform the data
To transform the data and cross the data sources, a notebook (tutorials/sunny_osm/tutorial.ipynb
) is available.
You can open a notebook using various tools, such as Jupyter, VSCode, or any compatible code editor.
Tip
For example, you can use Jupyter Notebook locally.
To do this, open a terminal at the project root and run the following command:
jupyter notebook
Then, use the Jupyter interface to navigate to and open the notebook.
Once the notebook is open, follow the provided guidance to proceed.
Example of notebook cells
It creates a NDJSON file ready to be ingested in ARLAS at tutorials/sunny_osm/data/osm_sunny.json
Ingest data in ARLAS
Warning
If arlas_cli
is configured for local ARLAS, the --config local
option in all tutorial arlas_cli
commands must be specified.
Example:
arlas_cli indices \
--config local \
mapping ...
Create empty index with correct mapping
Infer mapping directly from the data:
arlas_cli indices \
mapping tutorials/sunny_osm/data/sunny_osm/osm_sunny.json/part-00000-*.json \
--no-fulltext osm_id \
--no-fulltext month \
--no-fulltext unique_id \
--no-fulltext name \
--field-mapping time:date-epoch_second \
--field-mapping area:integer \
--nb-lines 1000 \
--push-on sunny_osm
Index data
To index the object data created in sunny_osm/osm_sunny.json
, run:
arlas_cli indices \
data sunny_osm \
tutorials/sunny_osm/data/sunny_osm/osm_sunny.json/*.json
Delete index
If you want to delete the index, run:
arlas_cli indices delete sunny_osm
Warning
Before reindexing data, don't forget to recreate empty index with mapping.
Create collection
To create the ARLAS collection, run:
arlas_cli collections \
create tuto_sunny_osm \
--index sunny_osm \
--display-name "Sunny OSM" \
--centroid-path centroid \
--geometry-path geometry \
--date-path time \
--id-path unique_id \
--owner tmp
Create an ARLAS dashboard from a configuration file
Depending on your deployment, change the ARLAS server URL in the dashboard configuration.
Run at the project root:
export ARLAS_SERVER_URL=http://localhost/arlas
envsubst '$ARLAS_SERVER_URL' < tutorials/sunny_osm/data/arlas/template.config.dashboard.json > tutorials/sunny_osm/data/arlas/config.dashboard.json
Then create the ARLAS dashboard from
arlas_cli persist \
--config local \
add tutorials/sunny_osm/data/arlas/config.dashboard.json config.json --name "Sunny OSM"
export ARLAS_SERVER_URL=https://cloud.arlas.io/arlas/server
envsubst '$ARLAS_SERVER_URL' < tutorials/sunny_osm/data/arlas/template.config.dashboard.json > tutorials/sunny_osm/data/arlas/config.dashboard.json
arlas_cli persist \
add tutorials/sunny_osm/data/arlas/config.dashboard.json config.json --name "Sunny OSM"
ARLAS with the created "Sunny OSM" dashboard
Success
The Sunny OSM
Dashboard is accessible in ARLAS.
You can set a preview by editing the dashboard and choosing a view in the map Preview
tab:
Set Sunny OSM dashboard preview
The created dashboard has now its preview in ARLAS Hub:
ARLAS with the created "Sunny OSM" dashboard and its preview