Training Resources for Geospatial Computing
This is a curated list of training resources for popular packages that are used in geospatial computing. The current list primarily includes Python packages, but it will be updated to include packages from other languages, e.g. R. If you need resources for any particular package, please contact us so we can update the list accordingly.
Please feel free to contribute the adding resources you find useful for geospatial research, education, and training purposes.
Outline
Common Resources for Geospatial Computing and Earth Observation
These are links to resources that have useful information and tutorials on geospatial computing and Earth observation.
Awesome Geospatial - Long list of geospatial analysis tools
GeoHackWeek 2017 Materials - Materials from GeohackWeek which happened in University of Washington in 2017.
Python Geospatial Analysis Cookbook - Over 60 python recipes to perform spatial operations and build an indoor routing Django web application
Intro to Geospatial Data using Python - Introduction to geospatial data and its types, vector and raster, and work with geospatial data using Python.
Automating GIS Processes - A course on how to do different GIS-related tasks in Python programming language. Has lessons with tutorials on specific topic(s) where the aim is to learn how to solve common GIS-related problems and tasks using Python tools.
Geo Python - The Geo-Python course contains basic concepts of programming and scientific data analysis using the Python. Targeted at beginners with no previous programming experience needed.
Parallel Python - Analyzing large Datasets - Parallel computing in Python tutorial materials.
Course Materials of Fabien Maussion - Includes various courses such as Scientific Programming, Climate Systems, etc
Digital Earth Africa Training Course - Sandbox data analysis platform.
Digital Earth Australia Notebooks - Jupyter Notebooks, Python scripts and workflows for analysing Digital Earth Australia (DEA) satellite data and derived products.
Free Earth Data Science Courses and Textbooks (Earth Lab CU Boulder) - Free and open earth data science textbooks and courses as open education resources.
Intro to Earth Data Science (Earth Lab CU Boulder) - Analyze and visualize earth and environmental science data using the Python. No prior programming knowledge assumed.
Intermediate Earth Data Science (Earth Lab CU Boulder) - Use Data for Earth and Environmental Science in Open Source Python
Open Source Geoprocessing Tutorial - Tutorial of basic remote sensing and GIS methodologies using modern open source software in Python (rasterio, shapely, geopandas, folium, etc).
GDAL - GDAL is a translator library for raster and vector geospatial data formats.
GDAL Tutorials - Tutorials for using GDAL.
GIS in Python - Learn how to use geopandas, rasterio and matplotlib to plot and manipulate spatial data in Python.
Awesome-EarthObservation-Code - A curated list of awesome tools, tutorials, code, helpful projects, links, stuff about Earth Observation and Geospatial stuff.
Python for Atmosphere and Ocean Scientists - Repository containing the Data Carpentry lesson materials for a single day workshop on using python (and git) in the atmosphere and ocean sciences: Python AOS Lesson.
Open Geospatial Datasets for GIS Education - Repository of open geospatial datasets to be used in an educational context.
Awesome GIS - Collection of geospatial related sources, including cartographic tools, geoanalysis tools, developer tools, data, conference & communities, news, massive open online course, some amazing map sites, and more.
Awesome GeoSpatial List - A curated list of geospatial tools, data, tutorials, information, and more.
Open Geospatial Data Download Catalog - Major sources of open geospatial data download sites.
Python for Geospatial Analysis / Code Repository - Crashcourse introduction to using Python to wrangle, plot, and model geospatial data.
Geospatial Analyses in Python - Python-based geospatial analysis codes related to: GOES-16 satellite data, national land cover database (NLCD), elevation maps, building footprints, city mapping across the Continental USA.
Geospatial Mapping in Python - An exploration into working with geospatial data, transforming data types, and visualizing geospatial data in a multitude of ways.
Google Earth Engine Python Notebooks - A collection of 360+ Jupyter Python notebook examples for using Google Earth Engine with interactive mapping.
Google Earth Engine Examples - A collection of 290+ Python examples for using Google Earth Engine in QGIS.
Awesome Earth Engine - A curated list of Google Earth Engine resources.
Python From Space - Analyzing Open Satellite Imagery Using the Python Ecosystem. Contains slides and Jupyter notebooks for the “Python from Space” talk at Pycon 2017 in Portland, Oregon.
Geospatial Machine Learning - A curated list of resources focused on Machine Learning in Geospatial Data Science.
Python Geospatial - A collection of Python packages for geospatial analysis with binder-ready notebook examples.
Google Earth Engine with Python - Series of Jupyter notebook (colabs) to learn Google Earth Engine with python.
Kepler.gl - Data-agnostic, high-performance web-based application for visual exploration of large-scale geolocation data sets. Built on top of Mapbox GL and deck.gl.
R GeoNotebooks - RMarkdown notebooks documenting maps & GIS howto’s in R.
ETL Python Tools for Geospatial Data - Multiple tools to perform Extract-Transform-Load (ETL) operations on Geospatial data.
Plotting Geospatial Data - Utilize Geoplotlib to create geographical visualizations, identify the different types of geospatial charts, and create complex visualizations using tile providers and custom layers.
NDVI Stats - Jupyter Notebook with flow calculation of zonal statistics for selected polygons using geopandas, google earth engine api, rasterio, rasterstats and folium.
Lectures on scientific computing with Python - A set of lectures on scientific computing with Python, using IPython notebooks.
Practical Data Science - Course site for Duke Practical Data Science.
LearnEO - Learn Earth Observation with ESA.
EOPortal Directory - Earth Observation resources by ESA.
Cate - The CCI Climate Analysis Toolbox (Cate) is a cloud-enabled computing environment for analysing, processing and visualising all ESA Climate Change Initiative datasets. Cate works by mashing ECV data and other data sources into a common data model. Users operate on this model, then analyse, process, and visualise the results.
Intro to Geospatial Data Analysis - SciPy 2018 Video Tutorial.
Spatial Data Analysis with PySAL - SciPy 2020 Video Tutorial.
Data Science Hacks - Data Science Hacks consists of tips, tricks to help you become a better data scientist. Consists of python, jupyter notebook, pandas hacks and so on.
Python Data Science Handbook - Contains the full text of the Python Data Science Handbook by Jake VanderPlas
Python Data Science Handbook Code Materials - This repository contains the entire Python Data Science Handbook, in the form of Jupyter notebooks.
Python Essentials for GIS Learners - Materials, exercises and lessons for a 3-day course on Python Essentials for GIS Learners offered to the BK Faculty at TU Delft (Github Repository).
Dask
Dask is a flexible library for parallel computing in Python.
Dask is composed of two parts:
Dynamic task scheduling optimized for computation. This is similar to Airflow, Luigi, Celery, or Make, but optimized for interactive computational workloads.
“Big Data” collections like parallel arrays, dataframes, and lists that extend common interfaces like NumPy, Pandas, or Python iterators to larger-than-memory or distributed environments. These parallel collections run on top of dynamic task schedulers.
Dask Documentation - Official Dask Documentation
Dask Examples / Code Repository - Examples show how to use Dask in a variety of situations.
Dask Tutorial / Code Repository - Dask Tutorial that was last presented at SciPy 2020.
Dask Tutorial - Nvidia Blog - Beginner’s Guide to Distributed Computing with GPUs in Python.
Dask Tutorial with Jupyter Notebooks - A collection of notebooks that have Dask tutorials.
Official Dask Youtube Channel - Link to the official Dask youtube channel that has videos on how to use Dask.
Parallel and Distributed Computing in Python with Dask - SciPy 2020 Conference.
Scalable Data Analysis in Python with Dask - Playlist that contains videos on data analysis with Dask.
Xarray
xarray is an open source project and Python package that makes working with labelled multi-dimensional arrays simple, efficient, and fun!
Xarray Tutorial - Official Xarray Tutorials
Xarray Documentation - Official Xarray documentation
Xarray Video Tutorials - Official video tutorials for Xarray
GeoHackWeek Xarray Tutorial - Series of tutorials by GeoHackWeek
Digital Earth Africa Xarray Tutorial - Specific lesson on Xarrays as a part of Digital Earth Africa training.
Xarray with Dask - Xarray with Dask arrays
EGU2017 Xarray Tutorial and Answers - Contains the EGU2017 tutorial with answers.
Exploring netCDF Datasets using Xarray - A notebook provides discussion, examples, and best practices for working with netCDF datasets in Python using the xarray package.
Xarray tutorial for Rossbypalooza - This notebook introduces xarray for new users in the geophysical sciences.
Climate Data Analysis with Xarray - EuroPython talk on climate analysis with Xarray and Cartopy.
Weather Data Analysis with Python - Spire Weather’s global forecast data in GRIB2 format using Python
Xarray Introduction and Tutorial - Research Computing Days 2021 Xarray Video Tutorial
GeoPandas
GeoPandas is an open source project to make working with geospatial data in python easier. GeoPandas extends the datatypes used by pandas to allow spatial operations on geometric types
GeoPandas Documentation - Official Documentatiom
GeoPandas Examples / Code Repository - Official GeoPandas collection of examples.
GeoPandas Repository - Official GitHub repo of GeoPandas
Geospatial Fundamentals in Python - DLab workshop materials on Geospatial analysis using GeoPandas
Intro To GeoPandas - Course materials of CSC Finland Intro to Python GIS
GeoPandas Tutorial (focus on tabular vector data) - Introduction to geospatial data analysis in Python, with a focus on tabular vector data using GeoPandas.
Intro to GeoPandas (Intro to GIS course) - Course materials of Intro to GIS course, University of Helsinki
GeoPandas Tutorial by J Cutrer - Ingest and plot shapefiles using the geopandas library in python.
Exploring GeoPandas - JupyterLab notebook for exploring GeoPandas using the builtin Natural Earth Low Res dataset.
GIS Analysis with GeoPandas - Basic GIS Analysis with Geo pandas
RasterIO
Geographic information systems use GeoTIFF and other formats to organize and store gridded raster datasets such as satellite imagery and terrain models. Rasterio reads and writes these formats and provides a Python API based on Numpy N-dimensional arrays and GeoJSON.
RasterIO Repo - Official GitHub repository for RasterIO.
RasterIO Documentation - Official Documentation for RasterIO.
RasterIO QuickStart - Short examples of RasterIO in Python.
Raster Processing using Python - GeoHackWeek materials for Raster.
GeoHackWeek Raster Tutorials - Contents for GeoHacWeek 2019 raster tutorial.
Exploring RasterIO - Tutorial series by Patrick Grey from Duke.
Reading Raster Files - RasterIO materials as part of course Intro to Python GIS.
GDAL Raster Tutorials - Tutorials on how to use GDAL Python API and rasterio for raster data management, transformation, analysis and visualization tasks.
Intermediate Earth Data Science (Earth Lab CU Boulder) Raster Tutorial - Fundamental concepts related to working with raster data in Python, including understanding the spatial attributes of raster data, how to open raster data and access its metadata, and how to explore the distribution of values in a raster dataset.
Open, Plot, and Explore Raster Data in Python - Working with lidar derived raster data that represents both terrain / elevation data, and surface elevation.
Create Geospatial Raster from XY Data - Tutorial shows the procedure to run a Scipy interpolation over a Pandas dataframe of point related data having a 2D Numpy array as an output.
Advanced Features in RaSterIO - Notebook that demonstrates advanced RasterIO concepts useful for developing cloud-native applications.
Plotly
Plotly’s Python graphing library makes interactive, publication-quality graphs.
Plotly Fundamentals - Plotly Python Open Source Graphing Library Fundamentals.
Plotly Maps - Plotly Python Open Source Graphing Library Maps.
Awesome Dash - A curated list of awesome Dash (plotly) resources. -Plotly Tutorial for Beginners - Learn to use Plotly library.
Plotly Tutorial - Jupyter Notebook that condenses the Plotly API into one easy to use document with examples.
Plotly Tutorial - Extensive Plotly Tutorial.
Series of Plotly Tuturials - Covers Basic, Scientific, Statistic, Financial, Maps, and 3D Plots.
Plotly Tutorial for Energy System Modifiers - Tutorial on using the Python package Plotly to build plots for energy system related data, as given in openmod Zurich workshop, June 2018.
IPython Notebooks for Plotly - Gallery of IPython Notebooks in Python/v3.
Graphs and Plots Using Plotly - Plotly Tutorial on graphs and plotting.
NumPy
NumPy is the fundamental package for scientific computing in Python. It is a Python library that provides a multidimensional array object, various derived objects (such as masked arrays and matrices), and an assortment of routines for fast operations on arrays, including mathematical, logical, shape manipulation, sorting, selecting, I/O, discrete Fourier transforms, basic linear algebra, basic statistical operations, random simulation and much more.
NumPy Quickstart - Quick overview of arrays in NumPy.
NumPy Tutorials - NumPy tutorials & educational content in notebook format.
Creating and Manipulating Numerical Data - SciPy lectures that gives an overview of NumPy.
Python NumPy Tutorials - Tutorial by Justin Johnson of University of Michigan.
Data Visualization with Python - Shows you how to use Python with NumPy, Pandas, Matplotlib, and Seaborn to create impactful data visualizations with real world, public data.
Applied Intro To Beginners - Python NumPy tutorial for beginners.
NumPy Tutorial for Beginners - A Numpy Tutorial for Beginners covering Data Types, Array, Sampling Methods, Maths functions, Slicing and Indexing, Set operations, Linear Algebra.
NumPy Beginner Tutorials - Tutorial for beginners by Nicolas P. Rougier.
First Steps into Data Science in Python - RealPython guide to NumPy.
NumPy Essentials - NumPy tutorial by Matthew Kearns covering essential concepts.
EuroSciPy 2018 NumPy tutorial - NumPy tutorial covered in EuroSciPy 2018.
Advanced NumPy - SciPy lecture on Advanced NumPy functionality.
NumPy Cheat Sheet - Cheat Sheet for NumPy.
Python NumPy Beginner Video Tutorial - Basics of the NumPy library provided by freeCodeCamp.
NumPy Video Tutorial by Derek Banas - Updated Video tutorial for NumPy.
Matplotlib
Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python.
Matplotlib Documentation - Official Matplotlib documentation.
Matplotlib Tutorials - Official tutorials for Matplotlib covering concepts frmo beginner to advanced.
Anatomy of Matplotlib - Tutorial developed for the SciPy conference
Beginner Matplotlib Tutorials - Tutorials for beginner provided by Nicolas P. Rougier.
Getting Started with Matplotlib - SciPy 2019 Tutorial.
Plotting with Matplotlib - RealPython guide to Matplotlib.
SciPy Matplotlib Tutorial - Explore matplotlib in interactive mode covering most common cases.
Time Series Exploration with Matplotlib - Interactive Matplotlib tutorial.
Matplotlib Tutorial Notebooks - Tutorial notebooks on numpy, pandas and matplotlib.
Data Visualization with Python - Data Visualization tutorials with Jupyter Notebooks.
Maptplotlib Video Tutorial Series / Code Respository - Video Tutorial series by Corey Schafer.
Matplotlib Video Tutorial - Video tutorial by Derek Banas.
Pandas
pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
Pandas - Official GitHub repository of Pandas.
Pandas Documentation - Official Pandas Documentation. Has interactive Notebooks at the end.
Intro Tutorials - Official intro to Pandas
Community Tutorials - Pandas Community tutorials geared for new users. Has excellent resources (Notebooks and video).
Pandas Tutorial - Analyzing Video Game Data with Python and Pandas.
Pandas Exercises - Repository just with exercises to practice pandas.
Series of Pandas Tutorial - Jupyter Notebooks and Data Sets for Pandas Library.
Scikit-Learn and Pandas - Teaching materials for pandas and scikit-learn.
Pandas Scipy Conference Tutorials - Material for the pandas tutorial at EuroScipy 2016.
100 Pandas Puzzles - 100 data puzzles for pandas, ranging from short and simple to super tricky.
PyCon 2019 Presentation on Pandas - Data Science Best Practices with pandas.
Python Pandas Tutorial - A Complete Introduction for Beginners.
PyData Book Materials - Materials and IPython notebooks for “Python for Data Analysis” by Wes McKinney, published by O’Reilly Media.
Dataframes in Python - Datacamp tutorial on exploring data analysis with Python.
Pandas Video Tutorial - Video Tutorial series by Corey Schafer.
Pandas Extensive Video series - Jupyter notebook and datasets from the pandas Q&A video series from Data School.
Scikit-Learn
Scikit-Learn Tutorials - Official scikit-kearn tutorials.
Machine Learning with scikit-learn - An introduction to machine learning with scikit-learn.
Scikit-Learn Tutorial (ML) - An easy-to-follow scikit-learn tutorial that will help you get started with Python machine learning provided by Datacamp.
Sckit-Learn Extensive tutorial - Materials for Scikit-Learn tutorial.
Scikit-Learn Videos - Jupyter notebooks from the scikit-learn video series by Data School.
Scikit-Learn and Pandas - Teaching materials for pandas and scikit-learn.
Scikit-Learn Video Course - Scikit-Learn video course provided by freeCodeCamp.
Seaborn
Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface for drawing attractive statistical graphics.
Seaborn - Statistical data visualization in Python. Official Github Repo.
Seaborn User Guide Tutorials - Official user guide and tutorials.
Ultimate Python Seaborn Tutorial - Step-by-step Seaborn tutorial
Visualization with Seaborn - Excerpt from the Python Data Science Handbook by Jake VanderPlas
Seaborn Tutorial - Includes all the types of plot offered by Seaborn, applied on random datasets.
Series of Seaborn Notebooks - Training notebooks in Seaborn.
Matplotlib and Seaborn Tutorial - Tutorials for easy understanding of matplotlib and seaborn commands for graph plotting.
Plotting with Seaborn and Matplotlib - Basic to advanced visualization techniques and some basic Exploratory Data Analysis on a dataset.
Practical Data Visualization - Resources for teaching & learning practical data visualization with python.
Data Visualization - Data Visualization With Matplotlib and Seaborn.
Cheat Sheet - Seaborn Cheat Sheet provided by Derek Banas.
Seaborn Video Tutorial - Video Tutorial provided by Derek Banas
Cartopy
Cartopy is a python package which provides a set of tools for creating projection-aware geospatial plots using python’s standard plotting package, matplotlib. Cartopy also has a robust set of tools for defining projections and reprojecting data.
Cartopy Documentation - Official cartopy documentation.
Cartopy SciPy 2018 - Cartopy tutorial: Around the world in 80 ways.
Basic Cartopy Tutorial - Basic tutorial for cartopy map plotting Python package.
Maps with Cartopy - Tutorial on building maps with Cartopy.
Simple Maps with Cartopy - Basic and quick intro to Cartopy.
PySAL
The python spatial analysis library for Geospatial Data Science
PySAL Documentation - Official documentation, contains a list of courses, workshops, tutorials, and presentations.
PySAL Notebooks Project - This is a compilation of official notebooks demonstrating the functionality of PySAL, the Python Spatial Analysis library.
Intermediate Methods for Geospatial Data Analysis - SciPy 2019 tutorial.
Geographic Data Science with PySAL and the pydata stack - SciPy 16 tutorials.
SpatioTemporal Asset Catalogs
The SpatioTemporal Asset Catalog (STAC) specification provides a common language to describe a range of geospatial information, so it can more easily be indexed and discovered. A ‘spatiotemporal asset’ is any file that represents information about the earth captured in a certain space and time.
STAC - Official Github repository containing the core object type specifications, examples, validation schemas, and documentation about the context and plans for the evolution of the specification.
STAC openAPI - The openAPI of STAC.
Introduction to SpatioTemporal Asset Catalogs - Video tutorial.
STAC utilities - Official Github repository for STAC utilities like integrations with various databases, clients and programming languages.
PySTAC Documentation - The official documentation of PySTAC, a python library for working with STAC.
PySTAC Tutorial - Video tutorial for the PySTAC library.
PySTAC Tutorial Jupyter Notebooks - Interactive notebooks with PySTAC examples.
geemap
geemap is a Python package for interactive mapping with Google Earth Engine (GEE), which is a cloud computing platform with a multi-petabyte catalog of satellite imagery and geospatial datasets.
geemap Documentation - geemap official documentation.
geemap Jupyter Notebooks - Collection of examples as interactive notebooks for geemap.
geemap Tutorial Series - Official video tutorials created by geemap author.
geemap list of tutorials - List of official geemap tutorials and examples with links to the resources used in the tutorial.
PyTorch
PyTorch is an optimized tensor library for deep learning using GPUs and CPUs.
PyTorch Documentation - Official PyTorch documentation.
PyTorch - Official PyTorch Github repository.
PyTorch Examples - Official Github repository containing PyTorch examples.
PyTorch Tutorials - Official tutorials for the whole PyTorch ecosystem.
PyTorch Video Tutorials - PyTorch video tutorial series made by Harrison Kinsley.
PyTorch Video Tutorials - PyTorch video tutorial series made by deeplizard.
PyTorch Curated Resources List - An extensive list of PyTorch tutorials, videos, tools, etc.
Interactive Deep Learning Book - Elaborate book for deep learning with examples using PyTorch.
TensorFlow
TensorFlow is an end-to-end open source platform for machine learning. It has a comprehensive, flexible ecosystem of tools, libraries and community resources that lets researchers push the state-of-the-art in ML and developers easily build and deploy ML powered applications.
TensorFlow Python Documentation - TensorFlow official documentation for Python.
TensorFlow - Official TensorFlow Github repository.
TensorFlow Tutorials - Official tutorials for the whole TensorFlow ecosystem.
TensorFlow Guides As Interactive Notebooks - Official interactive notebook guides for TensorFlow.
TensorFlow Video Tutorials - TensorFlow beginner video tutorial series made by Aladdin Persson.
TensorFlow Tutorial For Beginners - A tutorial aimed at beginners, includes interactive code examples.
TensorFlow Curated Resource List - An extensive list of TensorFlow tutorials, videos, tools, etc.
TensorFlow Tutorials - TensorFlow tutorials that include a detailed interactive notebook and accompanying video lecture.
Interactive Deep Learning Book - Elaborate book for deep learning with examples using TensorFlow.
JAX
JAX is Autograd and XLA, brought together for high-performance numerical computing and machine learning research.
JAX - Official Github repository.
JAX Documentation - Official documentation.
JAX tutorials - Official JAX tutorials.
Getting Started With JAX - A blog post introducing JAX concepts.
JAX: Accelerated Machine Learning Research - SciPy 2020 video tutorial.
JAX Ecosystem Meetup - NeurIPS 2020 video tutorial.
Introduction To JAX - Google Cloud Tech video.
JAX Curated Resource List - An extensive list of JAX tutorials, videos, tools, etc.
Textbooks
These are textbooks related to deep learning and geospatial computing. You can also view all the textbooks in the Textbooks folder.
Miscellaneous Libraries
Fiona
Fiona reads and writes geographic data files and thereby helps Python programmers integrate geographic information systems with other computer systems. Fiona contains extension modules that link the Geospatial Data Abstraction Library (GDAL).
Fiona - Official GitHub repository.
Fiona Documentation - Fiona Documentation
Examples - Fiona Examples
Xarray-spatial
Xarray-Spatial implements common raster analysis functions using Numba and provides an easy-to-install, easy-to-extend codebase for raster analysis.
Xarray-spatial - Official GitHub repository.
Xarray-spatial Documentation - Official Documentation for Xarray-spatial.
Examples - Official examples for using Xarray-spatial.
Rio-xarray
Geospatial xarray extension powered by rasterio
Rio-xarray Documentation - Official documentation for Rio-xarray.
Rio-xarray - Official GitHub repository.
Examples - Official exmaples for Rio-xarray.
Regionmask
regionmask is a Python module that:
Contains a number of defined regions, including: countries (from Natural Earth), a landmask and regions used in the scientific literature (the Giorgi regions 1 and the SREX regions 2). Can plot figures of these regions with matplotlib and cartopy. Can be used to create masks of the regions for arbitrary longitude and latitude grids (2D integer masks and 3D boolean masks). Support for shapefiles is provided via geopandas. Arbitrary regions can be defined easily.
RegionMask - Official Github Repository.
RegionMask Documentation - Official documentation for Regionmask.
RegionMask Tutorial Notebooks - Official tutorials with Jupyter Notebooks.
Geocube
Tool to convert geopandas vector data into rasterized xarray data.
Geocube - Official GitHub repository.
Geocube documentation - Official Geocube documentation.
Examples - Official examples for using Geocube.
Salem
Salem is a small library to do geoscientific data processing and plotting. It extends xarray to add geolocalised subsetting, masking, and plotting operations to xarray’s DataArray and DataSet structures.
Salem - GitHub repository for Salem.
Salem Documentation - Official documentation for Salem.
Examples - Examples for using Salem.