Cloud probability statistics per small area in 2024 (Version 1.0)
Data and Resources
-
SPF_LSOA_level.csvCSV
Tabular data containing cloud probability statistics per small area.
-
SPF_LSOA_level.gpkgGeoPackage
Geospatial data combining cloud probability statistics with small area...
Additional Info
| Field | Value |
|---|---|
| Author | Imago Team |
| Maintainer | Shaonlee Patranabis |
| Version | V1.0 |
| Last Updated | June 23, 2026, 08:59 (UTC) |
| Created | April 23, 2026, 12:17 (UTC) |
| Size | CSV: 727 KB, GPKG: 135 MB |
| constraints | None |
| content | The dataset contains the annual cloud probability for each small area (LSOAs in England and Wales, Data Zones in Scotland, Small Areas in Northern Ireland). This data is provided in two distinct formats: a CSV file, which contains the tabular data; and a GPKG file, a geospatial format that combines the tabular data with the boundary geometries. |
| crs | EPSG:27700, OSGB36/British National Grid |
| data_quality | Cloud probability estimates are derived from Sentinel-2 satellite imagery which provides global coverage; this product focuses on the United Kingdom. Further details on the methodology are available in the upcoming technical report. It is important to note that the underlying input data has a spatial resolution of 20-metres, and cloud probability estimates are aggregated to small area boundaries, which vary substantially in geographic area. Users should consult the Sentinel-2 Data Quality Report (https://sentinels.copernicus.eu/documents/247904/685211/Sentinel-2-L1C-Data-Quality-Report-September-2020.pdf) for further information on known quality limitations and uncertainties in the source imagery. |
| data_source | Sentinel-2 |
| file_id | SPF_LSOA_level |
| lineage | The underlying methods and source information used to construct the dataset are documented in the upcoming technical report and paper. Cloud probability is derived from Sentinel-2 Level-2A Bottom-of-Atmosphere (BOA) surface reflectance imagery, Collection 1, accessed via Element84 Earth Search STAC API (https://earth-search.aws.element84.com/v1), covering the period 1 January 2024 to 31 December 2024. Data is processed at 20-metre spatial resolution. A composite of individual scenes from the scene classification layer (SCL)(https://custom-scripts.sentinel-hub.com/custom-scripts/sentinel-2/scene-classification/), acquired over the full calendar year was used to generate annual cloud probability estimates . To reduce spatial artifacts arising from urban features and algorithmic misclassification at building boundaries as well as temporal gaps between different satellite orbits, a combination of a reflective surfaces correction and a quantile mapping of high-acquisition into low-acquision pixels was performed. The small area aggregation was performed using exact pixel-area weighting to account for partial pixel-boundary intersections. All processing was conducted by the Imago Team. |
| source | Imago: Data Service for Imagery |
| spatial_coverage | United Kingdom |
| spatial_resolution | Small area (LSOA / Data Zone / Small Area) |
| temporal_coverage | 01-01-2024: 31-12-2024 |
| temporal_resolution | Annual |