READ ME File For: Dataset in support of the paper ‘Integrated geospatial datasets to inform marine spatial planning and impact assessment in waters surrounding the United Kingdom’ [CartOcean-DS-UK]   Dataset DOI: 10.5258/SOTON/D3331 Date that the file was created: Mar, 2025 (latest update: Aug,2025)   ------------------- GENERAL INFORMATION ------------------- ReadMe Author: Hugo Putuhena, University of Southampton [https://orcid.org/0000-0003-1947-6984] Date of data collection: Sep 2021 - Mar 2025 Information about geographic location of data collection: University of Southampton, U.K. Related projects: the RAEng Chair and Centre of Excellence in Intelligent & Resilient Ocean Engineering (IROE), ECOWind – Benthic and Offshore Wind Interactions (BOWIE), and Southampton Marine and Maritime Institute (SMMI). -------------------------- SHARING/ACCESS INFORMATION -------------------------- Licenses/restrictions placed on the data, or limitations of reuse: CC-BY Recommended citation for the data: Putuhena, H., Williams, T., Sturt, F., Gourvenec, S., White, D., Godbold, J., & Solan, M. (2025). Dataset in support of the paper “Integrated geospatial datasets to inform marine spatial planning and impact assessment in waters surrounding the United Kingdom” [CartOcean-DS-UK]. https://doi.org/https://doi.org/10.5258/SOTON/D3331 This dataset supports the publication: AUTHORS: Putuhena, H., Williams, T., Sturt, F., White, D., Solan, M., Godbold, J., Gourvenec, S. TITLE: ‘Integrated geospatial datasets to inform marine spatial planning and impact assessment in waters surrounding the United Kingdom’ JOURNAL: Scientific Data PAPER DOI IF KNOWN: (tbd) Links to other publicly accessible locations of the data: - Links/relationships to ancillary or related data sets: - -------------------- DATA & FILE OVERVIEW -------------------- This dataset contains: * CSV.7z o Directory structure: * CSV.7z: > ED50_10km2_merged_cube_table_v_010425_territories_RMSE.csv > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.csv o Brief description: > ED50_10km2_merged_cube_table_v_010425_territories_RMSE.csv This zip file contains table (.csv) data of geospatial layers from either anthropogenic or ecological themes in UK-EEZ waters that have time dimension (i.e., yearly and with maximum range from 2000 to 2020). > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.csv This file contains data of geospatial layers from either anthropogenic, ecological, geoscience, and met-ocean themes in UK-EEZ waters that do not have time dimension (i.e., either portraying a spatial condition from a single time observation or a statistical condition across a specific time range). * SHP.7z o Directory structure: * OceanCart_DS_UK_v1_0_shp.zip: > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.cpg > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.dbf > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.prj > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.sbn > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.sbx > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.shp > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.shp.xml > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.shx > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.cpg > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.dbf > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.prj > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.sbn > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.sbx > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.shp > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.shp.xml > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.shx > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.cpg > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.dbf > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.prj > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.sbn > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.sbx > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.shp > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.shp.xml > ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.shx > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.cpg > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.dbf > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.prj > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.sbn > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.sbx > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.shp > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.shp.xml > ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.shx o Brief description: This file contains data of geospatial layers as mentioned above in .csv but in shapefiles format and where the geospatial layers for cube data due to the limit of maximum size of data contain on each shapefile is separated into an antropogenic set and two ecological set that distinguish the columns L_E17-L_E86 and L_E106-L_E123. * GDB.7z o Directory structure: * GDB.7z: > CartOcean_DS_UK_v1.gdb o Brief description: > CartOcean_DS_UK_v1.gdb > This file contains data of geospatial layers as mentioned above in .csv but in file geodataset format. * Additional_Bio_Oracle_2000_2010.7z o Directory structure: * Additional_Bio_Oracle_2000_2010.7z: > ED50_Biooraclev3_2000_2010.csv o Brief description: This file contains data of geospatial layers to complement that have been resampled from Bio-ORACLE v.3. * Additional_info_interpolations.7z o Directory structure: * Additional_Bio_Oracle_2000_2010.7z: > Additional_info_exploratory_Interpolations_analysis_benthic_eco_params_per_year_v250801.7z > ED50_UKEEZ_10km2grids_IDW_interpolation.csv o Brief description: This file contains additional information of the alternative interpolation methods. Relationship between files, if important for context: Data inside of each zip file in CSV.7z, SHP.7z, and GDB.7z is the same with a different format. In the CSV.7z the data inside is in .csv format, while in SHP.7z the data inside is in .shp format, while in the GDB.7z, the data is in .gdb format. If data was derived from another source, list source: The layers in the dataset were generated from different data sources that include data provided by: MEDIN, Admiralty UKHO, JNCC, DEFRA, Marine Exchange Data, EMODNet, GOV.UK, NTSA, Global Fishing Watch, One Benthic – Cefas, Protected Planet, BGS, BRITICE (Uni of Sheffield), JNCC, Marine.GOV.Scot, Bio-oracle, Cefas, CEDA, Renewable atlas, and paper publications. See detail data source in https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx Additional related data collected that was not included in the current data package: Detail list of the data sources, and how each data source used to generate layers in the dataset is given further in detail in here: https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx If there are there multiple versions of the dataset, list the file updated, when and why update was made: - -------------------------- METHODOLOGICAL INFORMATION -------------------------- Description of methods used for collection/generation of data: All the spatial data gathered here were extracted from public data storage available that come from different sources with different data types and data resolution. The layers that were considered by authors would be important for marine spatial planning and impact assessment. List of those sources can be found from: https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Data_2_v240712_OceanCart_DS_UKv1_paper.xlsx Methods for processing the data: To sync all of the data, each dataset was set up to populate 10km2 grids square across the UK waters through pre-processing step that either involve a generation of kernel density estimation, spatial interpolation, or resampling. Details method can be found in the associated paper, while documentation for each pre-processing been done for each source dataset used here can be found in https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_1_v250730_OceanCart_DS_UKv1_paper_.docx https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx Software- or Instrument-specific information needed to interpret the data, including software and hardware version numbers: all data was integrated and exported using ArcGIS pro v.3.3.1 Standards and calibration information, if appropriate: not applicable Environmental/experimental conditions: not applicable Describe any quality-assurance procedures performed on the data: - People involved with sample collection, processing, analysis and/or submission: Dr. Tom Williams, Prof. Susan Gourvenec, Prof. Fraser Sturt, Prof. David White, Prof. Jasmine Godbold, Prof. Mart Solan -------------------------- DATA-SPECIFIC INFORMATION -------------------------- For ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.csv Number of variables: 453 fields/columns Number of cases/rows: 73,162 rows Variable list, defining any abbreviations, units of measure, codes or symbols used: o OBJECTID * Showing unique values for each row * Data type: OBJECTID/Integer o location_ID * Showing unique values for each row * Data type: text o IHO_SEA/sea region name * Data obtained from Flanders Marine Institute, 2020 http://www.marineregions.org * Data type: text o Join_Count, TARGET_FID, JOIN_FID, Join_Couint_1, TARGET_FID_1, pointid, grid_code * Showing unique code from spatial join process that been done during integration * Data unit: [-] * Data type: LONG/integer o Shape_Length * Data unit: in metre * Data type: double o Shape_Area * Data unit: metre2 Data type: double o x_new_ncube * Data unit: x-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o y_new_ncube * Data unit: y-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o Water_territories * Data obtained from UKHO, 2025 * https://datahub.admiralty.co.uk/portal/apps/sites/#/marine-data-portal/datasets/bf77b2ac1b654efc95dc3665c0501e23/about o The rest of the fields/columns that include each antropogenic, ecological, geoscience, and met-ocean layer in the dataset without time series can be seen in this table sheet: Sheet_5 in [https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx] Missing data codes: null Specialized formats or other abbreviations used: .csv For ED50_10km2_merged_cube_table_v_010425_territories_RMSE.csv Number of variables: 110 fields/columns Number of cases/rows: 1,682,634 rows Variable list, defining any abbreviations, units of measure, codes or symbols used: o OBJECTID * Showing unique values for each row * Data type: OBJECTID/Integer o location_ID * Showing unique values for each row * Data type: text o IHO_SEA/sea region name * Data obtained from Flanders Marine Institute, 2020 http://www.marineregions.org * Data type: text o year * Data unit: year * Data type: integer o Shape_Length * Data unit: in metre * Data type: double o Shape_Area * Data unit: metre2 Data type: double o x_new_ncube * Data unit: x-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o y_new_ncube * Data unit: y-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o Water_territories * Data obtained from UKHO, 2025 * https://datahub.admiralty.co.uk/portal/apps/sites/#/marine-data-portal/datasets/bf77b2ac1b654efc95dc3665c0501e23/about o The rest of the fields/columns that include each antropogenic and ecological layer in the dataset with time series can be seen in this table sheet: Sheet_3 in [https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx] -------------------------- DATA-SPECIFIC INFORMATION -------------------------- * The data in .shp is divided into three shapefiles due to maximum limit of shapefile size, all are located in SHP.7z. For no time series (non-cube data): ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.shp For the cube data: ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.shp ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.shp ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.shp For ED50_10km2_merged_non_cube_table_v_010825_territories_RMSE.shp Number of variables: 453 fields/columns Number of cases/rows: 73,162 rows Variable list, defining any abbreviations, units of measure, codes or symbols used: o OBJECTID * Showing unique values for each row * Data type: OBJECTID/Integer o location_ID * Showing unique values for each row * Data type: text o IHO_SEA/sea region name * Data obtained from Flanders Marine Institute, 2020 http://www.marineregions.org * Data type: text o Join_Count, TARGET_FID, JOIN_FID, Join_Couint_1, TARGET_FID_1, pointid, grid_code * Showing unique code from spatial join process that been done during integration * Data unit: [-] * Data type: LONG/integer o Shape_Length * Data unit: in metre * Data type: double o Shape_Area * Data unit: metre2 Data type: double o x_new_ncube * Data unit: x-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o y_new_ncube * Data unit: y-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o Water_territories * Data obtained from UKHO, 2025 * https://datahub.admiralty.co.uk/portal/apps/sites/#/marine-data-portal/datasets/bf77b2ac1b654efc95dc3665c0501e23/about The rest of the fields/columns that include each antropogenic, ecological, geoscience, and met-ocean layer in the dataset without time series can be seen in this table sheet: Sheet_5 in [https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx] Further info for .shp files: The detail for fields/columns and row information for .shp file is the same with the .csv. Please refer to the information given in the .csv. To note in the shapefile the name for each filed was abbreviated to a limited number of letter (10). The full name of each abbreviated field/column name is detailed in the sheet of Sheet_4 for cube data and in Sheet_6 for non-cube data in this table: [https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx] For ED50_10km2_merged_cube_table_v_010425_territories_RMSE_anthropogenic.shp Number of variables: 55 fields/columns Number of cases/rows: 1,682,634 rows Variable list, defining any abbreviations, units of measure, codes or symbols used: o OBJECTID * Showing unique values for each row * Data type: OBJECTID/Integer o location_ID * Showing unique values for each row * Data type: text o IHO_SEA/sea region name * Data obtained from Flanders Marine Institute, 2020 http://www.marineregions.org * Data type: text o year * Data unit: year * Data type: integer o Shape_Length * Data unit: in metre * Data type: double o Shape_Area * Data unit: metre2 Data type: double o x_new_ncube * Data unit: x-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o y_new_ncube * Data unit: y-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o Water_territories * Data obtained from UKHO, 2025 * https://datahub.admiralty.co.uk/portal/apps/sites/#/marine-data-portal/datasets/bf77b2ac1b654efc95dc3665c0501e23/about o The rest of the fields/columns that include each antropogenic layer in the dataset with time series can be seen in this table sheet: Sheet_3 in [https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx] For ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE17_LE86.shp Number of variables: 46 fields/columns Number of cases/rows: 1,682,634 rows Variable list, defining any abbreviations, units of measure, codes or symbols used: o OBJECTID * Showing unique values for each row * Data type: OBJECTID/Integer o location_ID * Showing unique values for each row * Data type: text o IHO_SEA/sea region name * Data obtained from Flanders Marine Institute, 2020 http://www.marineregions.org * Data type: text o year * Data unit: year * Data type: integer o Shape_Length * Data unit: in metre * Data type: double o Shape_Area * Data unit: metre2 Data type: double o x_new_ncube * Data unit: x-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o y_new_ncube * Data unit: y-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o Water_territories * Data obtained from UKHO, 2025 * https://datahub.admiralty.co.uk/portal/apps/sites/#/marine-data-portal/datasets/bf77b2ac1b654efc95dc3665c0501e23/about o The rest of the fields/columns that include each ecological layer in the dataset with time series ranging from L_E17 to L_E86 can be seen in this table sheet: Sheet_3 in [https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx] For ED50_10km2_merged_cube_table_v_010425_territories_RMSE_ecology_LE106_LE123.shp Number of variables: 26 fields/columns Number of cases/rows: 1,682,634 rows Variable list, defining any abbreviations, units of measure, codes or symbols used: o OBJECTID * Showing unique values for each row * Data type: OBJECTID/Integer o location_ID * Showing unique values for each row * Data type: text o IHO_SEA/sea region name * Data obtained from Flanders Marine Institute, 2020 http://www.marineregions.org * Data type: text o year * Data unit: year * Data type: integer o Shape_Length * Data unit: in metre * Data type: double o Shape_Area * Data unit: metre2 Data type: double o x_new_ncube * Data unit: x-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o y_new_ncube * Data unit: y-coordinate (centre of grid – ED50 UTM 30N) * Data type: double o Water_territories * Data obtained from UKHO, 2025 * https://datahub.admiralty.co.uk/portal/apps/sites/#/marine-data-portal/datasets/bf77b2ac1b654efc95dc3665c0501e23/about o The rest of the fields/columns that include each ecological layer in the dataset with time series ranging from L_E106 to L_E123 can be seen in this table sheet: Sheet_3 in [https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx] -------------------------- DATA-SPECIFIC INFORMATION -------------------------- * This zip file contains the file geodatabase format of ArcGIS software that contains the shapefiles describe in the SHP.7z, with the cube data merged into one data. Specific information of the shapefiles then replicate what has been described above in SHP.7z. -------------------------- DATA-SPECIFIC INFORMATION -------------------------- * This file contains data of geospatial layers to complement that have been resampled from Bio-ORACLE v.3 (Assis, J., Fernández Bejarano, S. J., Salazar, V. W., Schepers, L., Gouvêa, L., Fragkopoulou, E., Leclercq, F., Vanhoorne, B., Tyberghein, L., Serrão, E. A., Verbruggen, H., & De Clerck, O. (2024). Bio-ORACLE v3.0. Pushing marine data layers to the CMIP6 Earth System Models of climate change research. Global Ecology and Biogeography, 33(4). https://doi.org/10.1111/geb.13813) and integrated into the geospatial dataset. Which in the geospatial dataset the layers from Bio-ORACLE v.3 that been integrated with other layers are the historical data of mean between 2010-2020. While the geospatial layers added here are those from the Bio-ORACLE v.3 data from the mean of 2000-2010. o For ED50_Biooraclev3_2000_2010.csv o Variable list, defining any abbreviations, units of measure, codes or symbols used: The same list of the Bio-oracle layers generated in the main dataset, as listed in Sheet_2 in Supplementary Information 2: https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx, which in this data is for the mean between 2000-2010, instead the mean between 2010-2020 as given in the main dataset. -------------------------- DATA-SPECIFIC INFORMATION -------------------------- o This data contains: (a) the results from explanatory interpolations given for the benthic ecological parameters data source to see the best interpolators method (see Supplementary Information 1: https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_1_v250730_OceanCart_DS_UKv1_paper_.docx and Sheet_13 in Supplementary Information 2: https://www.southampton.ac.uk/~assets/doc/rser_gis_uk_ow/Supplementary_Information_2_v250730_OceanCart_DS_UKv1_paper_.xlsx) in Additional_info_exploratory_Interpolations_analysis_benthic_eco_params_per_year_v250801.7z; and (b) the interpolations layers resulted from Inverse Distance Weighting (IDW) for comparison to empirical Bayesian kriging interpolations resulted and integrated to the main dataset in ED50_UKEEZ_10km2grids_IDW_interpolation.shp