Skip to contents

The eData Data Reporting Format is a format for the reporting of chemical occurence/exposure data in the natural environment. It provides tables, vocabulary, and validation functions for structuring chemical occurence data, as well as spatial, social, biological, geographical, and other metadata.

Installation

To install the format, you will need the devtools package:

if (!requireNamespace("eDataDRF", quietly = TRUE)) {
  if (!requireNamespace("devtools", quietly = TRUE)) {
    install.packages("devtools")
  }
  devtools::install_github("NIVANorge/eDataDRF")
}

Usage

The format is structured as a series of tables that collect different types of data. The Measurements table is the central data table. It references the other tables via foreign keys, unique identifying columns present in both tables. For more information on each table, please see the section below or the Articles section of the documentation site.

Tables

Tables are created as tibble::tibble() calls with empty variables of specific types (e.g. character(0) for strings). These support easier validation (see Validation) and the extensive Tidyverse family of functions.

Tables are listed below:

Table Name Purpose Comments
Campaign Records data about sampling campaign and organisation collecting data.
Reference Records conventional publication metadata, where available
Sites Records site coordinates, land use, country/ocean
Parameters Records data on stressors (chemical, radiation, etc.), quality measurements
Compartments Records information on the compartment/matrix sampled
Samples Records which combinations of dates, sites, parameters and compartments were sampled Not used in final analysis, but exists as an intermediate table used to create measurements
Biota Where relevant, records biota species, tissue, life stage, and gender Optional
Methods Records type and descriptions of methods used for sampling, extraction, fractionation and analysis
Measurements Records measured values, units, uncertainty, sample size, and methods associated with a given sample
CREED (quality) Records assessment purpose statement, relevant data summarised from above tables, relevance/reliability scores, and final assessment of data quality.
CREED Scores [description needed]

Vocabulary

Likewise, controlled vocabulary is available as functions that return vectors, lists, or tables. In some cases, helper functions are available that wrap multiple invididual functions.

Where external data sources are used to generate a vocabulary, functions may wrap (processed) data from other R packages or load raw data from external sources.

measured_categories_vocabulary() # returns a named vector
#>               External               Internal                Surface 
#>       "External Media" "Internal to Organism"  "Surface of Organism"

environ_compartments_sub_vocabulary() # returns a nested list
#> $Aquatic
#>                    Freshwater             Marine/Salt Water 
#>                  "Freshwater"           "Marine/Salt Water" 
#>   Brackish/Transitional Water                   Groundwater 
#> "Brackish/Transitional Water"                 "Groundwater" 
#>                    Wastewater          Liquid Growth Medium 
#>                  "Wastewater"        "Liquid Growth Medium" 
#>                     Rainwater                    Stormwater 
#>                   "Rainwater"                  "Stormwater" 
#>                      Leachate              Aquatic Sediment 
#>                    "Leachate"            "Aquatic Sediment" 
#>                     Porewater                        Sludge 
#>                   "Porewater"                      "Sludge" 
#>                      Snow/Ice 
#>                    "Snow/Ice" 
#> 
#> $Atmospheric
#>    Indoor Air   Outdoor Air 
#>  "Indoor Air" "Outdoor Air" 
#> 
#> $Terrestrial
#>     Terrestrial Biological Residue              Soil H Horizon (Peat) 
#>   "Terrestrial Biological Residue"            "Soil H Horizon (Peat)" 
#>           Soil O Horizon (Organic)           Soil A Horizon (Topsoil) 
#>         "Soil O Horizon (Organic)"         "Soil A Horizon (Topsoil)" 
#>           Soil E Horizon (Mineral)           Soil S Horizon (Mineral) 
#>         "Soil E Horizon (Mineral)"         "Soil S Horizon (Mineral)" 
#>   Soil C Horizon (Parent Material)           Soil R Horizon (Bedrock) 
#> "Soil C Horizon (Parent Material)"         "Soil R Horizon (Bedrock)" 
#> 
#> $Biota
#>   Biota, Terrestrial       Biota, Aquatic   Biota, Atmospheric 
#> "Biota, Terrestrial"     "Biota, Aquatic" "Biota, Atmospheric" 
#>         Biota, Other 
#>       "Biota, Other"

extraction_protocols_vocabulary() # returns a tibble
#> # A tibble: 19 × 3
#>    Protocol_Type       Short_Name                        Long_Name              
#>    <chr>               <chr>                             <chr>                  
#>  1 Extraction Protocol Not relevant                      Not relevant           
#>  2 Extraction Protocol Not reported                      Not reported           
#>  3 Extraction Protocol None                              No extraction          
#>  4 Extraction Protocol Methanol                          Methanol extraction    
#>  5 Extraction Protocol Dichloromethane                   Dichloromethane extrac…
#>  6 Extraction Protocol SPE Isolute Env+                  Solid phase extraction…
#>  7 Extraction Protocol Membrane filtration 0.45um        Membrane filtration th…
#>  8 Extraction Protocol Membrane filtration 0.2um         Membrane filtration th…
#>  9 Extraction Protocol Membrane filtration               Membrane filtration    
#> 10 Extraction Protocol Filtration                        Filtration             
#> 11 Extraction Protocol Microwave-assisted acid digestion Microwave-assisted aci…
#> 12 Extraction Protocol Acid digestion                    Acid digestion         
#> 13 Extraction Protocol Pressurised liquid                Pressurised liquid ext…
#> 14 Extraction Protocol Ultrasonic                        Ultrasonic extraction  
#> 15 Extraction Protocol Soxhlet                           Soxhlet extraction     
#> 16 Extraction Protocol QuEChERS                          Quick easy cheap effec…
#> 17 Extraction Protocol Accelerated solvent               Accelerated solvent ex…
#> 18 Extraction Protocol Sequential extraction             Sequential extraction …
#> 19 Extraction Protocol Other                             Other

protocol_options_vocabulary() # calls bind_rows() on four *_protocol_vocabulary() functions to return a tibble
#> # A tibble: 75 × 3
#>    Protocol_Type     Short_Name     Long_Name            
#>    <chr>             <chr>          <chr>                
#>  1 Sampling Protocol Not relevant   Not relevant         
#>  2 Sampling Protocol Not reported   Not reported         
#>  3 Sampling Protocol Point          Point sampling       
#>  4 Sampling Protocol Composite      Composite sampling   
#>  5 Sampling Protocol Trawl          Trawl sampling       
#>  6 Sampling Protocol Grab           Grab sampling        
#>  7 Sampling Protocol Core           Core sampling        
#>  8 Sampling Protocol Seine net      Seine net sampling   
#>  9 Sampling Protocol Electrofishing Electrofishing       
#> 10 Sampling Protocol Plankton net   Plankton net sampling
#> # ℹ 65 more rows

coordinate_systems_vocabulary() # calls crsuggest::crs_sf, returns 4 rows
#> [1] "Not relevant"          "Not reported"          "WGS 84"               
#> [4] "ETRS89"                "WGS 84 / UTM zone 32N" "WGS 84 / UTM zone 33N"
#> [7] "WGS 84 / UTM zone 34N" "WGS 84 / UTM zone 35N" "Other"

ocean_vocabulary() # loads an RDS of IHO ocean definitions from /extdata/
#>   [1] "Not relevant"                "Not reported"               
#>   [3] "Other"                       "Torres Strait"              
#>   [5] "Tasman Sea"                  "Solomon Sea"                
#>   [7] "Ross Sea"                    "Coral Sea"                  
#>   [9] "Bismarck Sea"                "Bellingshausen Sea"         
#>  [11] "Bass Strait"                 "Amundsen Sea"               
#>  [13] "Timor Sea"                   "Sunda Strait"               
#>  [15] "Sumba Strait"                "Sulu Sea"                   
#>  [17] "Sulawesi Sea"                "South China Sea"            
#>  [19] "Singapore Strait"            "Seram Sea"                  
#>  [21] "Sawu Sea"                    "Natuna Sea"                 
#>  [23] "Mindanao Sea"                "Maluku Sea"                 
#>  [25] "Makasar Strait"              "Joseph Bonaparte Gulf"      
#>  [27] "Jawa Sea"                    "Halmahera Sea"              
#>  [29] "Gulf of Tonkin"              "Gulf of Tomini"             
#>  [31] "Gulf of Thailand"            "Gulf of Carpentaria"        
#>  [33] "Gulf of Bone"                "Gulf of Berau"              
#>  [35] "Flores Sea"                  "Banda Sea"                  
#>  [37] "Bali Sea"                    "Aru Sea"                    
#>  [39] "Arafura Sea"                 "South Pacific Ocean"        
#>  [41] "Yellow Sea"                  "Taiwan Strait"              
#>  [43] "Seto Naikai"                 "Sea of Okhotsk"             
#>  [45] "Philippine Sea"              "Liaodong Gulf"              
#>  [47] "Japan Sea"                   "Gulf of Tartary"            
#>  [49] "Gulf of Panama"              "Gulf of California"         
#>  [51] "Gulf of Alaska"              "East China Sea"             
#>  [53] "Coastal waters of SE Alaska" "Bo Sea"                     
#>  [55] "Bering Sea"                  "Anadyrskiy Gulf"            
#>  [57] "North Pacific Ocean"         "Pacific Ocean"              
#>  [59] "Weddell Sea"                 "Scotia Sea"                 
#>  [61] "Rio de la Plata"             "Lazarev Sea"                
#>  [63] "Drake Passage"               "South Atlantic Ocean"       
#>  [65] "Skagerrak"                   "Scotland Sea"               
#>  [67] "North Sea"                   "Labrador Sea"               
#>  [69] "Irish Sea"                   "Gulf of St. Lawrence"       
#>  [71] "Gulf of Mexico"              "Gulf of Guinea"             
#>  [73] "Greenland Sea"               "English Channel"            
#>  [75] "Celtic Sea"                  "Caribbean Sea"              
#>  [77] "Canarias Sea"                "Bristol Channel"            
#>  [79] "Bay of Fundy"                "Bay of Biscay"              
#>  [81] "North Atlantic Ocean"        "Atlantic Ocean"             
#>  [83] "Tirreno Sea"                 "Strait of Sicilia"          
#>  [85] "Strait of Gibraltar"         "Sea of Marmara"             
#>  [87] "Sea of Azov"                 "Ligure Sea"                 
#>  [89] "Ionian Sea"                  "Black Sea"                  
#>  [91] "Balear Sea"                  "Alboran Sea"                
#>  [93] "Aegean Sea"                  "Adriatic Sea"               
#>  [95] "Western Basin"               "Eastern Basin"              
#>  [97] "Mediterranean Sea"           "Mediterranean Region"       
#>  [99] "Riiser-Larsen Sea"           "Red Sea"                    
#> [101] "Prydz Bay"                   "Persian Gulf"               
#> [103] "Palk Bay, Palk Strait"       "Mozambique Channel"         
#> [105] "Malacca Strait"              "Lakshadweep Sea"            
#> [107] "Gulf of Suez"                "Gulf of Oman"               
#> [109] "Gulf of Mannar"              "Gulf of Aqaba"              
#> [111] "Gulf of Aden"                "Great Australian Bight"     
#> [113] "Cosmonaut Sea"               "Burma Sea, Andaman Sea"     
#> [115] "Bay of Bengal"               "Arabian Sea"                
#> [117] "Indian Ocean"                "The Sound"                  
#> [119] "The Little Belt"             "The Great Belt"             
#> [121] "Kattegat"                    "Gulf of Riga"               
#> [123] "Gulf of Finland"             "Gulf of Bothnia"            
#> [125] "Baltic Sea"                  "White Sea"                  
#> [127] "Norwegian Sea"               "Northwestern Passages"      
#> [129] "North Greenland Sea"         "Lincoln Sea"                
#> [131] "Laptev Sea"                  "Kara Sea"                   
#> [133] "Iceland Sea"                 "Hudson Strait"              
#> [135] "Hudson Bay"                  "East Siberian Sea"          
#> [137] "Davis Strait"                "Chukchi Sea"                
#> [139] "Beaufort Sea"                "Barents Sea"                
#> [141] "Baffin Bay"                  "Arctic Ocean"