Skip to content

Australia Greenhouse Gasses (GHG) Analysis, Visualization and CO2 Prediction

Notifications You must be signed in to change notification settings

salmiah-ls/Australia-Greenhouse-Gasses-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 

Repository files navigation

Australia Greenhouse Gasses Analysis

This repository contains analysis of Greenhouse Gasses emission of Australia.

Background

Australia has revised target of reducing emissions by 43 per cent below 2005 levels by 2030. Australia's quarterly emissions have fallen by 21% since 2005. Australia has reduced its emissions at a faster rate than many similar developed countries.

Link: Prime Minister of Australia

Dataset

For this project, I use dataset of Greenhouse Gas Emissions from Organisation for Economic Co-operation and Development (OECD). Link to dataset.

Tools

  1. Microsoft Excel to load and transform data.
  2. Microsoft Power BI to create visualization.
  3. R to create model and predict CO2 value.

Analysis

After analyzing data Australia GHG data, below are charts showing total emissions, LULUCF (Land Use, Land-Use Change and Forestry), and pollutants, from 1990 to 2020







Based on analysis above, I will build linear regression model to predict the maximum amount of CO2 emission the country has to keep each year in order to achive the target GHG emission in 2030. As we know from the report, there are 6 gasses that contribute to total GHG emission, plus LULUCF factor. Among the 6 gasses, the main contributor is CO2.

GHG Emission

  • First, calculate the target GHG emission for year 2030. It is 43% below emission in 2005. Hence, the emission value is 57% of 2005 emission (608646.542 thousands tonnes).
    2030 GHG emission = 57% * 608646.542 = 346928.5289 thousands tonnes
  • Calculate the difference between 2030 emission and 2020 emission (487591.186 thousands tonnes).
    487591.186 - 346928.5289 = 140662.66 thousands tonnes
  • From 2020 to 2030, there are 10 years interval i.e. 2021, 2022, ... 2030. So, I divide the difference value above with 10. 140662.66 / 10 = 14066.266
  • For LULUCF value, I use constant value and it is same with the 2020 LULUCF value, -40145.861 thousands tonnes, assuming that Australia Government is consistent to keep the forest area and green space in the country.
  • Calculate the GHG emission excluding LULUCF, by substracting the GHG emission including LULUCF above with LULUCF value. Here are the complete values of GHG emission based on calculation above: GHG emission values

Predicting CO2 Emission with R

After retrieving the values of GHG emission for 2021 to 2030, next step is building the linear regeression model with R to predict the CO2 emission for each of the years. This values are useful for all sectors to measure their CO2 emission limit in order to achieve the country's GHG emission target in 2030.

I use 3 libraries here:

  • readxl to read Excel files,
  • tidyverse for general analysis,
  • chemCal to use inverse.predict() function, because here I will predict the CO2 values (x) based on GHG values (y).

Plot the CO2 and GHG emission (values in thousands tonnes):
ggplot(data = NULL, aes(x = co2_value, y = emission_value)) + geom_point() + geom_smooth(method = 'lm')

CO2 and GHG linear regression
Based on plot above, we can seen that CO2 and GHG emission can be modeled with Linear Regression.

Calculate the correlation value between CO2 and GHG:
cor(emission_value, co2_value)

0.9974014

The correlation value shows that CO2 and GHG emission have strong positive correlation. It is 0.9974014 and almost +1.

Build the linear regression model:
emission_co2_model <- lm(emission_value ~ co2_value)
summary(emission_co2_model)

CO2 and GHG correlation

Next step, enter the values of GHG emission (without LULUCF) from table above, I assign them as ynew variable:
ynew <- c(513670.8, 499604.5, 485538.2, 471472, 457405.7, 443339.5, 429273.2, 415206.9, 401140.7, 387074.4)

Predict the values of CO2, xpred variable, based on the GHG emission values above:
xpred <- t(sapply(ynew, function(y) inverse.predict(emission_co2_model, y)[1:3]))

Here are the predicted CO2 values (in thousands tonnes) with 95% confidence interval for the next 10 years (2021 to 2030):
CO2 predicted values and CI values

Since we want the maximum limit of CO2, sum the prediction values and the confidence values for each year. Here is the final table:
CO2 maximum values for each year

Visualize the prediction values of CO2 with Power BI:


About

Australia Greenhouse Gasses (GHG) Analysis, Visualization and CO2 Prediction

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published