Mauna Loa CO2 Growing Super-Exponentially?
Tags: Mauna Loa , Climate Change , Data , Exponential , Quadratic , Tikz
(Español: ¿Los niveles de CO2 crecen super-exponencial?)
As part of my work with the Mathematics Consortium Working Group (currently creating data-related activities for our Applied Calculus book), I discovered that $\text{CO}_2$ concentration, measured at the Mauna Loa observatory in Hawaii since the 1950s, can be modeled surprisingly well by a “super-exponential” function of the form $$f(t)=P_0(b+mt)^t.$$
Then, while convinced of how alarming and serious this was (super-exponential growth is as alarming as it sounds), I discovered one could also fit an “almost exponential” model which assumes exponential growth not from $0$, but from a base level $C$ instead, like this: $$g(t)= C+Ba^{t}.$$ This model may make more sense physically, since it seems reasonable to assume $\text{CO}_2$ levels were constant at $C$ ppm before humans started having an impact on the atmosphere, and then began to rise, apparently exponentially. So, the conclusion is that $\text{CO}_2$ concentration may be growing exponentially after all, which is very alarming, but not as alarming as “super-exponential” growth.
Finally, I found a quadratic function that also gives a good fit! $$ h(t)= a_2x^2+a_1x+a_0. $$ These three functions all give very good fits for almost $70$ years’ worth of data, and then have radically different long term behaviors. It seems like all bets are off on what could happen to $\text{CO}_2$ levels besides the fact that they are most definitely rising!
Here are some interesting details about this little modeling “telenovela”.
Source of the Data – the Keeling Curve
The graph below shows the monthly mean CO$_2$ concentration data the from Mauna Loa observatory in Hawaii, 11,135ft above sea level, which has been continuously collecting data related to climate change since the 1950s. This is known as the Keeling Curve, named after Charles David Keeling who started the monitoring program.
In what follows I will be looking at the annual mean CO$_2$ data, which smooths out seasonal variation. This data is downloaded from the official NOAA site.
First Model: the Super-Exponential $P_0(b+mt)^t$
Growth is not exponential
One way to realize that the $\text{CO}_2$ concentration is not exponential is by plotting it in a spreadsheet and asking it to give an exponential fit. Here is the result:
The fact that the best exponential fit according to a spreadsheet does not grow as fast as the data is a clear indication that faster-than-exponential growth could be happening. But this “could” has to be taken with care! Saying “the spreadsheet could not do it” is not a definite reason/explanation for anything.
Knowing for Sure That There is no Exponential Fit to the Data
The following is a log-plot of the same data (technically, a semi-log plot because only the vertical axis got a log scale). Forgive the lack of numbers on the vertical axis, I could not get LibreOffice to display any more of them on the axis. I tried plenty!
Here is a TikZ+pgfplots figure I created after having given up with LibreOffice. Such a huge TikZ+pgfplots fan! This is the source code.
The fact that the data still bends upward in these figures that use a log scale on the vertical axis is a definite indication that there is no exponential fit to the data. Why this is so: the log plot of any exponential function (i.e., of the form $P_0a^t$ for constants $P_0$ and $a$) will look like a straight line. You can see this in the figures above: the plot of $309.837e^{0.0045t}$ looks like a straight line instead of bending upwards.
In fact, this “straightening exponential functions” is precisely what placing a logarithmic scale on the vertical axis accomplishes – it straightens out exponential growth by speeding up the labels on the vertical axis in exactly the right way. Pretty cool if you ask me!
Looking at the Growth Factors in the Data
So, $\text{CO}_2$ seems to be growing faster than any exponential $P_0a^t$. How much faster?
One way to investigate this is to figure out how the $a$ would be changing with time if we forced the data to follow $P_0a^t$. Here is what I mean by this: note the $a$ in $P_0a^t$ is the factor by which the quantity gets multiplied in one unit of time, as the following equations show $$ \begin{aligned} (\text{quantity at time}~t+1) & = P_0a^{t+1} \\ & = P_0\left(a^t\times a\right) \\ & = a \times P_0a^t \\ & = a \times (\text{quantity at time}~t). \end{aligned} $$ So, what I mean by “how the $a$ is changing” in the data is “by what factor do I need to multiply the data of one year to get the next year?”. These multiplication factors from one year to the next are technically called the growth factors, and figuring them out is easy: you just divide the next years’ CO$_2$ levels by the current one!
Here is a plot of the growth factors as a function of time (so, a plot of “how the $a$ is changing”):
Note that the growth factors seem to be jumping up and down somewhat erratically, but there seems to be a clear upward trend. This upward trend is a manifestation that the data is not an exponential $P_0a^t$, since for exactly exponential data you would just be seeing a horizontal line (the constant value of $a$).
What if we Assume This Upward Trend in the Growth Factors Were Linear? Finding a Model for the Data of the Form $P_0(b+mt)^t$
The figure below shows a linear trend line for the growth factor data. It was found using a spreadsheet.
Identifying this trend line formula as the way the $a$ should change in a super-exponential model, we would guess a model for the CO$_2$ levels of the form $P_0(b+mt)^t$, as follows: $$ 315.98(1.002615+0.000059t)^t. $$ Here
- $P_0=315.98$ is the CO$_2$ measurement in Mauna Loa in the year 1950 corresponding to $t=0$, and
- $b+mt=1.002615+0.000059t$ is the trend line we found with the spreadsheet. It tells us the factor by which to multiply from one year to the next (increasing with the year $t$).
If one plots this model one sees it does not give a great fit, but tweaking the parameters a little, one does get a very good fit. The final model, graphed below together with the data, is given by $$f(t)=P_0(b+mt)^t=315.98(1.0026+0.00003t)^t.$$
Very good fit!
Second Model: the Exponential Plus Constant $C+Ba^{t}$
All that I showed above seems to indicate that no exponential function could possibly model the data. It is growing too fast. Right? Well, not really!
In reality, lines in log plots and exponential fits in spreadsheets only relate to the algebraic form $P_0a^t$ of exponential functions. Anything that is not of this form will look bendy in a log plot, and the fact that it curves upwards does not necessarily mean it is faster than any exponential. It just means that it is not modeled well by any exponential function (of the form $P_0a^t$). This is a common misconception with log plots, which I had myself!
As seen in this article, there may be a good fit of the form $$C+Ba^{t}$$ (I changed the letters from $C+P_0a^{t}$ to $C+Ba^{t}$ since the constant in front of the $a^t$ is no longer the amount when $t=0$).
Finding possible parameter values for $a,B$, and $C$ that give a good fit to the data is not as straightforward with a spreadsheet, because these do not have “give me a trend-line of the form $C+Ba^{t}$” functionality.
How I found good parameter values
I tried out different values of $C$ to make the quantity $[\text{CO}_2~\text{level}-C]$ behave like an exponential by computing its growth factors and trying to make them constant (this would not be possible if there was no good model of the form $C+Ba^{t}$).
Resulting model
Here is the model, and its plot:
$$g(t)= C+Ba^{t}=267.277+48.703(1.01823)^t$$
Very good fit too!
Difference Between the two Models (exponential + constant vs. super-exponential)
Here they are together (now as an awesome TikZ+pgfplots figure that allows me to generate figures for different ranges in a breeze). Making a plot with curves that are so similar and that still allows to differentiate between them was not that simple! Using different markers for each curve (also very important for accessibility related to color-blindness) definitely helps.
They look incredibly similar, but one is super exponential, and one is not, so the super-exponential one will eventually outpace the non-super one.
Here are some plots looking into the future
After 150 years (since 1950) the exponential one seems to be growing faster, but this can only be temporary…
After 350 years the super-exponential overtakes…
After 450 years the super-exponential starts to show what it means to be super-exponential…
And after 550 years the super-exponential is in a different league…
If the desmos applet below loads, you can play around with this by zooming in and out interactively. Note however that zooming out and seeing the figures I show above is not as simple as one might think – one needs to re-adjust the ranges of the axis constantly as one zooms out.
Follow this link if the interactive does not load.
One Final Note:
Looking 500 years into the future is pretty meaningless for this model. Note that the C0$_2$ level in ppm reaches a million ppm. This would say all air molecules would be CO$_2$ (silly), and anything beyond that million ppm is meaningless. The CO$_2$ is more likely to stabilize at some point in the future with an absurd theoretical limit of a million parts per million. Maybe it is already starting to stabilize!
Third Model: the Quadratic $a_2x^2+a_1x+a_0$
What I showed above indicates that an exponential+constant function can also model the data. So, it could be growing super-exponentially, or exponentially+constant. Surely nothing like a low degree polynomial, which grows much slower, could model the growth. Right? Well, not really!
The following quadratic model is found using a spreadsheet directly using the “trend-line” option with polynomial of degree 2 as a choice of trend. The resulting model is given by $$ h(t)=0.013t^2+0.8055t+315.5219 $$
Here is a plot together with the Mauna Loa data. Fantastic fit too!
Graphical Comparison Between the Three Models
They say a picture says more than a thousand words…
Here are the three models against together with data up to 2022. Basically indistinguishable!
150 years into the future since data collection began (1950)
350 years into the future since data collection began
550 years into the future since data collection began
Back into a more sensible range
Here is the plot of the three models up to the year 2050, about 25 years from now:
If you want to play with the ranges yourself, all you have to do is adjust the number in the \newcommand\Max{550}
in the TikZ+pgfplots
source file and then run pdf$\LaTeX$ on it.
What to Take Away From all of This
-
Which model is better? Any of the three the models is equally valid in the sense that they all give very tight fits to the data.
-
It is pretty amazing that there are such simple formulas that model the growth of CO$_2$ levels for over 70 years.
-
Even though the three models give radically different long-term behaviors, they do so at time-scales that are far larger than what it would make sense to expect the models to be valid.
-
The model that may make more sense physically is the exponential plus constant, because it starts off from a constant base level pre- antropocene, and then follows exponential growth like many modern-world phenomena (including world population).
-
CO$_2$ levels are most certainly rising at an increasing rate (graph bending upwards). Even if the rising rate stops increasing and they start leveling off, it is unlikely this will happen below 600–700 ppm (just from looking at the plots).