Sunday, October 12, 2008

How Certain People Get Rich

I've got a post in the works describing the plateau effect in the shock model, with an eye on making it quantitative. In the meantime, I have gotten so fed up with this financial crisis, that I had to understand something about the global situation. Note the name of this post.

How Certain People Get Rich, mathematically speaking

1. Assume that people accumulate wealth linearly at the start of their careers
   Income = k * t
Read this as: if you do work steadily, you will get rewarded

2. Assume that at some point, the wealthy start accumulating assets at an exponential rate, mainly via investments.
   Income = b * exp(a*t)
In other words, once you reach a threshold, you will get exponentially rewarded with little extra work.

3. Total income combines the two.
   Income = k*t + b*exp(a*t)
You would expect that linear income starts out more strongly than that of the compounded growth exponential. Unfortunately, this turns into a transcendental function to solve for t. This will get partitioned instead of solved in a later step.

4. Assume an exponential distribution of income dispersion at any point in time. We assume a wide dispersion such that the standard deviation equates to the mean. The accumulated income at a point disperses in rate R as:
   N = N0 exp (-C/R)
This basically follows the same distribution as runners finishing a marathon race as I posted before (or searching for oil discoveries). The fastest runners occur rarely, and so do the fastest income earners.

5. We need to change this from a rate distribution to a time distribution to compare it to readily available data. First we take the derivative in (4)
   dN/dR = C*N0 exp (-C/R)/R^2
6. Next convert this to a time dependence, R ~ 1/T
   dN/dt = dN/dR * dR/dT ~ exp (-C*T)
7. The integral of this equation becomes the histogram of the income distribution, which has some dependence on the effort T put into your work
   N(T) ~ exp (-C*T)
We have not set the term T yet, but in this case it relationally equates time to Income (see update at the end of the post).

8. Plotting this for linear effort T, where income directly relates to effort expended, you get the red curve for the cumulative histogram below. Note that 100% of earners make at least $0 and then it decreases rapidly for higher incomes.


9. But from assumption (3) we know that the scale of T needs to "accelerate" to meet the needs of the exponential growth term. So if we weakly include that term near the intersection
   T ~ ln(k*Income/b)
If we plot this term in step 7, replacing T, it shows up as the green curve above.

I could iterate and solve the transcendental equation in (3) numerically but for now, it shows the relationship quite clearly.

The idea that this derivation conveys predicts that the dispersion of efforts from many income-earners (via hard-work or luck) will result in a certain segment of the population having enough assets to trigger the compound growth dependence which supplements and finally outpaces their regular income.

This paper analyzes the USA's census income data in the sense of picking out some heuristics from physics and trying to fit the curves.

It tries to cast an understanding in terms of hand-wavy statistical mechanics, but I think I have provided a better and intuitive derivation. The paper also notes that the green curve likely has to do with the stock market -- a real compound growth situation if profits get reinvested. Realize that the bottom 97 to 98% of the population never get to invoke the strong compound growth that the elite 2% to 3% do. And the dispersion continues in this term leading to the Pareto law long tails.

I have a feeling that this break in the curve represents the separation between the "haves" and "have-nots" that exist in the stock market or in investing in general. In the current market, mutual funds no longer work to create the strong compound growth for anybody but the most committed and financially backed investors. I have a feeling that the market simply "pumps" the buy/sell cycle to attract investors. I recall discussing this stuff back here.



Wall Street learned well that too many individual investors during the stock run of the 90's used ordinary mutual funds and ended up reducing the "entry barrier" and thus removing the distinction between the "haves" and "have-nots". And it clearly could not have sustained itself as you could see how much capital it would introduce into the market. They have thus tipped the scales back to a flat ratchety growth curve designed to ensnare ordinary investors (not to mention 401Ks and Social Security). If you blink like many people did the past few weeks, you miss it. The last 10 years basically came out flat. I didn't lose anything on my own investments (as I tend to stay away from risk) but I managed a fund for a relative and screwed up royally, getting out only on the downslide and losing 16%. Even the fixed income mutual funds got hammered (don't do growth bonds). I feel bad for everyone involved and it will take me a while to get over this, and hope the world economy rights itself. I'd like to say I could learn from this and put it to good use next time, but that won't happen for me. These people think the money in your wallet belongs to them, and they will do anything to get at it. And the cretins who defaulted on the bond funds probably realize the retirees that depend on this don't go around wielding baseball bats and shotguns.

What a mess!

Update: The set of two equations is actually very easy to solve. Simply parametrically plotting the two equations (3) and (7) using time (t) as the independent parameter gives this result:

This formulation does wonders for understanding what causes what. I find it interesting that I can actually extract the compound growth rate of Wall Street investments from the curve. The long tail above comes about from a 30% yearly growth rate, matching some of the high end returns in the 90's.

I will likely update this post at a later time since it combines the two growth rates, constant and exponential, that we use in the Dispersive Discovery models. The big distinction between the that model and the income model, is that the latter uses both types of growth in the same model! This suggests that it likely happens in oil discovery, of which the empirical absence has puzzled me for awhile. I assume that if we look hard enough we may see it for oil too.

Update 2: After studying the dynamics a bit more, I have decided that the compound growth part does not play as big a role as I first thought. The idea for using this adjunct form of compensation first surfaced in the referenced paper where the authors saw effects somewhat correlated to the stock market. I believe that the high income part of the curve maps to the Dispersive Discovery model with a stochastic "well of knowledge" or "depth of wisdom". We don't actually reach a specific point at which we determine our salary, but instead grow that salary over a range of years that we would willingly try to advance. For a dispersion with a damped exponential well, this leads to a straight line on the log-log plot, which matches that of the high income part of the curve. The low income part of the curve has a more rapidly declining profile, indicative of a income earner that trains for a fixed period of time and becomes satisfied with the salary at that point. This period may match the length of time for a high school education or a trade school job. On the other hand, for the higher-income earner, the training may become a life-long process, so that the goal-posts continuously expand until they reach retirement age. So I don't believe that the change in slope has as much to do with stock market fortunes as it has to do with effort expended. To first order, this effort relates to education time, continuing education, and overtime worked, leading to increased wages with experience.

The decline rate under the conditions of a variable academic stint plus a job maturation period should lead to a slope of ~ 1/t2, close to that observed (usually above 1/t1.5).


Dispersion would indicate that only a few billionaire salaries would exist, and you can see that in the extrapolated curve. Other than that, the difficulty remains in absolutely aligning and correlating time effort with an average level of salary. Empirically it looks like a power-law somewhere around 1.

Thursday, October 9, 2008

Google Discovery Timeline

I found a quite eye-opening use of the news archive feature of news.google.com. I typed in search strings for "oil" and "discovery" and "timeline", and then when you invoke the timeline view, you see this:



This search essentially looks at references to dates in archived news articles over the years and it automatically creates a histogram of the relative counts. Interestingly, it shows peaks in much the same places as the classic oil discovery curve.



I believe that the curve has a USA bias as many of the large peaks correlate with specific lower-48 discoveries, such as the historically significant field found at Spindletop right after the year 1900. And of course it nails the original discovery in 1859. Yet it predicts pretty accurately the accepted worldwide peak discovery date in the early 1960's.

Google doesn't use an AI program specifically tailored for discovering oil of course, so they cannot figure out that the dates have to solely relate to that year's oil discoveries. So we see many references to recent dates, which probably relates more to the explosion of news sources during the information age.

I would classify this as another example of the "Wisdom of the Crowds" data mining approach. This works because the dates get reported by masses of people, and this large sample collectively improve the statistical accuracy of the estimate.

Update: The Google discovery peak for coal:
Free Image Hosting at www.ImageShack.us

Sunday, October 5, 2008

The Rethug Bizness Card

Not all of us can be as smart as Sarah Palin, who knows more about energy than anybody in the world. Therefore, I created a cheat-sheet in the form of a Rethug Bizness Card. You can discretely refer to it when you want to ridicule or embarrass Republicans that you happen across. Go to the link above and print this at 25% scale and it will fit in your wallet.





It is during the second great depression in the US, and the land is full of people who are now homeless.



"You ain't stopping at this hotel, kid. My hotel! The stars at night, I put 'em there. And I know the presidents, all of them. And I go where I damn well please. Even the chairman of the New York Central can't do it better. My road, kid, and I don't give lessons and I don't take partners. Your ass don't ride this train! "


"Stay off the tracks. Forget it. Its a bum's world for a bum. You'll never be Empress of the North Pole, kid. You had the juice, kid, but not the heart and they go together. You're all gas and no feel, and nobody can teach you that, not even A-No.1. So stay off the train, she'll throw you under for sure. Remember me for that. So long, kid."


Thursday, October 2, 2008

Asymmetries in Oil Discovery Curves

Dispersive Discovery curves for exponential growth search rates:


Dispersive Discovery curves for power-law growth search rates:

Tuesday, September 30, 2008

The Curse of Dispersion

For the Dispersive Discovery model, the ultimate reserve reaches a bounded value. However, the growth proceeds slow but steady enough that it can fool people into a sense of complacency. The fact that geologists themselves have considered reserve growth "an enigma" indicates that they can't quite make heads or tails of how it comes about. It occurred to me that the rate of reserve growth plays in people's perceptions in rather alarming ways.

Two analogies come to mind: the carrot dangling in front of a horse and the frog being slowly boiled alive.

The fact that no one understands the enigma lets people's imaginations run a little wild as they have no checks to balance their viewpoint. Yet the fact that we have a model allows us to develop a few benchmarks that we can use as a countering influence. One that I will describe briefly comes from considering the running average time it takes to find oil in a region using the simple "seam" variant of dispersive discovery. This average has an interesting tendency to creep upwards over time.
Solving a troubling enigma can lead to a blessing or a curse.
We can evaluate this average by evaluating the definite integral of T*exp(-T/t)/t/t multiplied by t. This grows as the logarithm of t for larger times. The positive trend looks at least somewhat encouraging, but then realize that we have to divide by t to properly scale the result. After we do this scaling then we can see the law of diminishing return acting on the result directly, since the divisor t has a greater magnitude than the numerator log(t). This factor log(t)/t becomes a valuable measure to estimate return on investment.

However, non-mathematical analyses and intuition can lead to optimistic outlooks. They may forget that the beneficial dispersed growth gets more than compensated by the effort to find it and that divergence becomes greater with time. Until we get more people on board understanding the real effects we will have to suffer from the "Curse of Dispersion" -- dispersive reserve growth acts as the carrot in front of the horse, while at the same time quietly boiling us alive if we ignore its rather obvious diminishing returns.


"The picture's pretty bleak, gentlemen. ..the world's climates are changing, the mammals are taking over, and we all have a brain about the size of a walnut."

Update: Added figure

Monday, September 29, 2008

Network Dispersion

In the last post I gave examples of dispersion in the finishing times for marathon races. The dispersion results from human variability but due to (self) censoring in the population of runners, we never see complete dispersion like we would in oil exploration and discovery.

So to give a hint as to how complete dispersion works, we take an example from a physical process, that of network RTT (round-trip time) dispersion. This figure provides a classic example of packet dispersion, caused by slight differences in transmission rates of messages (taken from SLAC experiments).


I assumed a simple dispersion of rates, using T=50 microseconds, and determined the following fit:

The equation T*exp(-T/t)/t/t matches that which I use for the seam or "sweet-spot" variant of the Dispersive Discovery model. Although drawn on a semi-log scale, that shape has much similarity to a shocklet kernel with an immediate production drain on the receiving end. It also illustrates how math that works on a macroscopic scale also often applies to a microscopic scale.

I thought I came up with something new here, since the authors of the tutorial chose to use instead a heuristic (of course, similar to what everyone does for oil depletion analysis) but I believe the phenomena has some basis in what researchers refer to as "asymptotic dispersion rate" and "average dispersion rate".

To commemorate yesterday's Black Monday, I dedicate this derivation to everyone who finds it insane that no one can definitely illustrate what kind of mathematical mess that we have gotten the world financial market into. Bunch of flaming idiots.

Friday, September 26, 2008

SUMMIT ENERGY CORPORATION
is an energy services company devoted to helping to meet world energy needs by engaging in energy development and delivery activities and by developing and marketing alternative and renewable fuels. Our activities include oil production services, trading of crude oil, retail petro-gasoline marketing, production and marketing of bio-fuels plus financing of energy development. Summit operates primarily in the United States and throughout Africa. It is essential for the global economy, including both developed economies like the United States and developing economies in countries across Africa, to be able to access dependable, clean and inexpensive energy sources. Such reliable and reasonably priced energy enables economic growth and innovation, modern transportation, and 21st century living. In Africa, well-managed energy can be a positive stimulusv to sustained economic growth. More growth demands more energy. Summit Energy uses its resources and expertise to find, develop and deliver the fuel to sustain today’s economy and stimulate tomorrow’s growth.