Recent evaluations have been mostly focused on evaluating the following:
- Repeatability of the procedure in terms of the variability associated with its possible tracks for each realization.
- Determining any additional gain (if any) in terms of returns as a function of the number of portfolios evaluated (J) at any given year.
- The existence of any indications between current year portfolios' medians and subsequent year (same portfolio) performance. Are there any associations and if so how weak or strong are these?
- Investigating a stable and plausible stopping rule and assessing how beneficial it might be to run the random search until this condition is met.
Several experiments were set up to determine if it would be worthwhile to inspect more randomly sought portfolios on a yearly basis as part of the overall procedure. A job simulating a total of 104 tracks (each consisting of J=25,000J=25,000 portfolios per year over a 43 year period, 1965 though 2008) was submitted to ADA and took approximately three days to complete. Several important observations can be made from the outcomes of these simulations (shown in Figures 3 and 4, below). We note that, here, we can exploit the independence regarding the portfolios evaluated to get 52 tracks of J=50,000J=50,000 portfolios each by combining pairs of J=25,000J=25,000 tracks and selecting the maximum of the pair (simply the maximum of a longer execution). Essentially this gives us information regarding what would have happened (in terms of the performance of the strategy should we have run it for twice as long). Analogously, tracks for J=100,000J=100,000 and J=200,000J=200,000 portfolios were constructed. Finally, some overall discussion of the results is given after the figures.
The total portfolio value was evaluated at the end of both years 2006 and 2008 and contrasted to both market performance (blue track) and the performance of a single track of a whopping J=2,600,000J=2,600,000 portfolios considered yearly (green track). As expected the variability of the procedure compounds as a function of time, and by chance we might under-perform the market. However, more often than not the procedure out-performed the market and by a quite reasonable amount. The proportion of portfolio-tracks simulated that were over an equally-weighted alternative at the end of 2006 was over 80% (for the cases where J=25,000J=25,000 and J=50,000J=50,000) and over 90% (for the cases where we assessed more randomly sought portfolios, i.e. J=100,000J=100,000, and J=200,000J=200,000). Also, there is weak evidence suggesting that, although running as many as J=2,600,000J=2,600,000 portfolios might at times outperform the market, this approach is generally not consistently higher on-average than considering tracks consisting of less yearly-evaluated portfolios.
Another rather interesting observation is made through the scatter-grams produced (see Figure 5, below) assessing the correlation between current year portfolio median and (same portfolio) next year performance contrasted to the performance of the S&P 500 index. The number of portfolios evaluated for this purpose was 2,000,0002,000,000 and the those that are highlighted as producing the maximum of the medians represent (<0.1%<0.1%, i.e. <2,000<2,000). The main purpose of this effort was to assess any associations between the current year medians as a forward-looking measure of portfolio performance (as we intend to pick the maximum and by chance we can pick portfolios of performance similar to those in the top 0.999 percentile). As expected the associations are weak, though not extremely weak (correlations are 0.2090.209 for the first case and -0.182-0.182 for the second), however can be noticed and depend highly of the year evaluated.
More often than not, we observed a positive correlation for the years inspected (the strongest correlations are those shown in the figures below). It turns out that for certain years (those with a negative correlation), we ought to utilize the “min-median” as a selection criterion. However, this cannot be known ex-ante, and the best we can do is utilize a measure that more often than not, produces above-average results. Here again, we can appreciate how these conflicting effects would average-out with time in a favorable direction, reiterating the fact that a strategy such as this one, if considered, should be evaluated over the long-haul.
Lastly, several evaluations were performed comparing the various max-medians of the portfolios simulated as a function of the number of portfolios run (i.e. J) and compared to the single-stock max-median (See Figure 6 below), which could, at least heuristically, serve as an upper bound. This resulted (empirically) to be somewhat unstable as there is no guarantee that any thresholds set in terms of percentage to the bound could be attained in any reasonable computing time, mainly due to the fact the after a reasonable amount of simulations (namely J=500,000J=500,000 and up to J=2,000,000J=2,000,000) the percentages of this single-stock max-median attained depended considerably on the year inspected, making a generalization impossible. The most recent evaluations were performed with stopping after 5 ticks past J=10,000J=10,000 simulations, which seems stable, however based on aforementioned results it seems to not provide any incremental benefit when contrasted to, for instance a hard-coded constant J stopping rule.