Chapter 10 - Confidence-Weighted Mean Reversion (1/3)

Chapter 10

Confidence-Weighted Mean Reversion

Empirical evidence (Borodin et al. 2004) shows that stock price relatives may follow
the mean reversion property, which has not been fully exploited by existing strategies.
Moreover, all existing online portfolio selection (OLPS) strategies focus only on the
first-order information of a portfolio vector, though second-order information may
also benefit a strategy. This chapter proposes a novel strategy named "confidence-weighted
mean reversion" (CWMR) (Li et al. 2011b, 2013). Inspired by the mean
reversion principle in finance and the confidence-weighted (CW) online machine learning
technique (Crammer et al. 2008; Dredze et al. 2008), CWMR models the portfolio vector
as a Gaussian distribution and sequentially updates the distribution following the
mean reversion principle. Analysis of CWMR's closed-form updates clearly reflects
the mean reversion trading idea and the interaction of first-order and second-order
information. Extensive experiments in Part IV on various real markets show that
CWMR is able to effectively exploit the power of mean reversion and second-order
information, and is superior to the state-of-the-art techniques.

This chapter is organized as follows. Section 10.1 motivates the proposed CWMR

strategy. Section 10.2 formulates the strategy, and Section 10.3 derives the algorithms

based on the formulations. Section 10.4 further analyzes the algorithms. Finally,

Section 10.5 summarizes this chapter and indicates future directions.

10.1 Preliminaries

10.1.1 Motivation

The proposed method, similar to passive–aggressive mean reversion (PAMR), is based
on the mean reversion trading idea, which, in the context of a portfolio of multiple assets,
implies that good-performing assets tend to perform worse than others in subsequent
periods, and poor-performing assets are inclined to perform better. Thus, to maximize
the next portfolio return, we could minimize the expected return with respect to
today's price relatives, since the next price relatives tend to revert. This seems somewhat
counterintuitive, but, according to Lo and MacKinlay (1990), the effectiveness of
mean reversion is due to the positive cross-autocovariances across assets.

T&F Cat #K23731 — K23731_C010 — page 71 — 9/28/2015 — 21:24


Besides the virtual example in Section 9.1.2, we empirically analyze real market
data to show that mean reversion does exist.∗ Although measuring mean reversion in
a single stock is well studied (Poterba and Summers 1988; Chaudhuri and Wu 2003;
Hillebrand 2003), the study of mean reversion in a portfolio is rare. Since, in our
formulation, the portfolio is long-only,† we focus on whether we can obtain a higher return
than the market by investing in poor-performing assets.‡ With a threshold $\delta$, let $A_t$
be the set of poor-performing stocks ($x_{t,i} < \delta$), $B_t$ be the set of mean reversion (MR)
stocks ($x_{t,i} < \delta$ and $x_{t+1,i} > 1$), $C_t$ be the set of non-mean reversion (non-MR) stocks
($x_{t,i} < \delta$ and $x_{t+1,i} < 1$), and $D_t$ be the set of remaining stocks ($x_{t,i} < \delta$ and $x_{t+1,i} = 1$).

On period $t$, we calculate the percentage of a set $U$, which can be either $A$, $B$,
$C$, or $D$, as $P_t(U) = |U_t| / |A_t|$, where $|\cdot|$ denotes the cardinality of a set, and the
gain of uniform investment in the set as $G_t(U) = \sum_{i \in U_t} x_{t+1,i} / |U_t|$. For a total of $n$
periods, we further calculate their average values as

\[
\bar{P}(U) = \frac{1}{n-1} \sum_{t=1}^{n-1} P_t(U)
\quad \text{and} \quad
\bar{G}(U) = \frac{1}{n-1} \sum_{t=1}^{n-1} G_t(U),
\]

respectively. In particular, we refer to the percentage of mean reversion stocks as
$\bar{P}(B)$, and the gain of mean reversion stocks as $\bar{G}(B)$. To
show whether buying poor-performing stocks is profitable, we calculate the average
gain of uniform investment in poor-performing stocks, denoted as $\bar{G}(A)$, and the
average gain of uniform investment in the whole market, denoted as $\bar{G}(\text{Market})$.
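The statistics defined above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' test program: the function name `mean_reversion_stats` and the list-of-lists input are hypothetical, periods with an empty set $A_t$ are skipped (the text does not say how they are handled), and the market gain is taken as the plain average of next-period price relatives over all assets.

```python
from typing import Dict, List


def _mean(values: List[float]) -> float:
    """Arithmetic mean; NaN for an empty list."""
    return sum(values) / len(values) if values else float("nan")


def mean_reversion_stats(x: List[List[float]], delta: float = 0.985) -> Dict[str, float]:
    """Average mean reversion statistics for a price-relative matrix.

    x[t][i] is the price relative of asset i on period t (0-based here).
    """
    p_b, p_c, p_d, g_a, g_market = [], [], [], [], []
    for t in range(len(x) - 1):
        nxt = x[t + 1]
        # Assumption: market gain = uniform investment in every asset.
        g_market.append(_mean(nxt))
        a = [i for i, rel in enumerate(x[t]) if rel < delta]  # set A_t
        if not a:
            continue  # assumption: P_t and G_t are undefined, so skip the period
        a_next = [nxt[i] for i in a]
        p_b.append(sum(v > 1 for v in a_next) / len(a))   # P_t(B)
        p_c.append(sum(v < 1 for v in a_next) / len(a))   # P_t(C)
        p_d.append(sum(v == 1 for v in a_next) / len(a))  # P_t(D)
        g_a.append(_mean(a_next))                         # G_t(A)
    return {
        "P(B)": _mean(p_b),
        "P(C)": _mean(p_c),
        "P(D)": _mean(p_d),
        "G(A)": _mean(g_a),
        "G(Market)": _mean(g_market),
    }


# Toy data: 3 periods, 2 assets (values are made up for illustration).
toy = [[0.9, 1.1], [1.2, 0.95], [1.0, 1.0]]
stats = mean_reversion_stats(toy)
```

On the toy data, the first asset falls below the threshold on the first period and rebounds on the next, so it counts toward $B_t$; the second asset falls below the threshold on the second period and then stays flat, so it counts toward $D_t$.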

Table 10.1 gives the statistics on six real market daily datasets.§ On the one hand,
except for the DJIA dataset (please refer to Chapter 12 for details), mean reversion
does exist ($\bar{P}(B) > \bar{P}(C)$),¶ and uniform investment in poor-performing stocks
provides a greater profit∗∗ than the market ($\bar{G}(A) > \bar{G}(\text{Market})$). On the other hand, the
test failed on the DJIA dataset, and in the following empirical evaluations, CWMR also
failed badly on this dataset, which motivates our next proposed method in Chapter 11.

Moreover, all state-of-the-art approaches exploit only first-order information of a
portfolio vector, while higher-order information may also benefit the portfolio selection
task (Harvey et al. 2010). Evidence (Chopra and Ziemba 1993) shows that in
portfolio selection, errors in variance have about 5% of the impact on the objective value
that errors in mean do. For simplicity, we exploit variance information while ignoring
covariance information, which has a much smaller impact on the final objective
value. To take advantage of both first- and second-order information, we adopt CW
online learning (Crammer et al. 2008; Dredze et al. 2008), which was originally proposed
for classification. CW's basic idea is to maintain a Gaussian distribution for a
classifier and sequentially update the distribution, similarly to passive–aggressive (PA)
learning (Crammer et al. 2006). Thus, CW learning can take advantage of both first-
and second-order information of the classifier.

Table 10.1 Summary of mean reversion statistics on real markets

Dataset     $\bar{P}(B)$  $\bar{G}(B)$  $\bar{P}(C)$  $\bar{G}(C)$  $\bar{P}(D)$  $\bar{G}(A)$  $\bar{G}(\text{Market})$
TSE         42.89%        1.022370      41.63%        0.978395      15.48%        1.000598      1.000405
MSCI        54.19%        1.015737      45.05%        0.984046      0.76%         1.001107      1.000053
NYSE (O)    43.43%        1.021599      39.86%        0.981949      16.71%        1.002523      1.000620
NYSE (N)    47.87%        1.019624      43.19%        0.982050      8.93%         1.001644      1.000610
DJIA        48.54%        1.018545      50.57%        0.980843      0.90%         0.999398      0.999719
SP500       50.20%        1.020692      47.96%        0.980502      1.84%         1.000881      1.000488

∗ The test program and datasets will be available at http://stevenhoi.org/olps

† Long-only means if something is considered undervalued, managers would invest; if something is
considered overvalued, managers would avoid it.

‡ If short is allowed, we can also show whether shorting good-performing stocks provides a higher
return.

§ We list their details in Section 12.2. We empirically choose $\delta = 0.985$ on all datasets. As we have
tested, other thresholds also yield similar observations. For tests on other frequencies, please refer to
Li et al. (2013).

¶ This indicates a higher probability of reversion, but we have no theoretical guarantee for the criteria.

∗∗ The absolute return on the daily scale is relatively small. However, considering the net return, such a
strategy makes a much higher profit than the market does. Moreover, with compounding, such small absolute
differences will result in huge differences over time.

To address the above two concerns, we present a novel OLPS method named
CWMR. To exploit the first- and second-order information of a portfolio vector,
we model the portfolio vector as a Gaussian distribution, which is probably the most
widely studied distribution and can satisfy our motivations. We do not consider higher
orders or other distributions because of their complexity. Then, we sequentially update the
distribution following the mean reversion principle. On the one hand, we keep the
previous distribution if the portfolio is profitable by using mean reversion. On the other
hand, we move the distribution to a new distribution such that the new distribution is
expected to make a profit while staying close to the previous distribution. Different
from CRP and Anticor, CWMR actively exploits the mean reversion property of
financial markets with a powerful learning method. Moreover, compared with all
existing algorithms, including PAMR, which consider only the first-order information,
CWMR exploits both the first- and second-order information of a portfolio vector.

10.2 Formulations

We model $b$ as a Gaussian distribution with mean $\mu \in \mathbb{R}^m$ and diagonal covariance
matrix $\Sigma \in \mathbb{R}^{m \times m}$ with nonzero diagonal elements and zero off-diagonal elements.
The $i$-th element of $\mu$ represents the proportion of the $i$-th asset. The $i$-th diagonal
term of $\Sigma$ stands for the confidence in the $i$-th proportion. The smaller the diagonal
term, the higher the confidence we have in the corresponding $\mu_i$.

At the beginning of period $t$, we figure out a $b_t$ based on the distribution $N(\mu_t, \Sigma_t)$,
that is, $b_t \sim N(\mu_t, \Sigma_t)$. Then, after $x_t$ is revealed, the wealth increases by a factor
of $b_t^\top x_t$. It is straightforward that the return $D = b_t^\top x_t$ can be viewed as a random
variable of the following univariate Gaussian distribution:

\[
D \sim N\left( \mu_t^\top x_t, \; x_t^\top \Sigma_t x_t \right).
\]

Its mean is the return of the mean vector, and its variance is proportional to the projection
of $x_t$ on $\Sigma_t$.
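As a quick numerical check of the return distribution above, the following sketch computes the mean $\mu^\top x$ and variance $x^\top \Sigma x$ for a diagonal covariance and compares them with sampled returns. The function names and all numbers are hypothetical and chosen only for illustration; the sampled portfolios are raw Gaussian draws and are not projected onto the simplex.

```python
import math
import random
import statistics
from typing import List, Tuple


def return_distribution(mu: List[float], sigma_diag: List[float],
                        x: List[float]) -> Tuple[float, float]:
    """Mean and variance of D = b^T x when b ~ N(mu, diag(sigma_diag))."""
    mean = sum(m * xi for m, xi in zip(mu, x))
    var = sum(s * xi * xi for s, xi in zip(sigma_diag, x))
    return mean, var


def sample_returns(mu: List[float], sigma_diag: List[float], x: List[float],
                   n: int = 20000, seed: int = 0) -> List[float]:
    """Draw n portfolios coordinate-wise and return their returns b^T x."""
    rng = random.Random(seed)
    stds = [math.sqrt(s) for s in sigma_diag]
    return [
        sum(rng.gauss(m, sd) * xi for m, sd, xi in zip(mu, stds, x))
        for _ in range(n)
    ]


# Illustrative values (assumptions, not taken from the text).
mu = [0.5, 0.5]
sigma_diag = [0.01, 0.04]
x_t = [1.1, 0.9]

mean_d, var_d = return_distribution(mu, sigma_diag, x_t)
draws = sample_returns(mu, sigma_diag, x_t)
sample_mean = statistics.fmean(draws)
sample_var = statistics.pvariance(draws)
```

Because the covariance is diagonal, the variance reduces to a weighted sum of squared price relatives, so assets held with low confidence (large diagonal terms) contribute most to the uncertainty of the return.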


According to the mean reversion idea, the probability of a profitable $b$ with respect
to a predefined mean reversion threshold $\epsilon$ is defined as

\[
\Pr_{b \sim N(\mu, \Sigma)}[D \le \epsilon] = \Pr_{b \sim N(\mu, \Sigma)}\left[ b^\top x_t \le \epsilon \right].
\]

For simplicity, we write $\Pr[b^\top x_t \le \epsilon]$ instead. Note that we are considering the mean
reversion profitability in a portfolio consisting of multiple stocks; thus, this definition
is equivalent to the motivating idea of buying poor-performing stocks or, equivalently,
selling good-performing stocks.

The algorithm adjusts the distribution to ensure that the probability of a mean
reversion profitable $b$ is higher than a confidence-level parameter $\theta \in [0, 1]$:

\[
\Pr_{b \sim N(\mu, \Sigma)}\left[ b^\top x_t \le \epsilon \right] \ge \theta.
\]

This is somewhat counterintuitive but reasonable with respect to the mean reversion
idea. If it is highly probable that the portfolio return $b^\top x_t$ is less than a threshold, it
is also highly probable that its next return based on $x_{t+1}$ tends to be higher, since $x_{t+1}$
will revert.

Then, following the intuition underlying PA algorithms (Crammer et al. 2006),
our algorithm chooses a distribution closest to the current distribution $N(\mu_t, \Sigma_t)$ in
terms of Kullback–Leibler (KL) divergence (Kullback and Leibler 1951). As a result,
at the end of period $t$, the algorithm updates the distribution by solving the following
optimization problem.

The Raw Optimization Problem: CWMR

\[
\begin{aligned}
(\mu_{t+1}, \Sigma_{t+1}) = \arg\min_{\mu, \Sigma} \;\; & D_{KL}\left( N(\mu, \Sigma) \,\|\, N(\mu_t, \Sigma_t) \right) \\
\text{s.t.} \;\; & \Pr\left[ b^\top x_t \le \epsilon \right] \ge \theta \\
& \mu \in \Delta_m
\end{aligned}
\tag{10.1}
\]

The optimization problem (10.1) clearly reflects our motivation. On the one hand,
if the current $\mu_t$ is mean reversion profitable, that is, the first constraint is satisfied,
CWMR chooses the same distribution, resulting in a passive CRP strategy. On the
other hand, if $\mu_t$ does not satisfy the mean reversion constraint, CWMR tries to
figure out a new distribution, which is expected to profit and is not far from the current
distribution.

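Let us reformulate the objective and constraints. For the objective part, the KL
divergence between two Gaussian distributions can be rewritten as

\[
D_{KL}\left( N(\mu, \Sigma) \,\|\, N(\mu_t, \Sigma_t) \right)
= \frac{1}{2} \left( \log \frac{\det \Sigma_t}{\det \Sigma}
+ \operatorname{Tr}\left( \Sigma_t^{-1} \Sigma \right)
+ (\mu_t - \mu)^\top \Sigma_t^{-1} (\mu_t - \mu) - d \right).
\]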


For the constraint part, since $b \sim N(\mu, \Sigma)$, $b^\top x_t$ has a univariate Gaussian
distribution with mean $\mu_D = \mu^\top x_t$ and variance $\sigma_D^2 = x_t^\top \Sigma x_t$. Thus, the probability
of a return less than $\epsilon$ is

\[
\Pr[D \le \epsilon] = \Pr\left[ \frac{D - \mu_D}{\sigma_D} \le \frac{\epsilon - \mu_D}{\sigma_D} \right].
\]

In the preceding equation, $\frac{D - \mu_D}{\sigma_D}$ is a normally distributed random variable; thus,
the probability equals $\Phi\left( \frac{\epsilon - \mu_D}{\sigma_D} \right)$, where $\Phi$ is the cumulative distribution function of
the Gaussian distribution. As a result, we can rewrite the constraint as

\[
\frac{\epsilon - \mu_D}{\sigma_D} \ge \Phi^{-1}(\theta).
\]

Substituting $\mu_D$ and $\sigma_D$ by their definitions and rearranging the terms, we can obtain

\[
\epsilon - \mu^\top x_t \ge \phi \sqrt{x_t^\top \Sigma x_t},
\]

where $\phi = \Phi^{-1}(\theta)$. Clearly, we require that the weighted summation of return and
standard deviation is less than a threshold. We can now rewrite the preceding
optimization problem.

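The Revised Optimization Problem: CWMR

\[
\begin{aligned}
(\mu_{t+1}, \Sigma_{t+1}) = \arg\min_{\mu, \Sigma} \;\; & \frac{1}{2} \left( \log \frac{\det \Sigma_t}{\det \Sigma}
+ \operatorname{Tr}\left( \Sigma_t^{-1} \Sigma \right)
+ (\mu_t - \mu)^\top \Sigma_t^{-1} (\mu_t - \mu) \right) \\
\text{s.t.} \;\; & \epsilon - \mu^\top x_t \ge \phi \sqrt{x_t^\top \Sigma x_t} \\
& \mu^\top \mathbf{1} = 1, \;\; \mu \succeq 0.
\end{aligned}
\tag{10.2}
\]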
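The equivalence between the probabilistic constraint and its deterministic form can be checked numerically. The sketch below is an illustration only: it assumes a diagonal covariance, uses the standard library's `statistics.NormalDist` for $\Phi$ and $\Phi^{-1}$, requires $\theta$ strictly between 0 and 1, and its function names and numbers are hypothetical.

```python
import math
from statistics import NormalDist
from typing import List


def _moments(mu: List[float], sigma_diag: List[float], x: List[float]):
    """Mean and standard deviation of D = b^T x for b ~ N(mu, diag(sigma_diag))."""
    mean_d = sum(m * xi for m, xi in zip(mu, x))
    std_d = math.sqrt(sum(s * xi * xi for s, xi in zip(sigma_diag, x)))
    return mean_d, std_d


def prob_return_below(mu, sigma_diag, x, eps: float) -> float:
    """Pr[D <= eps], evaluated with the Gaussian CDF."""
    mean_d, std_d = _moments(mu, sigma_diag, x)
    return NormalDist(mean_d, std_d).cdf(eps)


def constraint_satisfied(mu, sigma_diag, x, eps: float, theta: float) -> bool:
    """Deterministic form: eps - mu^T x >= phi * sqrt(x^T Sigma x)."""
    phi = NormalDist().inv_cdf(theta)  # phi = Phi^{-1}(theta), needs 0 < theta < 1
    mean_d, std_d = _moments(mu, sigma_diag, x)
    return eps - mean_d >= phi * std_d


# Illustrative values (assumptions, not taken from the text).
mu = [0.5, 0.5]
sigma_diag = [0.01, 0.04]
x_t = [0.9, 0.8]
eps = 1.0

prob = prob_return_below(mu, sigma_diag, x_t, eps)
ok_loose = constraint_satisfied(mu, sigma_diag, x_t, eps, theta=0.7)
ok_strict = constraint_satisfied(mu, sigma_diag, x_t, eps, theta=0.8)
```

With these numbers the probability lies between 0.7 and 0.8, so the constraint holds for the looser confidence level and fails for the stricter one, matching a direct comparison of the probability with $\theta$.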

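For the optimization problem (10.2), the first constraint is not convex in $\Sigma$;
therefore, we have two ways to handle it. The first way (Dredze et al. 2008) is to linearize it
by omitting the square root, that is, $\epsilon - \mu^\top x_t \ge \phi \, x_t^\top \Sigma x_t$. As a result, we can finalize
the first optimization problem, named CWMR-Var.

The Final Optimization Problem 1: CWMR-Var

\[
\begin{aligned}
(\mu_{t+1}, \Sigma_{t+1}) = \arg\min_{\mu, \Sigma} \;\; & \frac{1}{2} \left( \log \frac{\det \Sigma_t}{\det \Sigma}
+ \operatorname{Tr}\left( \Sigma_t^{-1} \Sigma \right)
+ (\mu_t - \mu)^\top \Sigma_t^{-1} (\mu_t - \mu) \right) \\
\text{s.t.} \;\; & \epsilon - \mu^\top x_t \ge \phi \, x_t^\top \Sigma x_t \\
& \mu^\top \mathbf{1} = 1, \;\; \mu \succeq 0.
\end{aligned}
\tag{10.3}
\]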

The second reformulation (Crammer et al. 2008) is to decompose the positive
semidefinite (PSD) $\Sigma$, that is, $\Sigma = \Upsilon^2$ with $\Upsilon = Q \, \mathrm{diag}(\lambda_1^{1/2}, \ldots, \lambda_m^{1/2}) \, Q^\top$, where
$Q$ is orthonormal and $\lambda_1, \ldots, \lambda_m$ are the eigenvalues of $\Sigma$, and thus $\Upsilon$ is also PSD. This
reformulation yields the second final optimization problem, named CWMR-Stdev.
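For the diagonal $\Sigma$ assumed in this chapter, the decomposition is particularly simple: $Q$ can be taken as the identity, so $\Upsilon$ is the diagonal matrix of square roots. The sketch below, with hypothetical names and numbers, checks that $\sqrt{x^\top \Sigma x}$ equals the Euclidean norm of $\Upsilon x$, which follows from $\Upsilon$ being symmetric.

```python
import math
from typing import List


def upsilon_diag(sigma_diag: List[float]) -> List[float]:
    """Diagonal of Upsilon for a diagonal PSD Sigma (Q is the identity)."""
    return [math.sqrt(s) for s in sigma_diag]


def stdev_direct(sigma_diag: List[float], x: List[float]) -> float:
    """sqrt(x^T Sigma x) computed from Sigma."""
    return math.sqrt(sum(s * xi * xi for s, xi in zip(sigma_diag, x)))


def stdev_via_upsilon(sigma_diag: List[float], x: List[float]) -> float:
    """Euclidean norm of Upsilon x, computed from the decomposition."""
    ups = upsilon_diag(sigma_diag)
    return math.sqrt(sum((u * xi) ** 2 for u, xi in zip(ups, x)))


# Illustrative values (assumptions, not taken from the text).
sigma_diag = [0.01, 0.04, 0.09]
x_t = [1.1, 0.9, 1.0]
direct = stdev_direct(sigma_diag, x_t)
via_upsilon = stdev_via_upsilon(sigma_diag, x_t)
```

Writing the standard deviation as a norm of $\Upsilon x$ is what lets the square-root term be handled in terms of $\Upsilon$ rather than $\Sigma$.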
