On the explanatory power of firm-specific variables in cross

Journal of Empirical Finance 16 (2009) 306–317
Contents lists available at ScienceDirect
Journal of Empirical Finance
j o u r n a l h o m e p a g e : w w w. e l s e v i e r. c o m / l o c a t e / j e m p f i n
On the explanatory power of firm-specific variables in cross-sections
of expected returns ☆
Chu Zhang ⁎
Department of Finance, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
a r t i c l e
i n f o
Article history:
Received 4 September 2007
Received in revised form 2 September 2008
Accepted 7 October 2008
Available online 15 October 2008
JEL classification:
G12
Keywords:
Factor-mimicking portfolios
Firm-specific variables
Principal component factors
a b s t r a c t
This paper pertains to the controversy surrounding the explanatory power of certain firmspecific variables such as size and the book-to-market ratio in cross-sections of average stock
returns. To investigate whether these firm-specific variables capture the sensitivity of returns to
unobserved systematic risk, two sets of principal component factors are used. The first set is
constructed from individual stock returns and the second set is from size- and book-to-marketsorted portfolio returns. The evidence from the first set of factors shows that size and the bookto-market ratio have little to do with factor betas. The evidence from the second set of factors
shows that the forces underlying size and the book-to-market ratio are indeed systematic risks,
although they explain very little return variation at the firm level, and that the betas of size- and
book-to-market-sorted portfolio returns with respect to the corresponding systematic factors
do explain the size and book-to-market effects.
© 2008 Elsevier B.V. All rights reserved.
The finding in Fama and French (1992) that two firm-specific variables, size and the book-to-market equity ratio, can explain
average returns across firms sparked a serious debate among researchers. This finding is controversial because it potentially
invalidates the standard asset pricing theory that expected returns across assets are linear in the betas (i.e., standardized
covariances) of the returns with respect to a few marketwide factors. The Fama-French finding certainly contradicts the wellknown Capital Asset Pricing Model (CAPM), which includes the market portfolio as the only factor. The question is then whether
these two firm-specific variables are proxies for other unidentified systematic factors. To that end, Fama and French (1993, 1996)
construct two factors, one the return on small stocks minus the return on large stocks (SMB) and the other the return on high bookto-market stocks minus the return on low book-to-market stocks (HML). Together with the excess return on the market portfolio
(MKT), the three Fama-French factors pass Gibbons, Ross and Shanken (1989) multivariate test and explain most asset pricing
anomalies. Therefore, Fama and French (1996) interpret the predictability of the two firm-specific variables as being consistent
with the standard beta pricing theory. The three-factor model has since caught on in the literature of empirical asset pricing and it
is treated by many researchers as the empirical representation of the Intertemporal Capital Asset Pricing Model (ICAPM) of Merton
(1973) and of the Arbitrage Pricing Theory (APT) of Ross (1976).
The Fama-French interpretation of the explanatory power of the size and book-to-market effects, however, is not agreed by all
researchers. Lakonishok, Shleifer and Vishny (1994), for example, contend that the explanatory power of a firm's characteristics
comes from the irrational behavior of the investors who overreact to the past performance of the firms and drive the stock prices
away from their fundamental values. When the prices eventually return to their fair values, the reversals are picked up in
regression equations on firm characteristics related to the previously distorted prices. Their empirical evidence is that high book-
☆ I would like to thank Raymond Kan, Robert Savickas, Yexiao Xu, Yong Wang and Guofu Zhou for their helpful comments on an earlier version. The financial
support from the Direct Allocation Grant DAG03/04.BM36 of HKUST is gratefully acknowledged. All remaining errors are mine.
⁎ Corresponding author. Tel.: +852358 7684; fax: +852 2358 1749.
E-mail address: [email protected].
0927-5398/$ – see front matter © 2008 Elsevier B.V. All rights reserved.
doi:10.1016/j.jempfin.2008.10.001
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
307
to-market portfolios are not necessarily riskier than are low book-to-market portfolios. Another challenge to the rationality of the
size and book-to-market effects comes from Daniel and Titman (1997) who use a clever two-way independent sorting of portfolios
by the book-to-market ratio and the beta with respect to HML. They show that average returns vary with the book-to-market ratio
holding the HML beta constant and that average returns do not change with the HML beta holding the book-to-market ratio
constant. The Daniel–Titman result casts doubt on the claim that size and the book-to-market ratio are proxies for the betas of
unidentified factors.
A lingering issue is that, although the Fama-French factors fail the Daniel–Titman test, other mimicking portfolios can be
constructed to explain the size and book-to-market effects.1 The issue then becomes moot because it is impossible to consider all
the ways that mimicking portfolios are constructed.
This paper addresses these issues. Are the size and book-to-market effects proxies for betas with respect to unobserved
systematic factors and how systematic are these factors in terms of the variation they explain in returns? Can factors be
constructed in a way that avoids the difficulty faced by the Fama-French factors? To address these questions, we construct two sets
of principal component factors. The first set of principal component factors is constructed from individual stock returns and the
second set is constructed from one-hundred portfolios sorted by size and the book-to-market ratio. The reason for constructing
two sets of principal component factors is the following. If the forces underlying the explanatory power of size and the book-tomarket ratio are the only systematic factors, or at least the dominant ones, they will then undoubtedly be present in the principal
component factors constructed from individual stock returns. Existing evidence by Brennan et al. (1998) has shown otherwise.
Suppose that the forces underlying the size and book-to-market effects are not dominant factors, but still systematic. It is possible
that these factors may be swamped by other more important systematic factors (in the sense of accounting for return variations) in
the principal component factors extracted from individual stock returns. If portfolios are sorted by size and the book-to-market
ratio, factors unrelated to size and the book-to-market ratio are averaged out and factors related to size and the book-to-market
ratio stand out, if they are indeed systematic. These factors can then be extracted by principal component analysis.
Empirical results confirm this intuition. The betas of individual stock returns with respect to the first ten principal component
factors constructed from individual stock returns have little cross-sectional correlation with size and the book-to-market ratio, so
size and the book-to-market cannot be proxies for the betas with respect to these factors. These principal component factors also
do not explain the size and book-to-market effects in an extended multivariate test. On the other hand, the principal component
factors constructed from the one-hundred portfolios sorted by size and the book-to-market ratio have the following features. First,
the factor structure is more obvious. The first three principal components explain more than 85% of the total return variation in the
one-hundred portfolios, much more than do those extracted from individual stock returns. Second, the factor loadings on the onehundred portfolio returns have clear patterns that are similar to the patterns of Fama-French factors. The loadings of the first factor
are relatively flat, reminiscent of the equally weighted market portfolio. The loadings of the second factor are positive for large
stock portfolios and negative for small stock portfolios, resembling the negative SMB, The loadings of the third factor are positive
for high book-to-market portfolios and negative for low book-to-market portfolios, resembling HML. Despite the resemblance, the
principal component factors extracted from the size- and book-to-market-sorted portfolios avoid the difficulty faced by the FamaFrench factors. They pass the two-way sorting test introduced by Daniel and Titman (1997) and they pass an extended multivariate
test to explain the size and book-to-market effects.
The rest of this paper is organized as follows. Section 1 lays out the basic structure of the beta pricing models. Section 2 presents
the empirical results on the principal component factors extracted from individual stock returns. Section 3 presents the empirical
results on the principal component factors extracted from one-hundred portfolios sorted by the size and book-to-market. Section 4
summarizes the results and discusses the implications.
1. Beta pricing models and econometric tests
Suppose that rt is the vector of returns on N assets in excess of the riskfree rate over the period t − 1 to t, following a stationary
process with mean ν and variance matrix Σ. The beta pricing theory states that the expected excess returns are linear in the betas
of the return with respect to some marketwide factors, ft, that can be traced by factor-mimicking portfolios. The asset pricing
model assumes that the returns are generated from
rt = α + Bft + et ;
Eðet jIt−1 ; ft Þ = 0N ;
ð1Þ
where It − 1 represents the information set at t − 1. Without loss of generality, we can assume that α′B = 0, i.e., the alpha is orthogonal
to the beta matrix. If ft is the excess returns on K factor-mimicking portfolios, the beta pricing theory asserts, in its exact form, that
α = 0N. Therefore,
Ert = BEft :
ð2Þ
The well-known CAPM is an example of a one-factor model in which ft is the excess return on the market portfolio. In the face of
the empirical difficulty for the CAPM, many researchers now resort to multifactor models that do not depend on the criterion of
mean-variance efficiency. Prominent examples of multifactor models include the ICAPM and APT in which the number and the
identity of the factors are unspecified. The essence of the beta pricing theory is that a large cross-section (N) of expected returns is
1
A discussion on this possibility is presented in the conclusion section of Daniel and Titman (1997).
308
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
explained approximately by their betas with respect to a small number (K) of systematic factors. In conditional versions of these
models, the beta matrix can be time-varying.
The ICAPM and APT do not specify what the systematic factors are. In empirical research, there are three major approaches to
constructing systematic factors, other than the return on a market portfolio. The first approach, pioneered by Chen, Roll and Ross
(1986), uses macroeconomic variables as systematic factors. The advantage of this approach is its straightforward interpretation of
the systematic factors, if they are found to be priced. The second approach, adopted by many researchers and exemplified by Fama
and French (1993), is to form factor-mimicking portfolios based on the explanatory power of some firm-specific variables and take
the returns on these portfolios as the systematic factors. The advantage of the second approach is also its easy interpretation of the
constructed factors and its suitability in addressing issues related to the explanatory power of the firm-specific variables. The third
approach uses purely statistical methods to extract systematic factors, relying on the factor structure of the return-generating
process, that is assumed in most asset pricing models. The advantage of the third approach is that extracted factors are guaranteed
to be systematic in the sense of accounting for return variations. The disadvantage is the potential lack of economic interpretation
of the extracted factors.
A widely used methodology in testing whether a given set of systematic factors is priced, i.e., demanding nonzero factor
premiums, is the so-called two-pass cross-sectional regression test in which the betas of the given factors are estimated in time
series and the returns are then regressed on the estimated betas to obtain the factor premiums. For the case in which the
systematic factors, ft, are excess returns on portfolios, a popular methodology to test the beta pricing model is the multivariate test
developed by Gibbons, Ross and Shanken (1989). It tests the null hypothesis, α = 0N, against the general alternative, α ≠ 0N, with a
statistic that is χ2N distributed asymptotically under the null, or FK,N − K distributed in finite sample under the additional normality
assumption.
Suppose that Xt = (x1t,…,xMt) is an N × M matrix of predetermined firm-specific variables and that Xt ∈ It − 1 with the possibility
that x1t ≡ 1. The firm-specific variables are said to explain the cross-section of expected returns if, in the empirically motivated
panel data regression model,
rt = Xt θ + ηt ;
E ηt jXt = 0;
ð3Þ
the vector of the slope coefficients is significantly different from the zero vector. Theoretically, there are two extreme cases in
which firm-specific variables may explain the cross-section of expected returns. The first is that the beta pricing theory is correct
and the firm-specific variables are perfect proxies for the betas of unidentified systematic factors. In this case, the columns of Xt
belong to the space spanned by columns of (possibly time-varying) B for some unspecified ft. The other extreme is α ≠ 0 and Xt lies
in the orthogonal complement of B. In this case, Xt can explain the cross-section of returns because it is cross-sectionally correlated
with α. More generally, Xt can be decomposed into two parts, one in the space spanned by B and the other in its orthogonal
complement, correlated with α. To argue that size and the book-to-market ratio are proxies for betas of systematic factors, Fama
and French (1993) construct the two factor-mimicking portfolios, SMB and HML, and show that the hypothesis that the α with
respect to MKT, SMB and HML equals zero cannot be strongly rejected.
In this paper, an extension to the multivariate test will be used that suits the purpose of examining the consistency between the
explanatory power of firm-specific variables and the beta pricing model. The model takes the form
rt = Xt θ + Bft + et ;
E½et jXt ; ft = 0N ; Var½et jXt ; ft = ∑;
ð4Þ
with the null hypothesis being θ = 0M and the alternative hypothesis being θ ≠ 0M. If the first column of Xt is the vector of ones, and
θ = (θ1,θ′2)′ where θ2 is the vector of slope coefficients, then the null hypothesis and the alternative hypothesis can be revised to
θ2 = 0M − 1 and θ ≠ 0M − 1, respectively. The difference between the extended multivariate test and the original one is that, in the
extended test, the alternative hypothesis is specifically tied to the given firm-specific variables, while in the original test, the
alternative is general and unspecified. The extended test is more powerful than the original one if the mispricing, α, is indeed
related to the given firm-specific variables, but it is less powerful if the mispricing is unrelated to the given firm-specific variables. If
the question is about a given set of firm-specific variables, then the extended multivariate test is more appropriate. Despite the fact
that both time-series parameters, B, and cross-sectional parameters, θ, are involved, the model is linear in B and θ, so that they can
be easily estimated given Σ, while Σ can be estimated in iterations as usual. A hypothesis regarding θ can be tested with a test
statistic that is asymptotically χ2 distributed under the assumptions that the error terms, εt, are conditionally heteroskedastic and
autocorrelated.
2. Principal component factors extracted from individual stock returns
2.1. Features of the principal component factors
The original APT model in Ross (1976) and Huberman (1982) assumes a strict factor structure for the return-generating process
in which firm-specific error terms are uncorrelated with each other. Factor analysis is required to extract factors in such models.
Realizing that it is unnecessary to make such a strict assumption, Chamberlain and Rothschild (1983) develop the APT implication
under an approximate factor model for the return-generating process in which firm-specific error terms may be weakly correlated
with each other, paving the way for principal component analysis to be used to extract factors. Based on the asymptotic APT of
Chamberlain and Rothschild (1983), Connor and Korajczyk (CK 1986, 1988) design a scheme of extracting factors from individual
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
309
Table 1
Principal component factors of individual returns
A. Proportional variances (%)
i
1
1966–1970
33.2
1971–1975
35.8
1976–1980
26.8
1981–1985
17.8
1986–1990
17.1
1991–1995
16.4
1996–2000
16.5
2001–2005
20.2
i
15
1966–1970
1.5
1971–1975
1.3
1976–1980
1.5
1981–1985
1.8
1986–1990
1.8
1991–1995
1.7
1996–2000
1.8
2001–2005
1.8
B. Cross-sectional regression test of
k
1
1966–1970
0.298
1971–1975
0.721
1976–1980
0.002
1981–1985
0.702
1986–1990
0.581
1991–1995
0.246
1996–2000
0.272
2001–2005
0.393
2
4.0
4.9
3.4
4.2
3.5
6.7
6.2
5.0
20
1.3
1.1
1.4
1.6
1.6
1.5
1.4
1.5
3
3.1
3.6
3.0
3.1
2.8
3.0
4.4
3.8
25
1.1
1.0
1.2
1.4
1.4
1.4
1.3
1.3
factor premiums
2
3
0.581
0.780
0.475
0.666
0.001
0.002
0.001
0.002
0.382
0.377
0.017
0.041
0.331
0.414
0.137
0.036
4
2.3
2.7
2.6
2.9
2.6
2.9
4.0
3.1
30
1.0
0.9
1.1
1.2
1.3
1.2
1.1
1.1
5
2.3
2.4
2.2
2.7
2.6
2.6
3.6
2.9
35
0.9
0.8
1.0
1.1
1.2
1.1
1.0
1.0
6
2.2
2.2
2.1
2.6
2.5
2.5
3.3
2.6
40
0.8
0.7
0.9
1.0
1.1
1.0
0.9
0.9
7
2.0
2.0
2.1
2.4
2.3
2.3
2.7
2.5
45
0.7
0.6
0.8
0.9
1.0
0.9
0.8
0.8
8
1.9
2.0
2.1
2.3
2.2
2.3
2.4
2.4
50
0.5
0.5
0.7
0.8
0.9
0.8
0.7
0.7
9
1.8
1.8
2.0
2.2
2.1
2.1
2.2
2.2
55
0.4
0.4
0.6
0.7
0.8
0.7
0.6
0.6
10
1.7
1.7
1.9
2.1
2.0
2.1
2.1
2.2
60
0.3
0.3
0.5
0.6
0.6
0.5
0.4
0.4
4
0.873
0.801
0.003
0.002
0.542
0.059
0.484
0.053
5
0.941
0.823
0.004
0.003
0.437
0.096
0.466
0.083
6
0.467
0.902
0.008
0.007
0.436
0.117
0.584
0.110
7
0.214
0.943
0.009
0.001
0.553
0.178
0.665
0.168
8
0.296
0.957
0.009
0.002
0.516
0.117
0.708
0.228
9
0.386
0.975
0.016
0.002
0.385
0.170
0.515
0.294
10
0.423
0.986
0.025
0.001
0.368
0.232
0.593
0.313
Panel A of this table reports proportional eigenvalues δi / ∑60
j = 1 δj of the second-moment matrix of excess returns. Panel B reports the p-value of the cross-sectional
regression test that the factor premiums of the first k principal component factors are all zero for k = 1,…,10. The calculations are done for each of eight sixty-month
subperiods during 1966–2005.
stock returns where the number of assets is much greater than the number of time-series observations. In this section, we follow
the CK approach to construct the principal component factors from individual stock returns.
Since the CK approach requires a constant number of stocks, it is usually applied to, say, five-year subperiods in the literature.
From the monthly returns of individual NYSE/AMEX/NASDAQ stocks with non-missing data in eight five-year subperiods during
1966–2005, we construct the principal component factors of the returns using the CK methodology. The number of stocks ranges
from 579 in 1966–1970 to 3130 in 1996–2005. Panel A of Table 1 reports the relative eigenvalues of the monthly return variance
matrix, δi / ∑60
j = 1 δj for i = 1,…,10, and, to save space, for i from 15 to 60 in steps of 5. Each number represents the proportion of the
total variation a factor explains in all the stocks that have non-missing returns during the subperiod. Panel B reports the p-values of
the two-pass cross-sectional regression test that the factor premiums associated with the first k principal component factors are
jointly zero, for k = 1,…,10.
The results in Panel A of Table 1 indicate that the first principal component explains 16.4% to 35.8% of the total variation in the
individual stock returns. Each of the remaining factors explains much less, but the proportion of explained variation declines
slowly. The p-values in Panel B show that, except for the 1976–1980 and 1981–1985 subperiods, the factor premiums are not
significantly different from zero. The results in Panel B are exploratory only. It is well known that the two-pass cross-sectional
regression test is not n-consistent and it does not work well when the number of stocks is much greater than the number of timeseries observations.2
We then examine the relations between extracted principal component factors and the firm-specific variables at the level of
individual firms. If size and the book-to-market ratio are proxies for the betas with respect to the major factors, then the crosssectional correlations of size and the book-to-market ratio with the betas of the first few extracted factors should be high. Panel A (B)
of Table 2 reports pairwise correlations between the size (book-to-market ratio) and one factor. The cross-sectional correlations are
calculated for each month and are averaged across months within each of the five-year periods.3 The correlations are low except for
the first factor or for certain periods. The numbers reported in Table 2 are pairwise correlations between the size (or book-to-market
ratio) and one factor. We also calculate multiple correlations between the size (or book-to-market ratio) and a set of factors (not
reported here). These multiple correlations are also low. The low correlations cast doubt on the notion that size and the book-to-
2
The factors extracted using the CK approach are unique only up to a nonsingular transformation, as are the factor betas and factor premiums. Testing the
significance of the premium for each individual factor is meaningless.
3
Alternatively, size and the book-to-market can be averaged across months within each of the five-year periods. One cross-sectional correlation can then be
calculated between the average size (or the book-to-market ratio) and the factor beta of the period. Theoretically, the two ways of calculation are different, but
the actual results turn out to be similar, so the results of the alternative calculation are not reported.
310
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
Table 2
Features of the loadings of principal component factors from individual returns
Factors
3
4
5
6
7
8
9
A. Average correlations
1966–1970
1971–1975
1976–1980
1981–1985
1986–1990
1991–1995
1996–2000
2001–2005
between the betas and SZ
0.51
0.56
0.47
0.45
0.33
0.18
0.13
0.16
0.05
0.33
0.18
0.02
0.24
0.28
0.18
0.29
1
0.25
0.36
0.29
0.43
0.05
0.01
0.22
0.02
0.09
0.04
0.22
0.19
0.19
0.21
0.01
0.03
0.04
0.24
0.02
0.11
0.03
0.00
0.19
0.17
0.11
0.04
0.17
0.04
0.22
0.01
0.17
0.01
0.06
0.08
0.26
0.09
0.07
0.02
0.05
0.05
0.02
0.11
0.12
0.17
0.07
0.12
0.04
0.09
0.00
0.05
0.12
0.00
0.15
0.28
0.09
0.03
0.05
0.06
0.01
0.06
0.06
0.05
0.07
0.12
B. Average correlations
1966–1970
1971–1975
1976–1980
1981–1985
1986–1990
1991–1995
1996–2000
2001–2005
between the betas and BM
0.10
0.14
0.15
0.35
0.10
0.15
0.28
0.01
0.21
0.14
0.03
0.23
0.24
0.02
0.09
0.03
0.28
0.14
0.20
0.05
0.07
0.08
0.06
0.15
0.09
0.08
0.14
0.10
0.03
0.07
0.09
0.04
0.10
0.10
0.03
0.02
0.04
0.09
0.04
0.08
0.12
0.06
0.04
0.16
0.10
0.05
0.03
0.04
0.04
0.04
0.03
0.08
0.02
0.01
0.00
0.04
0.10
0.08
0.13
0.07
0.00
0.05
0.01
0.01
0.09
0.20
0.07
0.07
0.00
0.00
0.10
0.00
0.04
0.01
0.06
0.05
0.06
0.12
0.06
0.06
1.7
0.3
0.7
1.3
0.9
1.4
3.4
8.9
0.3
0.4
0.9
0.2
0.9
0.3
7.1
1.2
0.8
0.2
0.1
1.0
0.5
1.5
0.6
1.5
0.1
0.6
0.7
1.2
0.0
2.2
0.2
0.1
0.0
0.3
0.3
1.6
0.2
0.3
0.4
0.7
0.1
0.9
0.8
0.6
0.1
1.0
1.2
0.2
0.1
1.0
1.5
0.3
0.1
0.9
2.2
0.0
1.0
1.3
0.3
0.3
0.2
0.7
0.7
0.1
0.02
0.03
0.05
0.01
0.05
0.01
0.26
0.06
0.06
0.01
0.01
0.07
0.03
0.08
0.02
0.07
0.01
0.04
0.05
0.08
0.00
0.11
0.01
0.00
0.00
0.02
0.02
0.11
0.01
0.02
0.02
0.04
0.01
0.06
0.05
0.05
0.01
0.05
0.05
0.01
0.01
0.07
0.10
0.02
0.00
0.05
0.10
0.00
0.09
0.10
0.02
0.02
0.02
0.04
0.03
0.01
C. Average beta
1966–1970
1971–1975
1976–1980
1981–1985
1986–1990
1991–1995
1996–2000
2001–2005
43.3
56.2
47.5
37.0
43.4
19.1
33.3
42.8
2
5.4
4.6
1.0
0.2
1.5
23.9
19.0
1.3
D. Average beta divided by standard deviation of beta
1966–1970
1.90
0.30
0.11
1971–1975
2.26
0.21
0.02
1976–1980
2.12
0.06
0.04
1981–1985
1.91
0.01
0.08
1986–1990
2.38
0.07
0.05
1991–1995
0.38
1.01
0.06
1996–2000
0.74
0.66
0.11
2001–2005
1.20
0.05
0.39
10
Panel A of this table presents the average cross-sectional correlations between the betas with respect to the first ten principal component factors extracted from
individual stock returns and the size of the individual firms. Panel B of the table presents the average cross-sectional correlations between the betas with respect to
the first ten principal component factors extracted from individual stock returns and the book-to-market ratio of the individual firms. Panels C and D present the
average beta and the average beta divided by the standard deviation of the beta, respectively. The calculation is done on eight 60-month periods during 1966–2005.
market ratio are proxies for the betas with respect to the factors extracted from individual stock returns. Panels C and D of Table 2
report the average beta and the average beta divided by the standard deviation of the beta, respectively, for each principal
component factor from 1 to 10. With few exceptions, the mean beta of the first factor is different from zero, while other factors have
their mean betas near zero. This pattern and the results in Panels A and B have implications that are discussed later.
2.2. Extended multivariate test
We now conduct a formal test of the consistency between the size and book-to-market effects and the beta pricing model using
the first ten principal component factors extracted from individual stock returns. The testing assets to be used here are the twentyfive portfolios sorted by size and the book-to-market ratio which are used by Fama and French (1993, 1996) and many other
empirical studies. The portfolios' returns and corresponding sizes and book-to-market ratios are obtained from French's website.
The time-series averages of the returns and the time-series standard deviations of the returns are reported in Table 3. For each
month, the cross-sectional average of the portfolios' sizes and book-to-market ratios are subtracted from the corresponding
variables, so that both size and the book-to-market ratio to be used in the extended multivariate test are in excess of their crosssectional means. The demeaned sizes and book-to-market ratios have more stable time-series properties, needed for econometric
evaluation, than do the original sizes and book-to-market ratios.
The twenty-five portfolios sorted by size and the book-to-market ratio are widely used in the literature. Their properties are well
understood. In Panel A of Table 3, we see that except for the five lowest book-to-market portfolios, there is a decreasing pattern
between the average return and size. Holding size constant, there is an increasing pattern between the average return and the book-
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
311
Table 3
Descriptive statistics of twenty-five portfolios sorted by size and book-to-market
A. Time-series means of portfolio returns
Low
Small
0.385
2
0.313
3
0.392
4
0.503
Big
0.392
2
0.893
0.657
0.703
0.525
0.554
3
1.047
0.930
0.711
0.742
0.563
4
1.196
0.943
0.866
0.844
0.615
High
1.451
0.981
1.035
0.839
0.650
B. Time-series standard deviation of portfolio returns
Low
Small
8.778
2
7.930
3
7.282
4
6.343
Big
5.380
2
7.281
6.331
5.698
5.401
4.844
3
6.432
5.577
5.124
5.019
4.662
4
5.970
5.297
4.924
4.858
4.405
High
6.346
5.960
5.583
5.549
4.819
C. Time-series mean of the relative size of portfolios
Low
Small
−2.402
2
−0.898
3
−0.053
4
0.867
Big
2.884
2
−2.365
−0.893
−0.038
0.859
2.591
3
−2.394
−0.880
−0.038
0.870
2.554
4
−2.469
−0.884
−0.038
0.878
2.408
High
−2.737
−0.908
−0.041
0.881
2.247
D. Time-series mean of the relative book-to-market ratio of portfolios
Low
2
Small
−0.567
−0.265
2
0.567
−0.293
3
−0.569
−0.300
4
−0.566
−0.290
Big
−0.578
−0.305
3
−0.062
−0.080
−0.084
−0.081
−0.082
4
0.183
0.158
0.158
0.179
0.159
High
0.906
0.802
0.753
0.745
0.649
This table presents the time-series mean and standard deviation of returns on twenty-five portfolios sorted by size and the book-to-market ratio. The returns are
monthly, measured in percentages. The sizes and book-to-market ratios of these portfolios are relative to their cross-sectional mean. The sample period is 1966.01–
2005.12.
to-market ratio. From Panel B, we see that the standard deviation of the portfolio return is decreasing in size. There is no clear
pattern for the standard deviation with respect to the book-to-market ratio. This fact is used by Lakonishok et al. (1994) to argue that
the book-to-market is not a valid measure of risk. Panels C and D report the demeaned sizes and book-to-market ratios for the
twenty-five portfolios. These two firm-specific variables are used in the extended multivariate test below. Since portfolios are sorted
by them, there is a clear increasing pattern in the numbers in Panel C across size and a clear increasing pattern in the numbers in
Panel D across the book-to-market ratio, but, overall, there are no clear patterns between size and the book-to-market ratio.
To implement the extended multivariate test, we need to determine the set of systematic factors. As we can see, the first factor
accounts for 16.4%–35.8% of the total variation in various periods. It is well known in the APT literature that the first principal
component factor has a strong correlation with the equally weighted market returns. The contribution of the remaining factors is
much smaller. A rule of thumb in empirical studies to determine the number of factors is to choose a K such that the first K eigenvalues account for a large proportion of the total variance while additional eigenvalues do not add much. Practically, however, it is
difficult to determine the cut-off point. There is no consensus in the literature on what should be the number of factors for the US
stock returns. Connor and Korajczyk suggest any number from one to six. Brennan et al. (1998), for example, choose K = 5. Others
use various numbers and check the robustness. In the following study, we consider three different choices: K = 3,5 and 10. We then
conduct an extended multivariate test on the twenty-five size/book-to-market ratio portfolios using no factors and using three sets
of principal component factors. The test pits firm-specific variables against factor betas in explaining average returns in cross
sections. Table 4 reports the results.
The model without factors is just the cross-sectional regression (3) at the portfolio level and the result shows that the returns
are negatively related to the size and positively related to the book-to-market ratio. The slope coefficients are significantly different
from zero. In the next three models with three, five, and ten principal component factors, the size and book-to-market effects
remain. The slope coefficients actually become slightly more significant because, as the number of factors increases, the variance of
the error term in the regression model decreases, and the standard error of the estimator becomes smaller. The results of the
extended multivariate tests indicate that the size and book-to-market effects cannot be explained by the betas of the first ten
principal component factors.
Brennan et al. (1998) examine similar issues as those in this paper and their results should be discussed here. Using a variant of
the two-pass test, they contrast the firm-specific variables with two sets of factors, one the first five principal component factors
extracted from individual stock returns and the other the three Fama-French factors. They note that the principal component
factors and the Fama-French factors are quite different. In the model with principal component factors, they obtain basically the
same result as those obtained in this paper: the size and book-to-market effects are strong in the presence of the principal
312
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
Table 4
Extended multivariate tests of the size and book-to-market effects with factors extracted from individual returns
Model
θ1N
θSZ
θBM
No factor
1.3057
(7.7925)
1.3704
(8.3467)
1.3783
(8.5447)
1.3734
(8.5260)
− 0.1254
(− 4.3040)
− 0.1302
(− 4.5930)
− 0.1328
(− 4.6957)
− 0.1349
(− 4.7803)
0.1516
(2.6964)
0.1590
(2.9610)
0.1549
(2.9016)
0.1655
(3.1325)
Three factors
Five factors
Ten factors
This table reports the result of the extended multivariate test of the size and book-to-market effects in the model without factors
rt = θ1N 1N + θSZ SZt + θBM BMt + et
and in the model with principal component factors
K
rt = θ1N 1N + θSZ SZt + θBM BMt + ∑ bk hkt + et
k=1
where rt is the return on the twenty-five size- and book-to-market-sorted portfolios, SZt and BMt are their sizes and book-to-market ratios, hkts are the principal
component factors extracted from individual stock returns. The number of factors, K, is three, five, and ten. The numbers in parentheses are t-ratios. The sample
period is 1966.01–2005.12.
component factors. In the model with Fama-French factors, the size and book-to-market effects are much attenuated. They merely
report the difference without concluding whether the explanatory power of size and the book-to-market ratio is or is not
consistent with the beta pricing theory. The main conclusion of their paper is that there are many other firm-specific variables,
such as trading volume, dividend yield, stock price level, and past returns, not considered by Fama and French (1992) that can
explain returns and the Fama-French factors do not explain the explanatory power of these additional firm-specific variables.
In this paper, the apparent inconsistency between the size and book-to-market effects and the beta pricing model with the
principal component factors extracted from individual stock returns is the puzzle to be solved. There are two scenarios with which
the results in Table 4 are consistent. The first scenario is that the size and book-to-market effects are indeed inconsistent with the
beta pricing theory in general. They are not proxies for the betas of unidentified factors at all. The second scenario is that the size
and book-to-market effects are consistent with the beta pricing theory, but the corresponding factors (or their linear combinations)
are too small to be included in the first ten Connor–Korajczyk factors extracted from the individual stock returns. To find out which
scenario is more likely the case, we proceed to analyze principal component factors extracted from portfolios that are sorted by size
and the book-to-market ratio.
3. Principal component factors extracted from size and book-to-market sorted portfolios
3.1. Features of the principal component factors
Since the exact number of factors in the individual stock returns is difficult to determine, it is possible that the actual number of
factors is greater than ten and the size and book-to-market effects can be explained by the remaining factors. Adding more factors
to the extended multivariate test turns out to be an ineffective way of determining this. Instead, we choose to extract principal
component factors from size- and book-to-market-sorted portfolios. When portfolios are sorted by certain firm-specific variables,
patterns in the stock returns related to the given firm-specific variables are strengthened, while patterns unrelated to the given
firm-specific variables are canceled out. By working on portfolios sorted by size and the book-to-market ratio and on principal
component factors extracted from the portfolios, we can identify the systematic factors associated with size and the book-tomarket ratio more easily. We can then examine the consistency between the size and book-to-market effects and beta pricing
theory more rigorously in the extended multivariate tests.
We use the one-hundred portfolios sorted by size and the book-to-market ratio, also obtained from French's website. Monthly
time series from January 1966 to December 2005 are used for the analysis. Out of the one-hundred portfolios, four of them contain
no stocks in certain years. These four portfolios are excluded from the analysis. But, for convenience, we continue to call the
remaining portfolios the one-hundred portfolios. From the variance matrix of the one-hundred portfolios, we can construct the
principal components of the returns. Their features are reported in Table 5.
Panel A of Table 5 lists the proportional variances and cumulative proportional variances of the first ten principal components.
The first principal component factor accounts for 75.70% of the total variation in the one hundred portfolios and the first three
principal component factors account for 85.72% of the total variation. The remaining factors are not very important in the sense
that they explain a very small amount of return variation. The factor structure is more obvious for portfolios than for individual
stocks. The pattern of the variances of the principal components indicate that there are common systematic factors in the onehundred portfolios sorted by size and the book-to-market ratio.
Panels B, C and D list the portfolio weights for the first, second and third principal component factors. These weights are of great
interest. The first principal component factor has portfolio weights relatively evenly across all one-hundred portfolios with small
stocks and low book-to-market stocks receiving relatively higher weights. The first factor, therefore, resembles a weighted market
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
313
Table 5
Principal component analysis of 100 size/book-to-market-sorted portfolios
A. Variances of the first ten principal components (%)
i
1
2
Prop. var.
75.70
6.46
Cumu. prop. var.
75.70
82.15
3
4
3.57
85.72
1.07
86.79
5
0.74
87.54
6
0.51
88.05
3
1.34
1.34
1.27
1.22
1.13
1.06
0.99
0.97
0.85
0.81
4
1.28
1.22
1.20
1.11
1.07
1.00
0.97
0.97
0.86
0.76
5
1.19
1.15
1.09
1.08
1.01
0.95
0.95
0.84
0.80
0.75
C. Weights of the second principal component portfolio (%)
Low
2
3
Small
−17.96
− 14.17
−12.48
2
−13.03
−9.39
− 6.00
3
−9.08
−5.82
− 3.73
4
−8.20
−1.58
− 0.80
5
−7.03
−0.59
1.00
6
−5.22
0.27
2.42
7
−3.22
2.38
5.68
8
−1.97
3.67
4.56
9
0.05
4.14
6.29
Big
3.30
5.30
6.33
4
−10.92
−4.65
−0.39
−0.98
2.13
3.94
5.63
6.30
7.74
7.72
D. Weights of the third principal component portfolio (%)
Low
2
3
Small
12.09
9.94
23.56
2
− 35.89
−18.05
− 0.11
3
−47.48
−24.00
−15.19
4
−60.77
−31.31
− 3.24
5
−68.51
− 25.87
−12.11
6
−72.52
− 34.07
−16.77
7
− 74.99
−33.77
−10.75
8
−82.92
−33.15
−25.06
9
−75.37
−41.23
−20.41
Big
−68.29
− 34.52
−30.45
4
32.15
6.57
8.91
−0.99
−4.92
− 11.22
−10.56
−9.54
−15.67
−20.66
B. Weights of the first principal component portfolio (%)
Low
2
Small
1.52
1.38
2
1.64
1.46
3
1.58
1.39
4
1.48
1.26
5
1.46
1.20
6
1.36
1.18
7
1.18
1.10
8
1.19
1.00
9
1.08
0.90
Big
0.81
0.80
7
8
9
0.48
88.53
0.45
88.98
0.39
89.37
10
0.34
89.71
6
1.14
1.08
1.08
0.98
0.96
0.86
0.89
0.92
0.77
0.70
7
1.07
1.08
0.99
0.99
0.91
0.86
0.90
0.83
0.80
0.64
8
1.06
1.05
0.99
0.98
0.92
0.85
0.86
0.80
0.69
–
9
1.06
1.09
1.03
1.04
0.91
0.94
0.93
0.84
0.72
–
High
1.10
1.17
1.14
1.21
1.16
1.05
–
1.02
0.75
–
5
− 9.13
− 2.25
− 0.02
1.86
3.73
4.07
5.09
6.66
8.41
7.81
6
−8.01
−1.76
0.56
1.44
3.55
5.54
6.52
7.62
7.63
7.42
7
− 7.06
− 1.95
1.28
3.39
4.51
5.99
5.03
7.77
7.80
8.67
8
− 6.98
− 0.89
1.79
1.79
4.65
5.39
5.36
7.08
7.91
–
9
−6.81
−1.17
0.90
2.89
5.03
4.72
6.12
6.92
8.09
–
High
−8.35
−1.74
2.27
1.10
2.02
3.60
5
38.26
11.31
9.65
9.26
2.93
4.41
− 1.97
− 3.48
− 7.58
−16.38
6
38.58
26.02
21.58
13.45
10.82
7.64
1.37
−1.06
−3.06
−16.31
7
42.51
25.36
26.30
16.41
17.37
13.26
8.07
5.44
1.47
2.10
8
46.42
26.01
29.86
19.12
23.76
19.90
13.81
15.81
9.42
–
9
45.94
36.65
34.30
27.36
29.46
27.04
26.66
13.55
11.19
–
High
61.69
52.79
47.33
40.36
27.12
43.25
–
30.89
13.71
–
6.85
7.67
–
Panel A reports the proportional variances and cumulative proportional variances of the first 10 principal components extracted from the one-hundred size- and
book-to-market-sorted portfolio returns. Panels B, C and D report the weights of the first three principal component factors.
portfolio where the weights are proportional to the product of the numbers in Panel B and the number of stocks in each of the onehundred portfolios. The second principal component factor has long positions in the big stocks and short positions in the small
stocks. Thus, the second principal component factor resembles the negative of the second Fama-French factor, SMB. It is not exactly
the same as it is tilted toward having more short positions on low book-to-market stocks. The third principal component factor
tends to have long positions in high book-to-market firms and short positions in low book-to-market firms, so it resembles the
third Fama-French factor, HML. The exception is for the smallest size decile portfolios that all receive positive weights. Roughly
speaking, the three principal component factors resemble the Fama-French factors, (MKT, -SMB, HML). There are several
differences between the two. The first is that the principal components are constructed to explain maximum variance among all
uncorrelated unit length linear combinations, while the Fama-French factors are designed to capture expected returns without
taking the variances into consideration. The second is that MKT is value-weighted, while the first principal component here is not.
The third is that MKT, SMB and HML are zero-cost portfolios while the principal component factors are not. The fourth difference is
that, while the Fama-French factors are correlated, the principal components are uncorrelated with each other by design. The tilt
seen from their weights reflects the necessary rotation to achieve zero correlation.
The phenomenon that exactly three systematic factors show up in the one-hundred portfolios and that the second and third
factors are related to size and the book-to-market ratio have something to do with the fact that the portfolios themselves are sorted
by size and the book-to-market ratio and other patterns in the return are smoothed out. For the results in Table 5 to hold, three
conditions are crucial. The first condition is that much systematic risk at the level of individual stocks, except for the systematic risk
similar to the equally weighted average return, has betas uncorrelated with size and the book-to-market ratio at the firm level. The
second condition is that these betas average to zero. These two conditions guarantee that that systematic factors whose betas are
unrelated to the market beta, size and the book-to-market ratio drop out in the size- and book-to-market-sorted portfolios. The
314
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
Table 6
Goodness-of-fit for various multivariate regressions
A. rt = a + BftK + εt
1966–1970
1971–1975
1976–1980
1981–1985
1986–1990
1991–1995
1996–2000
2001–2005
B. rt = a + Bxt + εt
f t3
f t5
f10
t
xt = g t
xt = ht
0.404
0.445
0.316
0.248
0.236
0.250
0.270
0.284
0.450
0.496
0.366
0.305
0.288
0.307
0.347
0.344
0.546
0.594
0.471
0.422
0.399
0.423
0.476
0.464
0.391
0.420
0.298
0.222
0.215
0.171
0.220
0.258
0.126
0.113
0.098
0.070
0.088
0.097
0.143
0.097
C. gt = a + Bf tK + εt
1966–1970
1971–1975
1976–1980
1981–1985
1986–1990
1991–1995
1996–2000
2001–2005
D. ht = a + Bf tK + εt
f t3
f t5
f 10
t
f t3
f t5
f10
t
0.625
0.697
0.465
0.284
0.555
0.342
0.382
0.487
0.719
0.803
0.641
0.293
0.661
0.509
0.455
0.658
0.764
0.882
0.796
0.418
0.727
0.633
0.590
0.703
0.614
0.689
0.451
0.267
0.534
0.336
0.375
0.477
0.710
0.798
0.632
0.277
0.646
0.505
0.448
0.652
0.757
0.879
0.791
0.404
0.715
0.629
0.585
0.697
f t3
f t5
f 10
t
f t3
f t5
f 10
t
0.777
0.671
0.583
0.595
0.514
0.413
0.526
0.679
0.499
0.447
0.396
0.363
0.339
0.282
0.386
0.467
0.259
0.258
0.238
0.202
0.213
0.191
0.220
0.243
0.467
0.344
0.323
0.300
0.273
0.157
0.322
0.376
0.308
0.249
0.234
0.190
0.188
0.128
0.233
0.285
0.162
0.158
0.156
0.113
0.127
0.108
0.138
0.152
E. f tK= a + Bgt + εt
1966–1970
1971–1975
1976–1980
1981–1985
1986–1990
1991–1995
1996–2000
2001–2005
F. f Kt= a + Bht + εt
This table reports the goodness-of-fit for the multivariate regressions of the form yt = a + Bxt + εt, defined as 1 − vε / vy where vy is the sum of the variances of the
components in yt and vε is the sum of the variances of the components in εt. The variables are defined as follows.
rt: individual stock returns.
ftK: first K principal component factors extracted from individual returns.
gt: the first three principal component factors extracted from the one-hundred size- and book-to-market-sorted portfolios.
ht: the second and third principal component factors extracted from the one-hundred size- and book-to-market-sorted portfolios.
results in Table 2 indicate that this is indeed the case. The third condition, of course, is that there are systematic factors, not
necessarily corresponding to small numbers of the principal component factors extracted from the individual stock returns, whose
betas are correlated with size and the book-to-market ratio.4
Before we move to examine the consistency between the explanatory power of the firm-specific variables and the beta pricing
model with factors extracted from size- and book-to-market-sorted portfolios, we compare the two sets of principal component
factors. To reinforce the notion that the forces underlying the size and book-to-market effects are systematic, but they do not
account for the most important variations in individual returns, we calculate a goodness-of-fit measure for various multivariate
regressions of the form yt = a + Bxt + εt. The goodness-of-fit measure is defined as 1 − vε / vy where vy is the sum of the variances of the
components in yt and vε is the sum of the variances of the components in εt, consistent with principal component analysis.
The calculated goodness-of-fit measures are reported in Table 6.
Panel A reports the goodness-of-fit measures for individual stock returns regressed on the principal components factors
extracted from individual stock returns. These numbers are similar to those in Panel B of Table 1.5 Panel B reports the goodness-offit measures for individual stock returns regressed on the principal components factors extracted from the one-hundred size- and
book-to-market-sorted portfolio returns. The goodness-of-fit measures for the first three principal components factors extracted
from portfolios are smaller, but not too much smaller, than those for the first three principal components factors extracted from
individual returns. The main contribution, however, comes from the first principal component factor extracted from portfolios,
which also resemble equally weighted market portfolios. For the second and third principal component factors extracted from
4
It should be noted that the extracted factors from portfolios sorted by certain firm-specific variables may not represent the most important factors in terms of
their contribution to explaining the total variation in the individual returns. They may not be relevant for patterns in the expected returns associated with other
firm-specific variables either.
5
The CK method is based on second moment matrices while the goodness-of-fit measures here are based on variance matrices, causing some slight
differences.
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
315
Table 7
Daniel–Titman test on 100 portfolios sorted by SZ and BM
Average number of portfolios
β
Low
A. All portfolios
Low
BM
High
Average excess returns on portfolios
High
Low
β
High
26.3
5.6
0.1
5.0
19.2
7.8
1.0
7.2
23.8
0.50
0.50
0.06
0.63
0.75
0.72
1.10
0.97
1.03
8.8
2.2
0.0
2.4
4.9
2.7
0.0
2.9
8.1
0.65
0.70
–
1.13
0.77
0.98
–
1.18
0.95
C. Medium stock portfolios
Low
10.1
BM
0.9
High
0.0
1.1
8.0
0.9
0.0
1.0
10.0
0.86
0.56
–
0.61
0.82
0.54
–
0.70
0.92
D. Large stock portfolios
Low
BM
High
1.2
8.2
0.6
0.0
0.8
10.3
0.87
0.66
–
0.92
0.83
0.44
–
0.41
0.91
B. Small stock portfolios
Low
BM
High
9.9
1.1
0.0
This table reports the test of Daniel and Titman (1997) using the returns on portfolios independently sorted by book-to-market (BM) and the beta (β) with respect
to the third factor extracted from the returns on the one-hundred portfolios sorted by size and the book-to-market ratio. The analysis is conducted for all portfolios
(Panel A) and each of three size groups (Panel B for small stock portfolios, Panel C for medium stock portfolios, and Panel D for large stock portfolios).
portfolios, the explanatory power is substantially lower. This is the main evidence that the factors that track size and book-tomarket effects in returns are not major factors in individual stock returns. Panels C and D report the goodness-of-fit for the
regressions of the principal component factors extracted from the portfolio returns on the principal component factors extracted
from the individual stock returns. As we see, these goodness-of-fit numbers tend to be large. This means that part of the forces
underlying size and the book-to-market ratio is contained in the first ten principal component factors extracted from individual
stock returns, but they are scattered and are not strongly associated with any of the factors. Panels E and F report the goodness-offit for the regressions of the principal component factors extracted from individual stock returns on principal component factors
extracted from the portfolio returns. Two patterns show up. The explanatory power of the second and third principal component
factors extracted from the portfolio returns is much smaller than that of the first three principal component factors extracted from
the portfolio returns. As the number of factors extracted from individual stock returns on the left-hand side increases from three to
ten, the explanatory power of the factors extracted from the portfolios declines quickly. These patterns reflect the fact that much of
non-market factors in individual stock returns is canceled out in the size- and book-to-market-sorted portfolios and, as a result, the
factors extracted from portfolios have low explanatory power for the factors extracted from individual stock returns, except for the
common market factor.
3.2. Firm-specific variables vs. factor betas
Since the weights of the principal component factors have patterns related to the firm-specific variables, we can conduct some
tests comparing the role of firm-specific variables and factor betas. We pay more attention to the book-to-market ratio which is at
the center of the debates in the recent literature.
We first conduct an informal test, introduced by Daniel and Titman (1997), using portfolios independently sorted by the bookto-market ratio and the beta with respect to the third principal component from the one-hundred portfolios. The procedure is as
follows. For each year from 1966 to 2005, the one-hundred portfolios are sorted by their book-to-market ratio and are divided into
three equal groups. The one-hundred portfolios are also sorted by their beta with respect to the third principal component factor
and divided into three groups.6 These two sets of portfolios are sorted independently. For each of the nine intersections of the two
groups, monthly portfolio returns are averaged within the year and within the intersection. Also calculated are the number of
portfolios in each intersection for a given year. These annual numbers are then averaged over the forty years in the sample period.
The results are in Panel A of Table 7. We also conduct the same analysis for three equally divided size groups and the results are in
Panels B, C, and D.
Before we discuss the results, we note the differences between the analysis conducted here and that in Daniel and Titman
(1997). The first difference is that the beta they use is with respect to the Fama-French factor, HML, while the beta used here is with
respect to the third principal component factor extracted from the one-hundred portfolios. Their HML beta is estimated in rolling
regressions using return data in the previous three years, while the beta used here is the unconditional beta. The second difference
6
We divide them into three beta groups only, rather than five as Daniel and Titman do, because we do this for portfolios rather than for individual stocks.
Finely divided groups may end up with too few portfolios in many groups.
316
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
Table 8
Extended multivariate tests of the size and book-to-market effects with factors extracted from size- and book-to-market-sorted portfolio returns
Model
θ1N
θSZ
θBM
First factor
− 0.0026
(− 0.3090)
0.8123
(6.2751)
0.7482
(5.1590)
0.0023
(0.4712)
0.0023
(0.3567)
0.1791
(2.4731)
0.0003
(0.0555)
−0.0417
(−1.5445)
−0.1168
(−6.7447)
0.0124
(0.8248)
−0.0149
(−2.5109)
0.0048
(0.3308)
−0.0200
(−2.6527)
−0.0024
(−0.8701)
0.1459
(2.6207)
0.0643
(1.3340)
−0.0197
(−0.6292)
0.1180
(2.5040)
0.0280
(0.9575)
−0.0311
(−1.3363)
0.0164
(1.2164)
Second factor
Third factor
First & second factors
First & third factors
Second & third factors:
First three factors
This table reports the result of the extended multivariate test of the size and book-to-market effects in the model
rt = θ1N 1N + θSZ SZt + θBM BMt + ∑ bk hkt + et
k
where rt is the return on the twenty-five size- and book-to-market-sorted portfolios, SZt and BMt are their sizes and book-to-market ratios, respectively. hkts are
principal component factors extracted from the one-hundred size- and book-to-market-sorted portfolios. The numbers in parentheses are t-ratios. The sample
period is 1966.01–2005.12.
is that they sort individual stocks according to their beta with respect to HML, while we sort portfolios according to their beta with
respect to the third principal component factor. Even for the same factor, these two approaches may yield different groups.
The results reported in Table 7 are interesting and can be contrasted with those in Daniel and Titman (1997). First, from the left
panels, we see that the portfolios tend to fall in intersections on the main diagonal lines. There are few portfolios that have low
book-to-market ratios and high betas. Likewise, there are few portfolios that have high book-to-market ratios and low betas. The
sorting by the book-to-market ratio and the sorting by the beta with respect to the third principal component from the onehundred portfolios are in agreement. This feature is anticipated if the book-to-market ratio is a proxy for the beta with respect to a
systematic factor. In Daniel and Titman (1997) where the factor is HML and the sorting is done for individual stocks, this feature is
absent. The presence of this feature here bodes well for the results on average returns in the right panels. In Panel A on the right,
the average return no longer increases with the book-to-market ratio, but it increases with the beta. In Panels B, C and D, the
pattern is less clear, probably because there are fewer portfolios in each intersection of the groups. At least, the pattern that the
average return increases with the book-to-market ratio instead of the beta, observed by Daniel and Titman (1997) for HML, does
not emerge. The results in this table show that with a right factor and at a more aggregated level, the book-to-market effect is
consistent with the beta pricing theory.
We then conduct a formal test of the consistency between the size and book-to-market effects and the beta pricing model with
the principal component factors from the one-hundred portfolios. Table 8 contains the results of the extended multivariate test on
the twenty-five size- and book-to-market-sorted portfolios with various combinations of the principal component factors
extracted from the one-hundred size- and book-to-market-sorted portfolios.
From the table, we see that for models with first and/or second factors, the size and book-to-market effects remain strong, but
for the model with three factors, or the mode with just the third factor, the slope coefficients of the firm-specific variables become
much smaller and their t-ratios indicate that they are no longer significantly different from zero. The size and book-to-market
effects can indeed be explained by the betas of the three principal component factors extracted from the one-hundred size- and
book-to-market-sorted portfolios.
4. Conclusion
This paper deals with the difficulty that the Fama-French factors face in reconciling the size and book-to-market effects with the
beta pricing theory. We construct principal component factors from both individual stock returns and returns on portfolios sorted
by size and the book-to-market ratio. As has been shown in the previous literature, principal component factors extracted from
individual stock returns fail to explain the size and book-to-market effects. We also show that the individual stocks' betas with
respect to these factors are poorly correlated with the size and book-to-market ratio at the firm level. The principal component
factors extracted from the one-hundred size- and book-to-market-sorted portfolios tell a different story. The one-hundred sizeand book-to-market-sorted portfolios have a more obvious factor structure. The factor loadings are clearly related to size and the
book-to-market ratio, in a way resembling those of the Fama-French factors. The empirical performance of these factors is better
than that of the Fama-French factors. They pass the Daniel–Titman test using two-way independent sort by the book-to-market
and the beta of the third factor. They also pass the formal extended multivariate test.
C. Zhang / Journal of Empirical Finance 16 (2009) 306–317
317
The different results for the two sets of principal component factors rise from the following scenario. The forces underlying the
size and book-to-market effects are systematic, but they do not account for much of the return variation at the firm level and they
do not manifest themselves in the principal component factors extracted from individual stocks returns. In portfolios sorted by size
and the book-to-market ratio, most variation in returns unrelated to size and the book-to-market ratio is averaged out and
systematic factors underlying the size and book-to-market effects show up.
The main conclusion of this paper is that the size and book-to-market effects are consistent with the beta pricing theory,
although the systematic factors underlying these effects are not the most important ones in the sense of accounting for return
variation. Beside the main conclusion, the results from this paper have some implications for empirical asset pricing studies. The
twenty-five size- and book-to-market-sorted portfolios are often used in empirical studies as testing assets and the Fama-French
factors have been used as a yardstick in the empirical asset pricing literature against which the performance of other proposed
systematic factors is judged. In view of the results presented in this paper, these practices are unwarranted if the issues under study
are unrelated to the size and book-to-market effects.
References
Brennan, M., Chordia, T., Subrahmanyam, A., 1998. Alternative factor specifications, security characteristics, and the cross-section of expected stock return. Journal
of Financial Economics 49, 345–374.
Chamberlain, Gary, Rothschild, Michael, 1983. Arbitrage, factor structure and mean-variance analysis on large asset markets. Econometrica 51, 1281–1304.
Chen, N., Roll, R., Ross, S.A., 1986. Economic forces and the stock market. Journal of Business 59, 383–403.
Connor, Gregoey, Korajczyk, Robert, 1986. Performance measurement with the arbitrage pricing theory: a new framework for analysis. Journal of Financial
Economics 15, 373–394.
Connor, Gregoey, Korajczyk, Robert, 1988. Risk and return in an equilibrium APT: application of a new testing methodology. Journal of Financial Economics 21,
255–289.
Daniel, Kent, Titman, Sheridan, 1997. Evidence on the characteristics of cross-sectional variation in stock returns. Journal of Finance 52, 1–33.
Fama, Eugene, French, Ken, 1992. The cross-section of expected stock returns. Journal of Finance 47, 427–465.
Fama, Eugene, French, Ken, 1993. Common risk factors in the returns on stocks and bonds. Journal of Financial Economics 33, 3–56.
Fama, Eugene, French, Ken, 1996. Multifactor explanations of asset pricing anomalies. Journal of Finance 51, 55–84.
Gibbons, M., Ross, S., Shanken, J., 1989. A test of the efficiency of a given portfolio. Econometrica 57, 1121–1152.
Huberman, Gur, 1982. A simple approach to arbitrage pricing theory. Journal of Economic Theory 28, 183–191.
Lakonishok, J., Shleifer, A., Vishny, R., 1994. Contrarian investment, extrapolation, and risk. Journal of Finance 49, 1541–1578.
Merton, Robert, 1973. An intertemporal asset pricing model. Econometrica 41, 867–887.
Ross, Stephen, 1976. The arbitrage theory of capital asset pricing. Journal of Economic Theory 13, 341–360.