The influence of social forces: evidence from the behavior of football referees.
Dohmen, Thomas J.
I. INTRODUCTION
Do social forces affect decisions or actions of individuals? And if
so, under what circumstances do social influences determine
socioeconomic behaviors? These are important questions for economists to
answer. Sociologists and social psychologists widely acknowledge that
individuals' decisions are not only governed by their material
payoffs but also influenced by nonmaterial social payoffs that arise in
the decision makers' social environment, for example, in the form
of social approval of social sanctions, as argued, for example, by Asch
(1951) or Coleman (1990). This type of social pressure can cause
individuals to make decisions that accommodate the preferences of a
social group even if they are not in accordance with the decision
maker's own interest. I refer to this view as the "social
pressure hypothesis."
Economists have built on the idea that social forces might affect
individual behavior and have developed models of social interaction in
which the quest for social rewards or the avoidance of social sanctions
can explain adherence to social custom as, for example, in Akerlof
(1980) as well as the evolution and persistency of social norms as in
Bernheim (1994). (1) In these models, social payoffs, which comprise
social approval or sanctions, become an argument in the utility function
in addition to intrinsic consumption utility, which captures material
payoffs. Several other scholars (e.g., Becker and Murphy 2000) have used
such a framework that generalizes an individual's utility function
to internalize social payoffs to analyze how social influences affect a
variety of socioeconomic behaviors, consumption, status, as well as the
evolution of norms, fads, and fashions. (2)
Despite much theoretical progress, there is little empirical
evidence that convincingly assesses the role of social influences, not
least because data that meet the necessary requirements are scarce. (3)
A useful type of data would record how a decision maker behaves in
different social environments, where different social groups have
well-defined, and potentially conflicting, interests that are not
aligned with the interests of the individual decision maker. Garicano,
Palacios-Huerta, and Prendergast (2005) exploit the fact that soccer
matches offer such a setting and provide one of the few pieces of
empirical evidence. In a soccer match, it is in the referee's
private interest to be impartial while fans in each camp derive utility
from their team's success and therefore have an interest to work
toward their common goal by sanctioning referee decisions that do not
favor their preferred team and by approving favorable decisions.
Analyzing data from two seasons of the Spanish premier soccer league,
the Primera Division, Garicano, Palacios-Huerta, and Prendergast (2005)
detect that Spanish referees favor the home team by prolonging the match
by almost 2 min when the home team is one goal behind at the end of
regulation time compared to the situation in which the home team is
leading by one goal. They also investigate whether crowd size and the
ratio of attendance-to-capacity matters and find that a one standard
deviation increase in attendance increases the bias by about 20%, while
a higher attendance-to-capacity ratio reduces the bias. They conclude
that nonmonetary incentives, in particular social pressure from the
crowd, cause the preferential treatment. (4) Sutter and Kocher (2004)
report corroborative findings based on data from the 2000 to 2001 season
in the German premier soccer league (1. Bundesliga), although they do
not assess whether crowd attributes affect the magnitude and
significance of home-biased refereeing.
This paper provides complementary evidence of referee bias based on
data from 3,519 games of the 1. Bundesliga, which supports the view that
the social environment can affect individual's decisions. The
empirical analysis that professional referees, who are appointed and
paid by the German Football Association (DFB) and are expected to be
impartial, in fact systematically favor the home team. Favoritism is
manifested in stoppage time decisions and in decisions to award goals
and penalty kicks. The data also provide new evidence that crowd
characteristics such as crowd composition and distance to the soccer
field impair referees' decisions in a way that is consistent with
the social pressure hypothesis, that is, that social forces influence
individual behavior. I find that the size of the bias depends on the
composition of the crowd: the home bias tends to be smaller when more
supporters of the visiting side attend the match. This is consistent
with the idea that social approval and social sanctions have
countervailing effects on net social rewards. We expect supporters of
each side, who have the common interest that their preferred team
achieves success, to work toward this common goal by acclaiming
favorable decisions of the referee and by expressing dissatisfaction
with unfavorable referee decisions. Referees' decisions hence evoke
social approval from supporters of the favored team and social sanctions
from the opponent side. A referee who is not inherently biased, that is,
who does not derive intrinsic utility from a particular match outcome
and values social payoffs, is expected to weight the social costs and
benefits.
Strikingly, home team favoritism is found to be stronger when the
match takes place in a stadium without a running track, that is, when
the crowd is physically closer to the field and to the referee, in which
case the intensity of social pressure is arguably higher. (5) This
finding lends support for the conjecture that social forces influence
the referees' decision, be it because social pressure from the
crowd directly triggers-biased refereeing or because of a more oblique
channel, in which, for example, the crowd creates an atmosphere that
encourages the players on the field to exert pressure on the referee.
Since the nature of biased refereeing is affected by the crowd's
proximity to the field, we can dismiss an alternative mechanism that
leads to home-biased decisions, namely that the DFB condones the
preferential treatment of home teams of even instructs its referees to
favor home teams. Even a soccer association with strong preferences for
nondiscriminatory competition might rationally accept home-biased
refereeing as long as the referees' preferential treatment was
exactly the same for all home teams such that the resulting home
advantage would balance out over the season and therefore would not
affect the outcome of the championship. This could be an optimal policy
to maximize gate revenues if attendance was boosted when the home team
is more likely to win, as Garicano, Palacios-Huerta, and Prendergast
(2005) discuss. However, given the fact that teams who play in stadiums
with an athletics track are affected differently than teams who play in
a stadium without a track, this mechanism is implausible.
In fact, the DFB monitors the quality of refereeing and sacks a
referee if he is detected to be biased so that being partial lowers
reappointment probabilities. (6) Since remuneration amounted to more
than 3,000 Euros (~US$4,000) per match at the end of the observation
period, biased refereeing entails substantial expected pecuniary losses
for umpires. This suggests that referees are induced to favoritism by
social forces, although they have a strong intrinsic motivation for
impartiality. In summary, the findings indicate that social groups can
influence individuals to work toward achieving an outcome desired by
that group (and to adhere to the groups' social norm) even if
individuals do not derive utility from the outcome and adherence to the
norm is diametrically opposed to their own private interest.
The remainder of the paper is organized as follows. Section II
provides some institutional background and describes the data. Section
III presents the empirical results and Section IV concludes.
II. BACKGROUND AND DATA
The data were made available for scientific use by IMP--Innovative
Medientechnik und Planung A G, a company that maintains the official
soccer database of the DFL Deutsche Fussball Liga GmbH, the German
soccer league association, and cover all 3,519 matches that were played
in the German premier soccer league (1. Bundesliga) since the start of
the season 1992/1993 until the first half of the season 2003/2004). IMP AG sends several observers to each league match who record about 2,000
actions per match, including all goals, shots on goal, tackles, passes,
corner kicks, every single ball contact, yellow and red cards, as well
as the number of injury treatments on the field. Various match
statistics are provided separately for both periods of the match,
including the amount of stoppage time in each half. The data also record
the date, destination, and outcome of the match, the number of
spectators, and the referee's identity.
The I. Bundesliga consists of 18 teams that compete for the
national soccer championship. Teams play each other twice a season, once
during rounds 1-17 and once during rounds 18-34 when the status of home
and visiting team is reversed. (7) The outcome of a match (i.e., a win,
a draw, or a loss) determines the number of points that are allocated to
the teams: no points are allocated to the loser of a match, while each
team receives one point in case of a draw. The reward for winning a
match was raised from two to three points at the start of the season
1995/1996. The accumulated number of points during a season determines
the league ranking, and the team that finishes the season at the top of
the ranking (i.e., the team that has accumulated most points) wins the
German soccer championship. (8) The three lowest ranking clubs of the
championship table are relegated to the second division (2. Bundesliga)
and are replaced by the three highest ranking teams from the second
division. (9)
The referees for 1. Bundesliga matches are appointed by the DFB and
receive a piece rate for refereeing. This piece rate has risen over
time: from July 1992 (i.e., at the start of the observation period)
until July 1997, referees received 2,500 German Marks (DM) per match;
from August 1997 until July 2000, they were paid 4,000 DM per match; and
since August 2000, the reward is 6,000 DM or 3,067.75 Euros per match.
In addition, travel expenses (including hotel and transportation) have
been fully covered during the entire period since 1992. Bundesliga
referees are experienced and have been selected in sequential promotion
tournaments. After having passed a written and a physical test, referees
typically start in the lowest division. Once they have been promoted to
referee in the Landesliga, the sixth division, they can be promoted at
most one division each year if judged as qualified by official
observers. In total, 73 umpires have refereed at least 1 of the 3,519
games in our sample. The number of referees in the 1. Bundesliga has
been limited to 22 since 1995. Previously, the DFB had appointed up to
36 referees each season.
The performance of referees is monitored and judged by an official
observer of the DFB referee committee, who attends the match in the
stadium and fills a performance evaluation form afterward. The first
item in this evaluation addresses the referee's appearance during
the game, in particular whether the referee was decided, secure, and had
the courage to take unpopular decisions, or whether he was impressed by
complaining players. (10) The second and third points of the evaluation
form concern how well the referee interpreted and implemented the Laws
of the Game and additional instructions. This covers the adequacy of the
referee's allowance for time lost (see Linn 2003).
Information on the frequency of match interruptions for injury
treatments on the field makes it possible to control for the
"appropriate" amount of stoppage time, which is a key
determinant of stoppage time according to the official rules, the Laws
of the Game, which are authorized by the International Football
Association Board and established by the Federation Internationale de
Football Association. According to Law 7 of the Laws of the Game
(Federation Internationale de Football Association 2002, 19),
"[a]llowance is made in either period for all time lost through:
substitution(s), assessment of injury to players, removal of injured players from the field of play for treatment, wasting time, any other
cause. The allowance for time lost is at the discretion of the
referee." Data on the number of corner kicks, goal kicks, and
yellow or red cards allow to control for events that according to Linn
(2003) might result in "wasting time" by impeding a quick
restart of play.
Information on the correctness of decisions to award penalty kicks
is available for all 857 penalty kicks that were called since the start
of the season 1993/1994 (i.e., when IMP started to collect this
information), while there is information on decisions to award goals for
all but 3 of the 10,166 goals that had been awarded since the start of
the season 1992/ 1993. Data on decisions not to award a goal have only
been collected since 1993 and cover all but one of the 463 situations in
which a goal was not accepted. Information on the decision not to award
a penalty kick is only recorded since the start of the season 1998/1999.
Until January 2004, 892 critical situations, in which suspicious action
took place in one of the penalty areas, had been evaluated. Specialists
of IMP AG assess the correctness of these decisions after each match,
relying on video recordings and considering about 30 pieces of detailed
information in their judgment. A decision is classified as
"correct" if the evaluators consent that it was correct.
Likewise, a decision is labeled "wrong" if they agree that it
was wrong, and the decision is classified as "disputable" if
their verdict is not unanimous.
III. RESULTS
The empirical analysis starts with an investigation of stoppage
time decisions, which proceeds in three steps. Using data on 12 soccer
seasons and including additional control variables for match
interruptions like the number of fouls and the number of actual injury
treatments on the field, I first replicate the analysis by Garicano,
Palacios-Huerta, and Prendergast (2005) to assess whether there is
evidence for referee bias in the German data. If the social pressure
hypothesis is correct, we expect that referees, in the pursuit of social
rewards from the home crowd, award more stoppage time when the home team
is behind in score. We also expect this bias to be stronger when more is
at stake, for example, when the score margin is close or when second
half rather than first-half stoppage time is concerned. In these cases,
the expected marginal effect of additional time on the match outcome is
larger, and consequently, crowd pressure is likely to be stronger. A
model of social interaction, in which material consumption and social
payoffs are substitutes (e.g., Bernheim 1994), predicts that deviation
from intrinsic goals is less pronounced when social rewards become
relatively less important. In a second step, I analyze whether the
composition of the crowd matters, an issue that Garicano,
Palacios-Huerta, and Prendergast (2005) also addressed. Since supporters
of the home team and the visiting team have conflicting goals, we should
expect a larger share of visiting team supporters to attenuate the home
bias if the social pressure hypothesis is correct because one
group's social approval countervails the other group's social
sanctions.
In the third step, I study a new aspect of crowd attributes, namely
whether physical proximity affects the size of the bias. Given the
presumption that the intensity of crowd pressure is higher if the
distance between the crowd and the referee is smaller, we hypothesize that the home bias becomes stronger when the crowd is closer to the
field. Since stoppage time decisions account for only one way among many
others in which the referee can favor a particular team, I investigate
referees' impartiality concerning another crucial set of decisions
in the game, namely decisions to approve goals and to award penalty
kicks.
A. Stoppage Time Decisions
The first piece of evidence for biased stoppage time decisions is
presented graphically in Figure 1, which plots kernel density estimates
of second-half stoppage time distributions conditional on the score
difference (defined as the number of goals scored by the home team minus
the number of goals scored by the visiting side). (11) It is apparent
that the amount of stoppage time awarded at the end of 90 min regulation
time depends on the score difference and on the identity of the team.
Most stoppage time is awarded in close matches when the score margin is
1. Referees also seem to use their discretionary power to systematically
award more stoppage time to the home team in close matches as the
estimated stoppage time distribution conditional on the home team being
one goal behind stochastically dominates all other estimated conditional
stoppage time distributions. (12)
When the match is decided at the end of regulation time, that is,
when one team is ahead by two goals or more, the amount of stoppage time
does not depend on the identity of the leading team and less time is
added. Interestingly, supporters have much weaker incentives to
influence the referee in decided matches in which the ultimate match
outcome is unlikely to change during stoppage time. Hence, the raw data
are consistent with the hypothesis that referee's decisions are
affected by social forces. In the following analysis of stoppage time
decisions, I concentrate on close games to further investigate the
forces that lead referees to make biased decisions.
[FIGURE 1 OMITTED]
Estimates from ordinary least squares regressions reported in Panel
A of Table 1 confirm the descriptive results that more stoppage time is
granted when the home team is one goal behind than in matches in which
the home team is one goal ahead at the end of second-half regulation.
The home bias in awarding stoppage time, which is measured by the
coefficient on the "Home Ahead" dummy that takes the value of
1 if the home team is one goal ahead and 0 if the home team is one goal
behind at the end of the first half, amounts to about 22 sec of
additional stoppage time. (13) This difference in stoppage time is
statistically significant at the 1% level, even controlling for factors
that should affect the amount of stoppage time according to the Laws of
the Game, such as the number of injury treatments, substitutions, and
cards (Column 1). Actual injury treatments on the field have the
expected positive effect on the duration of stoppage time, but the
effect is small; referees add about 6 sec of extra time for each time a
player has been yellow carded. (14) The home bias remains statistically
significant and its size is virtually unchanged when allowing for
season-specific effects and controlling for other potentially
confounding factors such as the relative number of shots on goal, fouls,
corner kicks, crosses, and other proxies for relative strength (Table 1,
Column 2, and table notes for details) as well as team- and
referee-fixed effects (Column 3). (15)
Pressure from the crowd is arguably less intense at the end of the
first half when fans might still have hope and faith that their team at
least levels the score in the second half. This gives rise to the
hypothesis that first-half stoppage time decisions are less biased, but
that delays of the match, which should be compensated for by stoppage
time according to the official rules, have the same impact on the length
of stoppage time in both halves. The regression results in Panel B of
Table 1 support this hypothesis: the home bias in awarding stoppage time
is generally much smaller in the first half compared to the second hall.
Explanatory variables that capture match delays, in contrast, have
similar effects in both halves. (16)
The systematic differential treatment of home and visiting teams in
second-half stoppage time decisions is less pronounced in the German
Bundesliga than in the Spanish Primera Division, where home teams are
granted about 113 sec more of stoppage time in close matches (Garicano,
Palacios-Huerta, and Prendergast 2005). Home teams might be favored less
in Germany because relatively more visiting supporters accompany their
team to away matches in Germany, where the opponents' home cities
are typically geographically less distant than in Spain. This conjecture
entails that countervailing social forces mitigate referee bias. Since
the data do not contain information about crowd composition, I construct
two proxy variables that exploit the fact that supporters are more
likely to accompany their team to away marches the shorter the traveling
distance is. The first variable proxies the traveling distance of fans
who live close to their home team's city by the linear distance
between opponents' home cities and is calculated based on
coordinates. (17) Fans living close to their preferred team's home
city account for a substantial part of supporters, as is evident by the
geographical distribution of fan clubs of German soccer teams. The
second proxy variable is a measure of relative popularity and accounts
for traveling distance of another group of visiting team's
supporters, namely those who live close to the away match location (and
potentially further away from their team's home city). Since more
popular teams have more fans nationwide, the number of fans who live
close to any match location is positively correlated with a team's
popularity, which I measure by the attendance-to-capacity ratio in away
matches. (18) The relative popularity index is calculated as the ratio
of the average attendance-to-capacity ratio in the visiting team's
away marches to the average attendance-to-capacity ratio in the home
team's home matches. Since the linear distance and the
attractiveness of the visiting team have a positive and statistically
significant effect on match attendance, we can be confident that the two
proxy variables capture variation in the composition of the crowd. (19)
To assess whether the size of the home bias depends on the
composition of the crowd, I estimate the regression model from Column 3
of Table 1 separately for matches in which both teams come from cities
that are less than 150 km apart and cities that are further apart. At
the same time, I augment the regression model with an interaction term
between the score difference and a dummy variable that equals 1 if the
visiting team's relative popularity is in the upper third of the
distribution. The regression results in Table 2 reveal that the home
bias is negligible and statistically not different from zero if the
distance between the cities is less than 150 km, unless the visiting
team is relatively popular (see Column 1). When the cities are further
than 150 km apart, the home bias (about 24 sec of additional stoppage
time when the home team is behind in score) is statistically significant
(see Column 2). (20) These findings support the social pressure
hypothesis.
To investigate next whether the intensity of social pressure, as
measured by distance between the crowd and the field, affects the size
of the bias, I exploit the fact that some teams play their home matches
in stadiums in which an athletics track separates the stands from the
field, while other teams play their home matches in stadiums without
such a running track. Table 3 reports separate regressions for matches
that take place in stadiums with a running track and for matches that
take place in stadiums without a track. The regressions include controls
for crowd size in addition to the set of explanatory variables of
Specification (3) in Table 1. The variable of interest is the
"Score difference," which equals 1 if the home team is one
goal ahead and -1 if it is one goal behind. The results indicate that
the estimated difference in stoppage time awarded in close matches
amounts to almost 1 min (twice the coefficient on Score difference) of
additional stoppage time when the match takes place in a stadium without
a running track (Column 1). However, when the match takes place in a
stadium in which a running track separates the stands from the field,
this effect is much weaker and statistically not significant. This
finding indicates that physical distance extenuates the bias in stoppage
time decisions, suggesting that the intensity of crowd pressure is an
important determinant of referee favoritism. (21) This empirical result
provides additional new support for the hypothesis that social forces
affect referees' decisions.
B. Decisions to Award Goals and Penalty Kicks
Since goals and penalty kicks have a much more immediate impact on
the ultimate match outcome than additional playing time, I examine next
whether referees also favor home teams in decisions to award a goal or a
penalty kick. I begin by investigating the correctness of goal and
penalty kick decisions when a goal or a penalty kick was awarded. I then
also take into account a second type of misjudgment, namely not granting
a goal or a penalty kick when it should be awarded, by also considering
critical situations that could have lead to a goal or a penalty kick.
The raw data on ratings of correctness of referee judgments,
summarized in Table 4A, indicate that decisions to grant a goal are more
likely to be wrong or disputable when the goal is awarded for the home
team: 95.99% of goals scored by the visiting team, but only 95.05% of
goals scored by the home team are rightly awarded. This 0.94 percentage
point difference is statistically significant at the 5% level and
considerable in magnitude since it implies that home teams were awarded
57 more goals based on wrong or disputable decisions. Remarkably,
granted goals are less likely to be correctly awarded when a team is
behind in score, especially when the home team is behind. The home team
is particularly likely to be granted a goal based on a wrong or
disputable decision if it is behind by one or two goals.
Referees also seem to favor home teams in penalty kick decisions
(see also Sutter and Kocher 2004). The raw data reveal that a smaller
fraction of penalties kicks for the home team is rightly awarded (65.20%
vs. 72.57%, see Table 4B). (22) Observed differences in the frequencies
of wrong, correct, and disputable decisions are statistically
significant. (23) Again, the fraction of wrong or disputable decisions
in favor of the home team is largest when the home team is behind in
score. However, it must be noted that referees also make more disputable
decisions in favor of the visitor, when the visitor is just one goal
behind.
Probit estimates (not reported here) confirm that a correct goal
decision is significantly less likely when the home team has scored.
Interestingly, referees' decisions to award a goal are
significantly less likely to be correct when the match takes place in a
stadium without a running track that separates the stands from the
field. Likewise, a probit model for the probability that a penalty kick
was correctly awarded reveals that a correct judgment is 10% less likely
when the game takes place in a stadium without a track. (24)
The evidence presented so far ignores situations in which a goal or
a penalty kick was not awarded. Information on the second type of
decisions is available since the start of the season 1993/1994 for goal
decisions and only since the start of the season 1998/1999 for penalty
kick decisions. Including these situations, we can classify goals and
penalty kicks as legitimate, disputable, of illegitimate in the
following way: goal (penalty kick) that should be awarded according to
the rules is a legitimate goal (penalty kick); illegitimate goals
(penalty kicks) should not be awarded; and disputable goals (penalty
kicks) comprise goals (penalty kicks) for which there is no consensus
regarding their legitimacy.
Table 5 summarizes referees' goal and penalty kick decisions
for home and visiting teams conditional on legitimacy. The first row of
the upper panel shows that there is no indication that referees award
the visitor fewer legitimate goals. There is tentative but no
statistically significant evidence that referees grant the visiting team
fewer disputable goals, but the home side is granted significantly (p =
.07) more illegitimate goals than the visiting side. (25) The lower
panel of the table reveals that visiting teams are more likely to be
denied a legitimate of a disputable penalty kick: the visiting team was
wrongly denied a legitimate penalty kick in 35.75% of cases but the home
team only in 29.59% of cases. This preferential treatment of the home
team is statistically significant at the 10% level as a one-sided
nonparametric test reveals (see last column of the first row in Table
5B). In case of disputable penalty kick decisions, the evidence for home
team favoritism is even more pronounced: home teams are awarded 28.67%
of disputable penalties but visiting teams only 20.27%. Estimation
results based on these data (reported in Dohmen 2005) indicate that home
teams are significantly more likely to receive a penalty kick when it
should objectively be awarded and when a penalty call is disputable. The
estimates also show that referees tend to award fewer disputable and
unjustified penalty kicks when the crowd is separated from the field by
an athletics track.
IV. CONCLUDING REMARKS
This paper has provided evidence that referees in German
professional soccer, who are appointed to be impartial, tend to make
decisions regarding stoppage time, penalty kicks, and goals that favor
the home team. Home-biased refereeing is more pronounced when the home
crowd has a stronger interest in a decision that favors their team, for
example, when the score margin is close and their team is behind in
score. There is also evidence that the home bias is mitigated when the
fraction of supporters of the visiting team rises, which indicates that
conflicting social forces have countervailing effects on individuals.
Strikingly, referees are more biased in stoppage time decisions and make
fewer correct penalty kick and goal decisions if the match is played in
a stadium without a running track that separates the stands from the
field. This indicates that refereeing quality is impaired when the crowd
is closer and when social pressure is arguably experienced as more
intense by the referee.
A likely channel that leads to the observed systematic differences
in referee decisions is that social pressure from the crowd directly
affects the referee, who then departs from the decision that maximizes
his expected material payoff. An alternative explanation for the pattern
that close games last longest is that referees lengthen exciting games
because spectators like suspense (see Chan, Courty, and Li 2006 for a
theoretical model). However, if this was the mechanism it is not clear
why there is a systematic difference between close matches in which the
home team is behind and those in which the home team is ahead, unless
home teams are generally more likely to level the score when being
behind by one goal. In any event, this alternative transmission channel
presupposes that referees submit to the preferences of the crowd and
would therefore not undermine the evidence that social forces affect
individuals' decisions. The finding that refereeing quality is
lower when no running track separates the stands from the field adds
important new evidence that social pressure is the mechanism at play. It
also helps to invalidate the alternative hypothesis that the governing
body, the DFB, condones home team favoritism.
Another explanation emphasizes a more indirect channel that leads
to biased refereeing; it entails that the crowd pressure affects the
players on the field who then feel encouraged to complain about
decisions that do not favor their team. It is interesting to recall in
this context that the official observer of the DFB referee committee,
who completes the evaluation form after the match, rates the
referee's decisiveness and his competence in dealing with
complaining players. As a result, referees are expected not to entertain
the players complaints, which renders this indirect channel less
plausible. But even if the true transmission mechanism worked through
players influencing the referees, the social pressure hypothesis would
not be invalidated; it would in fact be at the heart of the explanation.
Regardless of the particular channel through which referees'
decisions are affected, the fact that the social atmosphere in the
stadium is related to refereeing quality is indicative for the role of
social forces. As a result, we can conclude that the evidence is
consistent with the hypothesis that social forces can affect the
decisions of an individual who values not only material payoffs but also
social payoffs.
These results are important for the literature on endogenous preference formation for they provide support for theories that
emphasize the relevance of the social environment (e.g., Akerlof 1980;
Becker and Murphy 2000; Bernheim 1994). Bernheim's model implies
that we should expect a stronger influence of the social environment
when social pressure rises relative to the individual's material
payoff. This is likely the case in lower leagues, where monetary rewards
for referees are lower. In fact, a DFB official has confirmed that
referees in lower level leagues are regularly relegated or dismissed,
but that dismissal or relegation is less common for Bundesliga referees.
The DFB official attributed this to incentives effects arguing that
"much more is at stake for Bundesliga referees" and to
selection effects saying that Bundesliga referees are "simply
better referees as they have been promoted from lower leagues in several
rounds upon very positive judgements of DFB observers." The latter
statement implies that there is considerable heterogeneity in the extent
to which individuals are affected by social pressure. There is evidence
in the data that more experienced referees tend to be less biased, which
suggests that individuals can learn to resist social pressure.
To understand how and to what extent individuals can become immune
to social influences, it is important to ascertain whether referees
consciously submit to social pressure or whether they are affected
subconsciously. The latter conjecture appears to be more plausible. It
is likely that referees' objective judgment capabilities are
impaired by the emotional atmosphere in the stadium. Consistent with
that explanation, Nevill, Balmer, and Williams (2002) have provided
experimental evidence that soccer referees are affected by the
crowd's noise. They showed videotaped tackles from an English
Premier League soccer game to qualified referees who had to decide
whether or not to award a foul. One group watched the videotape without
the noise of the crowd being played, while the other group heard the
noise. The latter group called 15.5% fewer fouls against the home team.
Since referees have to judge a situation in a split of a second having
very little time for deliberation, their decision-making process might
be heavily influenced by cues in the environment and the atmosphere in
the stadium.
ABBREVIATION
DFB: German Football Association
REFERENCES
Akerlof, G. A. "A Theory of Social Custom, of Which
Unemployment May Be One Consequence." Quarterly Journal of
Economics, 94, 1980, 749-75.
Asch, S. E. "Effects of Group Pressure Upon the Modification
and Distortion of Judgments," in Groups, Leadership and Men, edited
by H. Guetzkow. Pittsburgh, PA: Carnegie Press, 1951, 177-90.
Austen-Smith, D., and R. G. Fryer. "The Economics of
'Acting White'." Quarterly Journal of Economics, 120,
2005, 551-83.
Becker, G. S., and K. M. Murphy. Social Economics. Market Behavior
in a Social Environment. Cambridge, MA: Harvard University Press, 2000.
Bernheim, D. B. "A Theory of Conformity." Journal of
Political Economy, 102, 1994, 841-77.
Brock, W. A., and S. N. Durlauf. "Interactions-Based
Models," in Handbook of Econometrics, Vol. 5, edited by J. J.
Heckman and E. Leamer. Amsterdam, The Netherlands: Elsevier Science,
1999, 3297-380.
Chan, W., P. Courty, and H. Li. Suspense. Working Papers tecipa-223. University of Toronto, Department of Economics. 2006.
Coleman, J. S. Foundations of Social Theory. Cambridge, MA,
Cambridge, Massachusetts: The Belknap Press of Harvard University Press,
1990.
Dohmen, T. J. "Social Pressure Influences Decisions of
Individuals: Evidence from the Behavior of Football Referees." IZA Discussion Paper No. 1595, 2005.
Dufwenberg, M., and M. Lundholm. "Social Norms and Moral
Hazard." Economic Journal, 111, 2001, 506-25.
Duggan, M., and S. D. Levitt. "Winning Isn't Everything:
Corruption in Sumo Wrestling." American Economic Review, 92, 2002,
1594-605.
Federation Internationale de Football Association. Laws of the
Game. Zurich, Switzerland: Federation Internationale de Football
Association, 2002.
Festinger, L., S. Schachter, and K. Back. "The Spatial Ecology of Group Formation," in Social Pressure in Informal Groups, L.
Festinger, S. Schachter and K. Back. Stanford, CA: Stanford University Press, 1950, 146-61.
Garicano, L., I. Palacios-Huerta, and C. Prendergast.
"Favoritism Under Social Pressure." Review of Economics and
Statistics, 87, 2005, 208-16.
Goldin, C., and C. Rouse. "Orchestrating Impartiality: The
Impact of 'Blind' Auditions on Female Musicians."
American Economic Review, 90, 2000, 715-41.
Knowles, J., N. Persico, and P. Todd. "Racial Bias in Motor
Vehicle Searches: Theory and Evidence." Journal of Political
Economy, 109, 2001, 203-29.
Linn, G. "Ziel und Ablaufvon
Schiedsrichterbeobachtungen." DFB Schiedsrichter-Zeitung, 1, 2003.
Nevill, A., N. Balmer, and A. Williams. "The Influence of
Crowd Noise and Experience upon Refereeing Decisions in Football."
Psychology of Sport and Exercise, 3, 2002, 261-72.
Prendergast, C., and R. Topel. "Favoritism in
Organizations." Journal of Political Economy, 104, 1996, 958-78.
Sutter, M., and M. G. Kocher. "Favoritism of Agents--The Case
of Referees' Home Bias." Journal of Economic Psychology 25(4),
2004, 461-9.
(1.) Akerlof (1980) argues that social sanctions can induce an
individual to adhere to a social custom even at the cost of forgoing pecuniary gains from breaking it. In Bernheim's (1994) model,
individuals care about social rewards in addition to their intrinsic
consumption utility, and conform to a single homogenous standard of
behavior if social rewards are sufficiently important relative to their
intrinsic utility. Adherence to a norm and persistency of a norm are
determined by the distribution of intrinsic preferences.
(2.) Dufwenberg and Lundholm (2001) argue that social rewards
mitigate moral hazard problems in the context of unemployment insurance
as the desire for social respect induces higher search effort;
Austen-Smith and Fryer (2005) formulate a theory of peer effects; and
Prendergast and Topel (1996) show theoretically how social influences
can create distortions in an agency model.
(3.) Empirical work on social interactions is plagued by
identification problems, which arise due to the complex relationships in
interaction-based models. Consequently, data requirements for empirical
testing are high. Brock and Durlauf (1999) review the empirical work on
interaction-based models and provide a detailed discussion of the
problems that arise in this literature.
(4.) Other studies have provided evidence that reveals innate bias,
that is, intrinsic preferences for discrimination, among arbitrators who
should be impartial. For example, Goldin and Rouse (2000) analyze data
from auditions of symphony orchestras and find suggestive evidence for
sex-biased hiring decisions. Knowles, Persico, and Todd (2001) derive a
test that allows to distinguish between statistical discrimination and
prejudice (i.e., a taste for discrimination); their empirical analysis
uncovers clues for prejudiced racial discrimination if particular
assumptions about individuals' utility functions are made.
(5.) Festinger, Schachter, and Back (1950) find that physical
distance is a key determinant of the intensity of social interaction.
(6.) This was confirmed by the DFB upon request. The DFB official
was reluctant to state why particular referees were not reappointed in
the past.
(7.) A round consists of nine matches, which are typically played
on weekends.
(8.) If two or more teams are tied on points, the superior goal
difference and then the higher number of goals determine which team is
ranked in higher position of the league table.
(9.) Apart from winning the championship or avoiding relegation,
there are other important sporting and financial incentives to finishing
in high ranks of the championship table because the rankings determine
eligibility for various football club competitions on the European level
of which the Union of European Football Associations Champions League is
the most lucrative and prestigious.
(10.) As all these points concern how referees deal with the social
environment on the field, performance in a social environment seems to
be a critical issue for the evaluation and appointment of referees.
These evaluations pay no explicit attention to whether the
referee's decisions are affected by the social environment created
by the crowd.
(11.) Garicano, Palacios-Huerta, and Prendergast (2005) proposed
this classification of matches. Duggan and Levitt (2002) follow a
similar approach of looking at the "relevant margin" in their
study of corruption in Sumo wrestling tournaments.
(12.) The results documented in Figure 1 are largely in accordance
with Garicano, Palacios-Huerta, and Prendergast (2005)'s findings
except for the fact that Spanish referees seem to award more stoppage
time when a match is drawn than when the visiting team is behind by one
goal; the opposite is true in the German data.
(13.) On average, 0.05 goals are scored during each minute of
stoppage time so that the impact of the bias might seem small. However,
this can be argued since goals scored in the last seconds of a match
might have enormous consequences. For example, FC Bayern Munchen won the
German championship in 2001 due to a goal they scored during injury time
in the last match of the season.
(14.) Very similar coefficient estimates, which are reported in
Table 3 of the working paper version (Dohmen 2005), are found in drawn
matches. Interestingly, referees also tend to favor the home team by
lengthening drawn marches in which the home team is more likely to score
next. In addition, games end about 10 sec earlier on average when
neither team had scored during regulation time and matches with more
shots on goal, more tackles, and more crosses last longer. This
indicates that referees lengthen more exciting matches, potentially
being influenced by spectators who like suspense.
(15.) Note that the regression results indicate that a large
fraction of the variation in stoppage time remains unexplained in the
specification in Column 3, even though the regression includes controls
for the host of variables that are explicitly mentioned in the Laws of
the Game. This might reflect the fact that the amount of time that is
wasted varies even for a particular type of match interruption, for
example, yellow carding a player, so that control variables are only an
imperfect measure of the actual time lost in a particular instance. But
the size of the unexplained variation also provides suggestive evidence
of the importance of referee discretion in stoppage time decisions.
(16.) Social pressure exerted by the crowd is likely to increase
not only toward the end of the game but also toward the end of the
season, when more is at stake so that the difference in stoppage time
awarded in close matches is expected to be larger toward the end of the
season. While this difference widens by about 10 sec in the last five
rounds compared to earlier rounds, this increase is not statistically
significant at conventional levels.
(17.) Linear distances are obtained using the program on
http://www.koordinaten.de/online/dist_wel.shtml (last seen on July 11,
2007).
(18.) The constructed popularity index varies slightly for a given
team because observations on matches that involve both opponents are
neglected when calculating this measure. Reassuringly, this proxy for
popularity is highly correlated with the number of fan clubs of the
different teams throughout the country. In fact, the four highest ranked
teams according to the constructed popularity index are FC Bayern
Munchen, Borussia Dortmund, Schalke 04, and Borussia Monchengladbach,
arguably the most popular teams in Germany, and certainly those with
most supporters' clubs.
(19.) An increase of 0.1 in the popularity index of the away team
raises attendance by about 4,900 visitors. The effect of distance is
nonlinear: compared to the reference case in which opponents' home
towns are 300-450 km apart, 4,638 more spectators are on average
attracted if the distance between the opponents' home towns is less
than 150 km and 865 additional spectators attend when this distance is
between 150 and 300 km, while there is no statistically significant
difference in attendance when the distance is even longer. Detailed
results are reported in the working paper version (Dohmen 2005).
(20.) This bias tends to be reduced if the visiting team attracts
more fans when playing away, but this effect is not statistically
significant.
(21.) The regression results also show that neither the number of
spectators nor the attendance-to-capacity ratio significantly affects
the length of stoppage time. Coefficient estimates for interactions
between these variables and the score difference indicator suggest that
a higher attendance-to-capacity ratio and a higher absolute number of
spectators tends to reduce the home bias in stadiums without a running
track and to increase it in stadiums with a running track. Alternative
specifications, in which controls for attendance and
attendance-to-capacity ratio and their interactions with the score
difference are dropped sequentially, indicate that a larger crowd and a
higher attendance-to-capacity ratio is required in stadiums with a
running track to induce referee bias. The difference in the size of the
home bias in stadiums with and without a running track disappears when
both interaction terms are excluded from the regression.
(22.) Home teams were awarded roughly twice as many penalty kicks
as visitors (569 vs. 288) during the seasons from 1993/1994 until the
first half of the season 2003/ 2004. This difference largely results
from different strategies: there is simply more play in the visiting
team's penalty area. Consequently, home teams are also more often
involved in critical situations in which no penalty kick was awarded.
(23.) The null hypothesis that observed differences are purely due
to random chance is rejected by a chi-square test at the 10%
significance level. The [chi square](2) statistic equals 4.75. Grouping
wrong and disputable decisions into one category and testing for the
significance of differences in the observed frequencies of correct and
not correct decisions for the home and visiting team yields a [chi
square](1) test statistic of 4.41 rejecting the null hypothesis that
referees decide in the same way for both teams at the 5% significance
level.
(24.) This effect is statistically significant at the 1% level.
Estimates from a multinomial logit model, in which the categories of the
dependent variable are a correct, disputable, or wrong decision,
indicate that home teams are more likely to be granted disputable goals
and that decisions in stadiums with a track are significantly less
likely to be wrong. The results also indicate that decisions that favor
the home team when it is behind by one or two goals are more likely to
be disputable than plainly wrong, which squares with the fact that
obvious misjudgment might have strong negative repercussions for
referees.
(25.) Home teams are most likely to be awarded a disputable or
illegitimate goal when being behind in score (see table 9 in Dohmen 2005
for estimation results).
THOMAS J. DOHMEN, I am grateful to IMP AG for providing the data. I
would like to thank especially Holger Rahlfs and Jorn Wendland for their
cooperation. I am also indebted to the DFB, particularly to Klaus Low,
for providing detailed information on financial rewards for referees and
their evaluation. I thank Michael Collins, Armin Falk, Luis Garicano,
David Huffman, Ben Kriechel, Winfried Koeniger, Steven Levitt, Canice
Prendergast, Uwe Sunde, two anonymous referees, and the editor for
helpful comments. I am also grateful for comments and suggestions of
seminar participants at Maastricht University, the University of
Chicago, Institute for the Study of Labor in Bonn, participants of the
17th Annual Conference of the European Society for Population Economics
in New York and of the 58th European Meeting of the Econometric Society in Stockholm on earlier drafts of the paper. All errors are mine.
Dohmen: Professor of Economics, Research Centre for Education and
the Labour Market (ROA), Maastricht University, PO Box 616, 6200 MD
Maastricht, The Netherlands. Phone +31-43 388 3647, Fax +3143 388 4914,
E-mail
[email protected]
TABLE 1
Length of Stoppage Time in Close Games
Panel A: Second-Half Stoppage Time
(1) (2)
Home ahead -22.142 *** (4.168) -20.601 *** (4.501)
No. of treatments -0.187 (0.219) 1.364 (0.460)
No. of substitutions 1.915 (1.168) 1.562 (1.602)
No. of fouls 0.284 (0.251) -0.819 ** (0.360)
No. of yellow cards 6.667 *** (0.998) 6.041 *** (0.929)
No. of 2nd yellow cards 3.694 (3.952) 2.871 (3.899)
No. of red cards 11.238 * (6.527) 12.703 ** (5.749)
Season dummies No Yes
Controls for relative No Yes
strength
Fixed effects for home No No
team, visiting team,
and referee
Constant 91.178 *** (11.264) 74.724 *** (14.831)
Observations 1,166 1,117
[R.sup.2] .07 .15
Season dummies
Panel A: Second-Half Panel B: First-Half
Stoppage Time Stoppage Time
(3) (4)
Home ahead -21.660 *** (4.478) -6.11 * (3.651)
No. of treatments 1.262 (0.447) 0.781 *** (0.156)
No. of substitutions 2.172 (1.421) 18.396 *** (2.406)
No. of fouls -0.558 * (0.311) 0.058 (0.172)
No. of yellow cards 6.118 *** (1.162) 5.754 *** (1.160)
No. of 2nd yellow cards 1.056 (4.231) 15.433 *** (5.816)
No. of red cards 12.519 ** (5.176) 27.724 *** (6.943)
Season dummies Yes No
Controls for relative Yes No
strength
Fixed effects for home Yes No
team, visiting team,
and referee
Constant 73.615 *** (16.947) 34.52 *** (7.020)
Observations 1,117 1,495
[R.sup.2] .31 .10
Panel B: First-Half Stoppage Time
(5) (6)
Home ahead -7.285 * (3.828) -7.833 ** (3.786)
No. of treatments 1.201 *** (0.330) 1.317 *** (0.369)
No. of substitutions 18.131 *** (2.197) 17.89 *** (2.214)
No. of fouls -0.066 (0.258) 0.046 (0.260)
No. of yellow cards 5.926 *** (1.129) 5.818 *** (1.176)
No. of 2nd yellow cards 13.989 ** (5.822) 11.154 * (6.459)
No. of red cards 31.182 *** (7.129) 33.767 *** (7.315)
Season dummies Yes Yes
Controls for relative Yes Yes
strength
Fixed effects for home No Yes
team, visiting team,
and referee
Constant 52.914 *** (11.366) 63.567 *** (16.483)
Observations 1,403 1,403
[R.sup.2] .13 .26
Notes: Ordinary least squares estimates. The dependent variable in
Panel A is the length of second-half stoppage time (in seconds) in
matches in which one team was leading by one goal after regular
time. The dependent variable in Panel B is the length of first-half
stoppage time (in seconds) in matches in which one team was leading
by one goal after regular first-half playing time. The variable
Home ahead takes the value 1 if the home team is one goal ahead and
0 if the home team is one goal behind. All listed independent
variables except for the total number of corresponding events in
the respective half of the match except for "No. of fouls" for
which separate information for each half is not available. Controls
for relative strength include the relative number of tackles won,
shots on goal, fouls, corner kicks, and crosses as well as the
absolute difference in rankings before the match and the ranking of
the home team. These variables are jointly not significant in the
regression models of Panel A (p value corresponding to Wald test
statistic exceeds .7). The rank of the home team is significant at
the 5% level in Specification (5) of Panel B. Information on
relative strength is missing for the majority of matches in the
season 1992/1993, which causes the number of observations to drop
when relative strength is controlled for. Standard errors, given in
parentheses, allow for correlation between observations of the same
referee.
* Significant at the 10% level; ** significant at the 5% level;
*** significant at the 1% level.
TABLE 2
The Effects of Crowd Composition
Distance
[less than or equal to] Distance
150 km > 150 km
(1) (2)
Score difference -4.199 -12.055 *** (2.590)
Attraction x -20.335 * (11.704) 3.596 (4.170)
score
difference
Observations 199 918
[R.sup.2] .58 .34
Notes: Ordinary least squares estimates. The dependent
variable is second-half stoppage time in close matches. The
regressions include controls for the number of treatments,
substitutions, fouls, and cards and controls for relative
strength, season dummies, referee dummies, and team dummies.
The variable "Attraction x score difference" is the
product of the Score difference variable, which takes the
value -1 if the home team is one goal behind and 1 if
the home team is one goal ahead at the end of regulation
time, and a dummy that takes the value 1 if the visiting team
ranks in the top third of relative crowd composition index.
This index is equal to the ratio of the average attractiveness
of the visitor (i.e., the average attendance-to-capacity ratio
in all away games except those that are played on the current
opponent's turf) to the average attendance-to-capacity
ratio in home games of the home team (calculated over all
home matches except for those against the current opponent).
The variable "Distance" measures the shortest
distance between the home cities of opponents. Standard
errors are given in parentheses.
* Significant at the 10'Y, level; *** significant at the 1% level.
TABLE 3
The Stadium, the Crowd, and Second-Half Stoppage Time in Close Games
Stadiums without Stadiums with
Track Track
(1) (2)
Score difference -29.297 ** (13.938) -7.392 (7.569)
Attendance (in thousands) 0.986 (0.680) -0.367 (0.612)
Attendance x score
difference 0.126 (0.231) -0.123 (0.223)
Ratio of attendance-
to-capacity -39.833 (33.191) 8.775 (34.761)
Ratio of attendance-to-
capacity x score
difference 18.447 (18.257) -2.129 (14.381)
No. of treatments 1.499 ** (0.618) -0.412 (0.527)
No. of substitutions 2.941 (2.390) 1.250 (2.299)
No. of yellow cards 8.004 *** (1.952) 5.943 *** (1.737)
No. of 2nd yellow cards -3.135 (8.970) 1.419 (6.379)
No. of red cards 15.202 (9.831) 9.210 (7.328)
Constant 45.464 (52.650) 76.008 (46.951)
Observations 554 563
[R.sup.2] .38 .41
Notes: Ordinary least squares estimate. The dependent variable is
the length of stoppage time awarded at the end of the match in
matches where the home team is either one goal behind (Score
difference = -1) or one goal ahead (Score difference = 1). The
sample is split into matches that took place in stadiums without a
running track separating the field and the stands (Column 1) and
matches in stadiums with a running track (Column 2). All
regressions include the same set of controls for relative strength
as in Table 1, as well as fixed effects for home teams, visiting
teams, and referees. The effects of controls for relative strength
are never significant. The results concerning the home bias are not
affected by the inclusion or exclusion of team-fixed effects.
Standard errors are given in parentheses.
** Significant at the 5% level; *** significant at the 1% level.
TABLE 4
Correctness of Decisions to Award Penalties and Goals
(A) Decisions to Award a Goal
Goal for Home Team Decision
Score Difference Wrong Correct Disputable
[less than or
equal to] -2 1.68 92.26 6.06
-1 1.43 93.69 4.88
0 1.56 95.04 3.39
1 1.38 95.48 3.13
[greater than or
equal to] 2 1.01 96.64 2.35
Total 1.43 95.05 3.52
Goal for Visiting Team Decision
Score Difference Wrong Correct Disputable
[less than or
equal to] -2 0.68 97.62 1.70
-1 0.15 97.51 2.34
0 0.88 96.08 3.04
1 1.62 93.73 4.65
[greater than or
equal to] 2 0.95 96.76 2.29
Total 0.92 95.99 3.09
(B) Decisions to Award a Penalty Kick
Penalty Kick for Home Team Decision
Score Difference Wrong Correct Disputable
[less than or
equal to] -2 8.57 45.71 45.71
-1 14.29 62.86 22.86
0 6.38 62.55 31.06
1 5.00 70.00 25.00
[greater than or
equal to] 2 4.05 78.38 17.57
Total 7.38 65.20 27.42
Penalty Kick for Visiting Team Decision
Score Difference Wrong Correct Disputable
[less than or
equal to] -2 11.76 70.59 17.65
-1 0.00 87.10 12.90
0 8.04 68.75 23.21
1 6.25 67.50 26.25
[greater than or
equal to] 2 2.08 81.25 16.67
Total 5.90 72.57 21.53
Notes: The upper panel of the table shows the percentages of goals
for the home team and the visiting team that were wrongly,
correctly, or disputable awarded by score differences. The
information is based on all 10,163 goals (6,025 for the home team
and 4,138 for the visitor team) that were awarded from the start of
the season 1992/1993 until the end of the first half of the season
2003/2004 and for which the correctness indicator is available. The
lower panel of the table shows the percentages of awarded penalties
for the home team and the visiting team that were wrongly,
correctly, or disputably awarded. The information is based on all
penalties that were awarded from the start of the season 1993/1994
until the end of the first half of the season 2003/2004 and covers
857 penalty kick decisions, 569 for the home team and 288 for the
visitor team. The score difference is calculated as the number of
goals that had been scored by the home team minus the number of
goals that had been scored by the visitor team at the time a
decision was made.
TABLE 5
Decisions to Award Penalties and Goals Conditional on Legitimacy
Home Team Visiting Team
Awarded Denied Awarded Denied
(A) Decisions to award a goal
Legitimate 5,199 68 3,637 46
Disputable 194 57 120 43
Illegitimate 79 125 37 84
(B) Decisions to award a penalty kick
Legitimate 207 87 115 64
Disputable 86 214 30 118
Illegitimate 23 192 13 87
W-M-W Test
z Value p Value
(A) Decisions to award a goal
Legitimate -0.175 .569
Disputable 0.852 .197
Illegitimate 1.480 .070
(B) Decisions to award a penalty kick
Legitimate 1.393 .082
Disputable 1.906 .028
Illegitimate -0.597 .725
Notes: The upper panel of the table shows the numbers of
legitimate, disputable, and illegitimate goals that were either
granted or denied for home and visiting teams during all seasons
from 1993/1994 until the end of the first half of the season
2003/2004. The indicator for the correctness of the referee's
decision is missing for two goals that have been awarded in this
period and for one goal that has not been awarded. These
observations are not included when calculating the statistics. The
lower panel of the table shows the numbers of legitimate,
disputable, and illegitimate penalty kicks that were either granted
or denied for home and visiting teams from the start of the season
1998/1999 until the end of the first half of the season 2003/2004.
Statistics are based on ratings of all situations that led to
penalty kicks and all critical actions that could have potentially
lead to a penalty kick but were not penalized. The last column of
each panel shows p values for the hypothesis that referees do not
favor the home team against the one-sided alternative that they are
more likely to award a goal (upper panel) or a penalty (lower
panel) for the home team. W-M-W, Wilcoxon-Mann-Whitney.