What is Multilevel Modeling?
Multilevel modeling (MM) is a family of statistical procedures that try to come to terms with influences that are located on different, well, levels. So naturally the question arises what is meant by "level".
One way to think about it is as follows: People do not live entirely on their own, but rather embedded in social units. Even though today, in a globalized world, we may say that people have relationships with other people all over the world, most people have some relationships that are more special than others. People who are linked together via special relationships frequently communicate among each other, and thus the possibility rises that the people you are linked to influence your views. So we may think about the individuals as one (the lowest) level and their network (whether it consists of people that are met in person or of people communication with whom may take place only via artifical media) as a next (higher) level.
(Note that " low" and " high" are just names; we may well think about things the other way round. "High" just means something like "aggregate"; that is, several individuals – entities on the "low" level – are seen as agglutinated).
A second way: Opportunities structure the behaviour of individuals, and as many people
select their opportunities by local proximity, the region in which a person lives may enhance or restrict his or her opportunity. For instance, if a person lives in a region with high unemployment, this may influence his or her behaviour about acceptable wage levels when looking for a new job.
A third way: Often people, be it voluntarily or not, are subject to common external influences. Take, for instance, a university professor. All the students that come to him or her are subject to her or his way of teaching. Could be that this way of teaching influences these students (even though this certainly – if sometimes fortunately – happens less frequently than we professors might desire).Therefore, again we may think of a multitude of professors as the "higher" level units and of their many students as the "lower" level units.
For a variety of reasons, data referring to more than one level often cannot be analyzed by conventional statistical models. For instance, classical OLS regression analysis requires that residuals from individual observations are not correlated. This requirement becomes doubtful if these individual observations are subject to the same influences or are related to each other in other ways.
After so many words, this page – for the time being – does very little: It provides a few links to MM related pages, and it also provides selected references to the literature, with short comments.
Please not that, regrettably, updates to this page are quite limited:
- The section with links is checked from time to time concerning changes in URLs; if I encounter interesting sites, they will be added. However, during the recent years I have only rarely looked for other sites.
- Only the first part of the literature section, dealing with introductory texts and textbooks, is updated (i.e. augmented) occasionally. The remaining parts of this section were written in 1999 and have not changed since then.
Dear reader, the following remarks are intended to serve as a guide to the growing literature on multilevel modelling. They cover most of the textbooks and many of the "classical" papers by those statisticians who did most of the pathbreaking work.
Regrettably my overview covers only the years up to 1999 (even though I will try to stay up to date as far as textbooks are concerned)! Of course, since that time many useful new papers have appeared, but even though I take an occasional glance at some of them, I do not find the time to include them here. Please accept my apologies!
Note that the Centre for Multilevel Modeling now has a similar website that introduces you particularly to newer, and also to more specialised, books.
Introductory texts (elementary and advanced) (updated every now and then)
- Bickel, Rorbert (2007): Multilevel Analysis für Applied Research. It's Just Regression!, New York, London: Guilford Press
The subtitle runs through the book from the beginning. Bickel of course is right. But in my view the book sometimes is cumbersome to read, as it is rather wordy, which Bickel admits himself. On the other hand, if you like long and wordy explanations, this may be the book of your choice. What's really nice is the short overwiew of useful resources at the end of each section. Another great feat, even though I think this is not the effect intended by Bickel: The examples of how to do multilevel analysis via SPSS's menu system that are scattered over the entire book make it clear how any understanding of what you're doing is utterly obfuscated by doing statistics this way (i.e. using menus instead of commands written in plain text).
- Bressoux, Pascal, Coustère, Paul & Leroy-Audouin, Christine (1997): Les modèles multiniveau dans l'analyse écologique: le case de la recherche en éducation, in: Revue française de sociologie, 38, 67-96.
Some like it French: An introductory paper on multilevel analysis of educational achievement, with good data. No maths.
- Bryk, A. S. & Raudenbush, S. W. (1992): Hierarchical Linear Models. Applications and
Data Analysis Methods. Newbury Park, CA: Sage.
Very good for those who wish to arrive at an advanced understanding.
- DiPrete, Thomas A. & Forristal, Jerry D. (1994): Multilevel Models: Methods and
Substance, in: Annual Review of Sociology, 20, 331-357.
A good overview of the basic ideas and of applications, with most emphasis on
random-coefficient models.
- Ditton, Hartmut (1998): Mehrebenenanalyse. Grundlagen und Anwendungen des
Hierarchisch Linearen Modells. Weinheim und München: Juventa.
This book is specifically useful for those who want to analyze school data (and have to resort to books in German). However, readers of chapter 3 (dealing largely with
centering) should consult chapter 5.2 in Kreft/de Leeuw 1998 and the paper by Kreft/de Leeuw/Aiken 1995.
- Engel, Uwe (1998): Einführung in die Mehrebenenanalyse. Opladen: Westdeutscher
Verlag (WV Studium 182).
The wide coverage of this book has much to recommend it - for users who already have acquired an elementary understanding.
- Goldstein, Harvey (1995): Multilevel Statistical Models. London: Arnold.
A more advanced introduction by one of the " fathers" of multilevel
modeling, now (2003) in its third edition. An earlier version of the book can be downloaded here. Some material concerning the third edition, not least some errata, may be found here.
- Hox, Joop (2002): Multilevel Analysis. Techniques and Applications. Mahwah, NJ: Erlbaum.
This book is the expanded and updated version of an earlier book, Multilevel Analysis, Amsterdam: TT Publishers, 1995, which is downloadable for free (attention, this is a PDF file with a size of several MB).
- Hox, Joop J. & Kreft, Ita G. G. (1994): Multilevel Analysis Methods, in: Sociological
Methods & Research, 22, 283-299.
This introduction to a special issue of SMR gives a brief overview of problems
resulting from using traditional approaches for multilevel data, of the basic structure of the random coefficient model, of available software, and of current problems and possible future developments.
- Kreft, Ita G. G. (1991): Using Hierarchically Linear Models to Analyse Multilevel Data. In: ZUMA-Nachrichten, 29, 44-56.
A brief introduction that should be found in most German sociology libraries, but
(naturally) cannot replace a textbook.
- Kreft, Ita & de Leeuw, Jan (1998): Introducing Multilevel Modeling, London: Sage.
This is certainly the most helpful textbook for those with a strong dislike of maths, formal derivations and the like. The reader is carefully guided through a number of examples. However, one should be aware that there are many advanced topics that are not dealt with in this book.
- Langer, Wolfgang (2004): Mehrebenenanalyse: Eine Einführung für Forschung und Praxis. Wiesbaden: VS-Verlag.
This should definitely be the source for beginners who need a text in German. What I like particularly is the author's approach to start with detailed exploratory analyses and only then to proceed to more advanced modeling strategies. Complex mathematical detail (for instance, concerning estimation methods) is omitted, but the practical implications are dealt with in a fairly comprehensive manner. Examples use MLA, a free software for estimating multilevel models.
A website provides data sets and command files used in this book as well as links to software and other sources.
- Longford, N. (1993): Random coefficient models. Oxford: Oxford University Press.
In this book, statistical reasoning is paramount, but it is also applied to several datasets with helpful discussions of the results.
- Luke, Douglas A. (2004): Multilevel Modeling. Thousand Oaks, CA: Sage (Sage university papers series: Quantitative applications in the social sciences, 143).
Like all other volumes in this series, this is a nice and accessible introduction which, due to the usual limitations of space, cannot discuss all the complications that may arise during a particular analysis.
- Ohr, Dieter (1999): Modellierung von Kontexteffekten: Voraussetzungen, Verfahren und
eine empirische Anwendung am Beispiel des politischen Informationsverhaltens, in: ZA-Information 44, 39-63.
A fine brief introduction, but I feel that it is somewhat infortunate that MM is
introduced via an example where the gains from using a MM approach are almost nil
(statistically speaking, i.e. the MM analysis barely differs from OLS regression results).
- Singer, J. D. & Willett, J. B. (2003): Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence. New York: Oxford University Press.
Although focussing on one specific application of multilevel modelling, i.e. within-individual change, this book certainly is worthwhile reading. – There is also a website for this book with data sets, errata and other stuff.
- Snijders, Tom & Bosker, Roel (1999): Multilevel Analysis. An introduction to basic and advanced multilevel modeling. London, Thousand Oaks: SAGE.
This is a very thorough introduction that requires quite some effort on part of the student - but it pays. Especially chapters 8 and 9 should be studied carefully, as I have found no comparable discussion of heteroscedascity and the basic assumptions (and how to check them) of multilevel modeling.
Other papers on multilevel random coefficient (or variance components) models (not updated since 1999)
- Aitkin, M. & Longford, N. (1986): Statistical Modelling Issues in School Effectiveness
Studies, in: Journal of the Royal Statistical Society, Series A 149, 1-43.
This paper introduces a variance component (or "random intercept" and "random
slope") model based on a Maximum Likelihood algorithm. Results of this model on a school
data set are contrasted to some individual level models (including one with
context effects) and to an aggregate level analysis.
- Bock, Darrell R. (Ed.) (1989): Multilevel Analysis of Educational Data. San Diego:
Academic Press.
See statisticians work. This book is largely about how to assess school effectiveness. Some chapters give you a good idea of what Empirical Bayes estimation is about.
- Bryk, Anthony S. & Raudenbush, Stephen (1987): Application of Hierarchical Linear
Models to Assessing Change, in: Psychological Bulletin, 101, 147-158.
A fine paper with a very clear structure, demonstrating the possibilities of multilevel modeling in the analysis of longitudinal data.
- de Leeuw, Jan & Kreft, Ita G. G. (1986): Random Coefficient Models for Multilevel Analysis, in: Journal of Educational Statistics 11, 57-85.
Discusses nicely some of the limitations of traditional contextual analysis and then proceeds to elaborate at some length the statistical properties of different estimation methods for RC models. These methods eventually are applied to a Dutch school example.
- de Leeuw, Jan & Kreft, Ita G. G. (1995): Questioning Multilevel Models, in: Journal of
Educational and Behavioral Statistics 20, 171-189.
Illuminating discussion of some implications of multilevel modeling.
- Draper, D. (1995): Inference and hierarchical modeling in the social sciences, in: Journal of Educational and Behavioral Statistics 20(2), 115-147.
Technical discussion (but without any maths) of inferential issues in multilevel modelling; pleas for more frequent use of Markov-Chain Monte Carlo techniques.
- Goldstein, Harvey & Rasbash, J. (1996): Improved approximations for multilevel models with binary responses, in: Journal of the Royal Statistical Society, Series A, 159, 505-513.
This paper develops a penalized quasi-likelihood (PQL) method for the estimation of multilevel models with a dichotomous dependent variable. It demonstrates that this method is superior to the earlier marginal quasi-likelihood approach, especially in the case of extreme data constellations such as a very skewed dependent variable.
- Goldstein, Harvey, Rasbash, Jon, Yang, Min, Woodhouse, Geoffrey, Pan, Huiqi, Nuttall, Desmond & Thomas, Sally (1993): A Multilevel Analysis of School Examination Results, in: Oxford Review of Education 19(4), 425-433.
This paper highlights some of the problems of comparing schools; see also Goldstein & Spiegelhalter (1996) for more extensive discussion of these topics
- Goldstein, Harvey & Spiegelhalter, David J. (1996): League tables and their limitations: statistical issues in comparisons of institutional performance, in: Journal of the Royal Statistical Society, Series A, 159(3), 385-443.
- Heath, Anthony, Yang, M. & Goldstein, Harvey (1996): Multilevel analysis of the changing relationship between class and party in Britain 1964-1992, in: Quality & Quantity 30, 389-404.
A sociological example with a binary dependent variable. Focus is not on statistical basics, but on substantive meaning of multilevel vs. single-level modelling.
- Hox, Joop J. (1994): Hierarchical Regression Models for Interviewer and Respondent Effects, in: Sociological Methods & Research, 22, 300-318.
A useful substantive application, especially for those who are tired of school studies
....
- Kreft, Ita G. G. (1993): Using Multilevel Analysis to Assess School Effectiveness: A
Study of Dutch Secondary Schools, in: Sociology of Education 66, 104-129.
This is an applied paper dealing mainly with the substantive results of multilevel
(random coefficient) modeling.
- Kreft, G. G. & de Leeuw, E. D. (1988): The See-Saw Effect: a multilevel problem?, in:
Quality & Quantity 22, 127-137.
The see-saw effect is a difference -- perhaps even an (apparent) contradiction --
between individual and aggregate level effects, as for instance when a variable (say, sex) has a positive slope on the individual level, but the same attribute measured on the aggregate level (the proportion of girls in a class) has a zero or even negative slope. This paper discusses the conditions under which such effects may occur and in addition propagates random coefficient models as procedures to get appropriate estimates of standard errors, appropriate specifically when compared to the "Cronbach procedure" that is discussed as a contrast.
- Kreft, Ita G. G. & de Leew, Jan (1994): The Gender Gap in Earning. A Two-Way Nested
Multiple Regression Analysis with Random Effects, in: Sociological Methods & Research,
22, 319-341.
Another substantive application that does not focus on school data.
- Kreft, Ita G. G., de Leew, Jan & Aiken, Leona S. (1995): The Effects of Different Forms of Centering in Hierarchical Linear Models, in: Multivariate Behavioral Research, 30, 1-21.
Mandatory reading for all who want to know more about centering of variables.
- Nuttall, D. L., Goldstein, Harvey, Prosser, R. & Rasbash, J. (1989): Differential school effectiveness, in: International Journal of Educational Research, 13, 769-776.
Focusses on substantive implications; no maths.
- Raudenbush, Stephen W. (1988): Educational Applications of Hierarchical Linear
Models: A Review, in: Journal of Educational Statistics 13, 85-116.
Despite its title, this paper is basically a rather formal exposition of the estimation theory in multilevel models, accessible only to readers with substantial statisticaltraining.
- Raudenbush, Stephen & Bryk, Anthony S. (1986): A Hierarchical Model for Studying School Effects, in: Sociology of Education 59, 1-17.
A short presentation of a random coefficient model with applications on a school data set to demonstrate the different variance components and cross-level interaction.
- Snijders, Tom A.B. und Bosker, Roel J. (1994): Modeled Variance in Two-Level-Models, in: Sociological Methods & Research 22, 342-363.
A useful paper that discusses the problem or "variance accounted for" (or "modeled variance") in multilevel models.
Literature with emphasis on the "pre-random coefficients" stage in thinking about multilevel models
- Blalock, H. M. (1984): Contextual-Effects Models: Theoretical and Methdological Issues, in: Annual Review of Sociology 10, 353-372.
Main focus here is on substantive interpretation and discussion of theoretical implications.
- Burstein, L. (1980): The Analysis of Multilevel Data in Educational Research and Evaluation, in: Review of Research in Education 8, 158-223.
This paper first discusses the problems of drawing inferences from aggregate level analysis at individual level relationships (cross level inference). Next, discussions about the appropriate level of analysis (individual or aggregate) are summed up. Another section discusses cross-level effects, that is, possible effects of aggregate level on individual level characteristics. Finally, some statistical procedures for a multilevel analysis are discussed, such as the "slope-as-outcomes" strategy developped by Burstein et al. in other papers. This paper is quite thorough, but probably of interest mainly to those with a strong interest in the developments that occurred during the 1970s.
- Burstein, Leigh, Linn, Robert L. & Capell, Frank J. (1978): Analyzing Multilevel Data in the Presence of Heterogeneous Within-Class Regressions, in: Journal of Educational Statistics, 3, 347-383.
Compares three strategies of analysis, (1) a Cronbach analysis (see Cronbach & Webb 1976), (2) a Slopes-as-Outcomes analysis, and (3) a Keesling-Wiley procedure. Concludes that all three procedures can give misleading estimates of class effects on mean class outcomes, but that they may also indicate sources of model mis-specification.
- Cheung, K. C., Keeves, J. P., Sellin, N. & Tsoi, S. C. (1990): The analysis of multilevel data in educational research: Studies of problems and their solutions, in: International Journal of Educational Research 14, S. 215-319.
This is sort of a "transitional" document. It discusses and criticizes at great length the "context analysis" of the 1970s and earlier 1980s, and then proceeds to present the more recent approaches by Goldstein, Bryk/Raudenbush, or Longford, which at that time were in an early stage, especially computer-wise.
- Iversen, G. (1991): Contextual Analysis, Newbury Park: Sage (Sage University Paper Series on Quantitative Applications in the Social Sciences).
This small volume is a careful and elementary introduction to modeling individual and aggregate level effects, but only in the final chapter random coefficient models are mentioned briefly and the problems inherent in the older approach are not really discussed. Iversen seems to imply (even though I am not sure whether he says so explicitly) that smaller standard errors for estimated coefficients are superior to larger standard errors (see p. 58 et seq.), but of course this is only true if the smaller errors are indeed valid, which in this case is very doubtful.
This page is a process initiated and maintained by
Prof. Dr. Wolfgang Ludwig-Mayerhofer
Universität Siegen
FB 1 - Soziologie
57068 Siegen
Homepage at the University of Siegen
E-Mail: 
This e-mail address is a graphics file in order to protect me from spam mails. Sorry for the inconvenience –: you have to type it into your e-mail client.
Top of page | Last update: 30 Aug 2009