ANTHROPOLOGY 4559/7559:
QUANTITATIVE
ANALYSIS II


Description
This is a second course in statistical methods useful in many disciplines, including archaeology, anthropology, and environmental sciences. The goal is to equip students with statistical skills useful in analyzing empirical variation, deciphering links to the environmental and historical contexts in which that variation occurs, and using the results to advance scientific understanding. Coverage includes probability distributions, basics of maximum-likelihood and Bayesian estimation, linear and generalized-linear models, non-parametric smoothing, multivariate distances, Mantel regression, and ordination methods (principal component analysis, correspondence analysis, and multidimensional scaling). The course emphasizes practical data analysis using R.

There is one required text for the course:

Dalgaard, Peter
2008 Introductory Statistics with R, Second Edition. Springer, New York.

Consult the Syllabus for additional required reading, all of which will be available on Collab


Requirements
Written work for the course includes Problems Sets and a Final Project. Problem Sets will be assigned nearly every week. Completed problem sets will be due the following week, at the beginning of class, when we will discuss them. Your write up should include not only your numerical results, illustrated with appropriate graphics, but also a description of how you got them and what you think they mean, both in statistical and archaeological terms. The text should be no more than four pages, doubled spaced. You should also be prepared to offer the class a succinct summary of your work, as a contibution to the class discussion of the problem sets. Late problem sets will not be accepted.

The Final Project is your opportunity to use the methods we have covered in the course to tease substantive meaning out of archaeological data. Note that the emphasis here is on meaning: the project should emphasize analytical methods as tools to make and evaluate defensible inferences about past dynamics from material evidence. You will need to identify an historical or anthropological issue and one or more sets of quantitive archaeological data that might be used to illuminate it. You are encouraged to choose issues and data in which you have a personal research interest and, therefore, basic familiarity with the current background literature. The data you use should contain information on several different variables that are of potential relevance to your problem. A key part of the project will be defending their relevance. The data should contain additional variables that independently document either the temporal or spatial contexts of the objects (assemblages, artifacts) characterized by the "relevant" variables. Over the first several weeks of the course, we will work together to identify suitable issues and datasets for everyone. The topic and dataset(s) for your project must be approved by the instructor. Your write up for the final project should not exceed 15 pages (double-spaced), exclusive of graphics.

Grades of the course will be computed as follows: PS*.6 + FP*.3 + CD*.1, where PS=mean score for all problem sets, FP=score for the final project, and CD=mean score for contributions to class discussion.


Some R Links


Datasets
  • Sherd thickness in Woodland-Period Illinois.
  • JamestownPipeData.xls.
    An Excel file that contains stem-hole diameter data for imported and locally made pipes from Jamestown Island sites. Source: Cotter, John 1958, Archaeological Excavations at Jamestown Virginia , National Park Service, Washington DC.
  • HarringtonsData.xls.
    An Excel file that contains stem-hole diameter data from Harrington's 1954 article.Source:Harrington, J.C. 1954, Dating Stem Fragments of 17th and 18th Century Clay Tobacco Pipes. Quarterly Bulletin of the Archaeological Society of Virginia.
  • Site size and population in 16th-century Peru.
  • CoddingData.csv.
    A CSV file that contains NISP for Deer and Rabbits from multiple time periods on the Pecho Coast, California. Source:Codding, Brian F., Judith F. Porcasi, and Terry L. Jones 2009 Explaining prehistoric variation in the abundance of large prey: A zooarchaeological analysis of deer and rabbit hunting along the Pecho Coast of Central California. Journal of Anthropological Archaeology 29(1): 47-61.
  • GalleData.csv.
    A CSV file that contains mean ceramic dates and counts for ceramic ware types and wine bottle glass from late-eighteenth century slave-quarter sites in Virginia. Source:Galle, Jillian. 2010 Costly Signaling and Gendered Social Strategies among Slaves in the Eighteenth-Century Chesapeake. American Antiquity 75(1) 19-43.
  • MimbresFauna.csv.
    A CSV file with data on artiodactyl and leporid frequencies in faunal assemblages from Mimbres sites in New Mexico. Source: Cannon, Michael D. 2003 A model of central place forager prey choice and an application to faunal remains from the Mimbres Valley, New Mexico. Journal of Anthropological Archaeology 22: 1-25
  • CannonData.csv.
    A CSV file with more -- and more complex -- data on faunal assemblages from Mimbres sites in New Mexico. Source:Cannon, Michael D, 2000 Large Mammal Relative Abundance in Pithouse and Pueblo Period Archaeofaunas from Southwestern New Mexico: Resource Depression among the Mimbres-Mogollon? Journal of Anthropological Archaeology 19:317-314.
  • Ceramic type frequencies from layers within room blocks at Los Muertos Pueblo
  • Woodland-Period ceramic assemblages from the Deep South
  • Stylistic Variation in Ceramics from Woodland-Period Illinois
  • Conkey.csv.
    The Conkey's canonical diversity dataset. Source: Conkey, M.W. 1980 The Identification of Prehistoric Aggregation Sites: The Case for Altimira. Current Anthropology21:609-630.


Seriation Tool
  • Seriation Diagram Tool
    An Excel spreadsheet with VBA code that draws seriation diagrams. Beta version by Tim Hunt.


Diversity Bootstrap
  • BootDiversity.sas
    SAS code for bootstrapped diversity estimates. Uses Conkey's data as an example.