STATS 7054 - Statistical Modelling

North Terrace Campus - Semester 1 - 2017

One of the key requirements of an applied statistician is the ability to formulate appropriate statistical models and then apply them to data in order to answer the questions of interest. Most often, such models can be seen as relating a response variable to one or more explanatory variables. For example, in a medical experiment we may seek to evaluate a new treatment by relating patient outcome to treatment received while allowing for background variables such as age, sex and disease severity. In this course, a rigorous discussion of the linear model is given and various extensions are developed. There is a strong practical emphasis and the statistical package R is used extensively. Topics covered are: the linear model, least squares estimation, generalised least squares estimation, properties of estimators, the Gauss-Markov theorem; geometry of least squares, subspace formulation of linear models, orthogonal projections; regression models, factorial experiments, analysis of covariance and model formulae; regression diagnostics, residuals, influence diagnostics, transformations, Box-Cox models, model selection and model building strategies; ; logistic regression models; Poisson regression models.

  • General Course Information
    Course Details
    Course Code STATS 7054
    Course Statistical Modelling
    Coordinating Unit Mathematical Sciences
    Term Semester 1
    Level Postgraduate Coursework
    Location/s North Terrace Campus
    Units 3
    Contact Up to 3 hours per week
    Available for Study Abroad and Exchange Y
    Prerequisites STATS 2107 or (MATHS 2201 and MATHS 2202)
    Assumed Knowledge Experience with the statistical package R such as would be obtained from STATS 1005 or STATS 2107
    Assessment ongoing assessment 30%, exam 70%
    Course Staff

    Course Coordinator: Dr Tyman Stanford

    Course Timetable

    The full timetable of all activities for this course can be accessed from Course Planner.

  • Learning Outcomes
    Course Learning Outcomes
    1. Explain the mathematical basis of the general linear model and its extensions to multilevel models and logistic regression.

    2. Use the open source programming language R for the analysis of data arising from both observational studies and designed experiments.

    3. Explain the role of statistical modelling in discovering information, making predictions and decision making in a range of applications including medicine, engineering, science and social science.
    University Graduate Attributes

    This course will provide students with an opportunity to develop the Graduate Attribute(s) specified below:

    University Graduate Attribute Course Learning Outcome(s)
    Deep discipline knowledge
    • informed and infused by cutting edge research, scaffolded throughout their program of studies
    • acquired from personal interaction with research active educators, from year 1
    • accredited or validated against national or international standards (for relevant programs)
    Critical thinking and problem solving
    • steeped in research methods and rigor
    • based on empirical evidence and the scientific approach to knowledge development
    • demonstrated through appropriate and relevant assessment
    Career and leadership readiness
    • technology savvy
    • professional and, where relevant, fully accredited
    • forward thinking and well informed
    • tested and validated by work based experiences
  • Learning Resources
    Required Resources
    There is no prescribed text for this course.
    Recommended Resources
    The following references are recommended reading:

    1. W. Venables and B. Ripley. Modern Applied Statistics with S, Fourth Edition, Springer, 2002.
    2. T. Snijders and R. Bosker. Multilevel analysis (Second Edition), Sage, 2012.
    3. A. Dobson and A. Barnett. An introduction to generalised linear models (Third Edition), Chapman & Hall/CRC, 2008.
    4. P. McCullagh and J. Nelder. Generalised linear models (Second Edition), Chapman & Hall/CRC, 1989.
    Online Learning
    This course uses MyUni for providing electronic resources, such as lecture notes, assignment papers, tutorial and computing exercises. Students should check their email and MyUni announcements for this course regularly for any notices or correspondence.

    Link to MyUni login page:
  • Learning & Teaching Activities
    Learning & Teaching Modes
    The lecturer guides the students through the course material in 24 lectures. Students are expected to prepare for lectures by reading the printed notes in advance of the lecture, and by engaging with the material in the lectures. Students are expected to attend all lectures, but lectures will be recorded (where appropriate facilities are available) to help with occasional absences and for revision purposes. In the fortnightly tutorials, students will discuss their solutions in groups and present them to the class on the board. These exercises will be further supplemented by the fortnightly computing practical sessions during which students will work under guidance on practical data analysis and develop more advanced computing skills using R. A series of five homework assignments builds on the tutorial and practical material and provides students with the opportunity to gauge their progress and understanding of the course material.

    The information below is provided as a guide to assist students in engaging appropriately with the course requirements.

    Activity Quantity Workload
    Lectures 24 72
    Tutorials 6 18
    Practicals 6 18
    Assignments 5 48

    Learning Activities Summary
    Week: Topics covered
    1. Introduction, matrix notation, multiple regression. The linear regression model.
    2. Estimable functions and best linear unbiased estimates. The Gauss-Markov Theorem. Inference for multiple regression. Prediction. Symbolic specification of linear models.
    3. Factors and contrasts. The marginality principle.
    4. Regression diagnostics; influence diagnostics. Leverage and Cook's distance.
    5. Model building: variance reduction in randomised experiments; observational studies and quasi-experiments. The role of predictor variables.
    6. Model selection algorithms: forwards, backwards and stepwise selection procedures. Box-Cox transformation and the profile likelihood.
    7. Generalised least squares (GLS). The geometry of least squares; orthogonal projections.
    8. The geometry of least squares (cont.). Estimation of the residual variance; estimable functions; hypothesis tests; expected mean squares.
    9. Orthogonality of hypotheses. Extension to generalised least squares. Multistratum experiments and random effects models.
    10. Expected mean squares for the split plot experiment. Least squares estimates for balanced factorial experiments. Averaging operators.
    11. Logistic regression; maximum likelihood estimation; inference for regression coefficient. Hypotheses concerning several parameters.
    12. Logistic regression: common odds rations for several 2x2 tables. Prospective and retrospective studies. Model fit and overdispersion.

  • Assessment

    The University's policy on Assessment for Coursework Programs is based on the following four principles:

    1. Assessment must encourage and reinforce learning.
    2. Assessment must enable robust and fair judgements about student performance.
    3. Assessment practices must be fair and equitable to students and give them the opportunity to demonstrate what they have learned.
    4. Assessment must maintain academic standards.

    Assessment Summary
    Component Assessment mode Week Due Weighting Learning
    Tutorials  Formative & summative 2,4,6,8,10,12 5% All
    Practicals  Formative & summative 1,3,5,7,9,11 5% All
    Assignments  Formative & summative Week Set: 1,3,5,7,9 3,5,7,9,11 20% All
    Examination Summative 70% All
    Assessment Related Requirements
    An aggregate final score of at least 50% is required to pass the course.
    Assessment Detail
    Attendance at five out of six tutorials will contribute 5% to the assessment for this course, and attendance at five out of six computing practicals will contribute 5% to the assessment for this course, for a total of 10%. Tutorials will be in the even weeks, commencing in Week 2. Computing practicals will be in the odd weeks, commencing in Week 1. If students are unable to attend classes owing to illness or compassionate reasons, please let the lecturer know.

    Assessment item Distributed    Week Due Weighting
    Assignment 1 1 3 4%
    Assignment 2 3 5 4%
    Assignment 3 5 7 4%
    Assignment 4 7 9 4%
    Assignment 5 9 11 4%
    Tutorials 2,4,6,8,10,12 5%
    Practicals 1,3,5,7,9,11 5%

    All written assignments are to be submitted to the designated hand-in boxes within the School of Mathematical Sciences with a signed cover sheet attached.

    Late assignments will not be accepted.

    Assignments will have a two week turn-around time for feedback to students.
    Course Grading

    Grades for your performance in this course will be awarded in accordance with the following scheme:

    M10 (Coursework Mark Scheme)
    Grade Mark Description
    FNS   Fail No Submission
    F 1-49 Fail
    P 50-64 Pass
    C 65-74 Credit
    D 75-84 Distinction
    HD 85-100 High Distinction
    CN   Continuing
    NFE   No Formal Examination
    RP   Result Pending

    Further details of the grades/results can be obtained from Examinations.

    Grade Descriptors are available which provide a general guide to the standard of work that is expected at each grade level. More information at Assessment for Coursework Programs.

    Final results for this course will be made available through Access Adelaide.

  • Student Feedback

    The University places a high priority on approaches to learning and teaching that enhance the student experience. Feedback is sought from students in a variety of ways including on-going engagement with staff, the use of online discussion boards and the use of Student Experience of Learning and Teaching (SELT) surveys as well as GOS surveys and Program reviews.

    SELTs are an important source of information to inform individual teaching practice, decisions about teaching duties, and course and program curriculum design. They enable the University to assess how effectively its learning environments and teaching practices facilitate student engagement and learning outcomes. Under the current SELT Policy ( course SELTs are mandated and must be conducted at the conclusion of each term/semester/trimester for every course offering. Feedback on issues raised through course SELT surveys is made available to enrolled students through various resources (e.g. MyUni). In addition aggregated course SELT data is available.

  • Student Support
  • Policies & Guidelines
  • Fraud Awareness

    Students are reminded that in order to maintain the academic integrity of all programs and courses, the university has a zero-tolerance approach to students offering money or significant value goods or services to any staff member who is involved in their teaching or assessment. Students offering lecturers or tutors or professional staff anything more than a small token of appreciation is totally unacceptable, in any circumstances. Staff members are obliged to report all such incidents to their supervisor/manager, who will refer them for action under the university's student’s disciplinary procedures.

The University of Adelaide is committed to regular reviews of the courses and programs it offers to students. The University of Adelaide therefore reserves the right to discontinue or vary programs and courses without notice. Please read the important information contained in the disclaimer.