|
We discuss the differing management requirements of data and code.
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Language
English Date Created
February 7th, 2021
|
|
We will think about the pros and cons of notebooks versus programs
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Language
English Date Created
February 7th, 2021
|
|
We discuss reproducible research, the motivation for aspects of software engineering practices in data science.
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Language
English Date Created
February 7th, 2021
|
|
We discuss the issue of statistical versus practical significance, think a little about the ethics of A/B testing, and situate A/B testing in the wider class of statistical inference from 2 samples.
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial Language
English Date Created
January 28th, 2021
|
|
We work through how to apply large-sample theory to A/B testing.
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial Language
English Date Created
January 28th, 2021
|
|
We look at how to obtain more precise and estimates in A/B testing - and how to avoid some pitfalls.
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial Language
English Date Created
January 28th, 2021
|
|
We explain the principle of A/B testing, and how to derive confidence intervals for the difference in population proportions using the bootstrap.
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial Language
English Date Created
January 28th, 2021
|
|
We introduce the principle of maximum likelihood, and show how to derive the likelihood as a function of the coefficients for logistic regression. We also mention one topical application of logistic…
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial No Derivatives Language
English Date Created
January 20th, 2021
|
|
We show how to make a logistic regression classifier, and, in the context of ethics, consider how transparent logistic regression can be made to people whose lives it affects. We also compare…
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial No Derivatives Language
English Date Created
January 20th, 2021
|
|
We extend logistic regression to multiple independent variables, and show how we can use the boostrap to estimate confidence intervals for coefficients, and to test hypotheses.
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial No Derivatives Language
English Date Created
January 20th, 2021
|
|
We look at the meaning of logistic regression coefficients, relating them to odds, log odds and odds ratios.
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial No Derivatives Language
English Date Created
January 19th, 2021
|
|
We introduce the principle of logisitc regression, illustrating its application to a credit approval dataset. We also introduce the concepts of odds and odds ratios.
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial No Derivatives Language
English Date Created
January 19th, 2021
|
|
We consider 3 issues in hypothesis testing: Type I and Type II errors; cherry-picking, p-value hacking etc., and a short discussion about the possibility that hypothesis-driven research can hinder…
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial No Derivatives Language
English Date Created
January 18th, 2021 Retain Source File
Yes
|
|
We'll extend the previous example to a one-way contingency table with multiple categories, in which the chi-squared "goodness-of-fit" statistic is used. We then extend it further to a…
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial Language
English Date Created
January 17th, 2021
|
|
We introduce P-values, an important but tricky concept in hypothesis testing
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial Language
English Date Created
January 17th, 2021
|
|
We introduce the principle of hypothesis testing
Course Code
INFR08030 Publisher
David Sterratt Licence Type
Creative Commons - Attribution Non Commercial Language
English Date Created
January 17th, 2021
|