···11+\documentclass[a4paper]{article}
22+% To compile PDF run: latexmk -pdf {filename}.tex
33+% Math package
44+\usepackage{amsmath}
55+%enable \cref{...} and \Cref{...} instead of \ref: Type of reference included in the link
66+\usepackage[capitalise,nameinlink]{cleveref}
77+% Enable that parameters of \cref{}, \ref{}, \cite{}, ... are linked so that a reader can click on the number an jump to the target in the document
88+\usepackage{hyperref}
99+% UTF-8 encoding
1010+\usepackage[T1]{fontenc}
1111+\usepackage[utf8]{inputenc} %support umlauts in the input
1212+% Easier compilation
1313+\usepackage{bookmark}
1414+1515+\begin{document}
1616+ \title{Week 7 - Evidence and experiments}
1717+ \author{
1818+ Jai Bheeman \and Kelvin Davis \and Jip J. Dekker \and Nelson Frew \and Tony
1919+ Silvestere
2020+ }
2121+ \maketitle
2222+2323+ \section{Introduction} \label{sec:introduction}
2424+2525+ \section{Method} \label{sec:method}
2626+2727+ \section{Results} \label{sec:results}
2828+2929+ \section{Discussion} \label{sec:discussion}
3030+3131+\end{document}
···11+\documentclass[a4paper]{article}
22+% To compile PDF run: latexmk -pdf {filename}.tex
33+44+% Math package
55+\usepackage{amsmath}
66+%enable \cref{...} and \Cref{...} instead of \ref: Type of reference included in the link
77+\usepackage[capitalise,nameinlink]{cleveref}
88+% Enable that parameters of \cref{}, \ref{}, \cite{}, ... are linked so that a reader can click on the number an jump to the target in the document
99+\usepackage{hyperref}
1010+% UTF-8 encoding
1111+\usepackage[T1]{fontenc}
1212+\usepackage[utf8]{inputenc} %support umlauts in the input
1313+% Easier compilation
1414+\usepackage{bookmark}
1515+1616+\begin{document}
1717+ \title{Week 8 - Quantitative data analysis}
1818+ \author{
1919+ Jai Bheeman \and Kelvin Davis \and Jip J. Dekker \and Nelson Frew \and Tony
2020+ Silvestere
2121+ }
2222+ \maketitle
2323+2424+ \section{Introduction} \label{sec:introduction}
2525+2626+ \section{Method} \label{sec:method}
2727+2828+ \section{Results} \label{sec:results}
2929+3030+ \section{Discussion} \label{sec:discussion}
3131+3232+\end{document}
wk9/Tennis players 2017-09 final.xlsx
This is a binary file and will not be displayed.
wk9/pearson.png
This is a binary file and will not be displayed.
wk9/spearman.png
This is a binary file and will not be displayed.
+113
wk9/week9.tex
···11+\documentclass[a4paper]{article}
22+% To compile PDF run: latexmk -pdf {filename}.tex
33+44+% Math package
55+\usepackage{amsmath}
66+%enable \cref{...} and \Cref{...} instead of \ref: Type of reference included in the link
77+\usepackage[capitalise,nameinlink]{cleveref}
88+% Enable that parameters of \cref{}, \ref{}, \cite{}, ... are linked so that a reader can click on the number an jump to the target in the document
99+\usepackage{hyperref}
1010+% UTF-8 encoding
1111+\usepackage[T1]{fontenc}
1212+\usepackage[utf8]{inputenc} %support umlauts in the input
1313+% Easier compilation
1414+\usepackage{bookmark}
1515+\usepackage{graphicx}
1616+1717+\begin{document}
1818+ \title{Week 9 - Correlation and Regression}
1919+ \author{
2020+ Jai Bheeman \and Kelvin Davis \and Jip J. Dekker \and Nelson Frew \and Tony
2121+ Silvestere
2222+ }
2323+ \maketitle
2424+2525+ \section{Introduction} \label{sec:introduction}
2626+ We present a report on the relationship between the heights and weights of the
2727+ top tennis players as catalogued in provided data. We use statistical analysis
2828+ techniques to numerically describe the characteristics of the data, to see how
2929+ trends are exhibited within the data set. We conclude the report with a brief
3030+ discussion of the implications of the analysis and provide insights on
3131+ potential correlations that may exist.
3232+3333+ \section{Method} \label{sec:method}
3434+ Provided with a set of 132 unique records of the top 200 male tennis players,
3535+ we sought to investigate the relationship between the height of particular
3636+ individuals with their respective weights. We conducted basic statistical
3737+ correlation analyses of the two variables with both Pearson's and Spearman's
3838+ correlation coefficients to achieve this. Further, to understand the
3939+ correlations more deeply, we carried out these correlation tests on the full
4040+ population of cleaned data (removed duplicates etc), alongside several random
4141+ samples and samples of ranking ranges within the top 200. To this end, we made
4242+ use of Microsoft Excel tools and functions of the Python library SciPy.
4343+4444+ We specifically have made use of these separate statistical analysis tools in the
4545+ interest of sanity checking our findings. To do this, we simply replicated the
4646+ correlation tests within other software environments.
4747+4848+ \section{Results} \label{sec:results}
4949+ We performed separate statistical analyses on 10 different samples of the
5050+ population, as well as the population itself. This included 11 separate
5151+ subsets of the rankings:
5252+ \begin{itemize}
5353+ \item The top 20 entries
5454+ \item The middle 20 entries
5555+ \item The bottom 20 entries
5656+ \item The top 50 entries
5757+ \item The bottom 50 entries
5858+ \item 5 randomly chosen sets of 20 entries
5959+ \end{itemize}
6060+\vspace{1em}
6161+ Table \ref{tab:excel_results} shows the the results for the conducted tests.
6262+6363+ \begin{table}[ht]
6464+ \centering
6565+ \label{tab:excel_results}
6666+ \begin{tabular}{|l|r|r|}
6767+ \hline
6868+ \textbf{Test Set} & \textbf{Pearson's Coefficient} & \textbf{Spearman's Coefficient} \\
6969+ \hline
7070+ \textbf{Full Population} & 0.77953 & 0.73925 \\
7171+ \textbf{Top 20} & 0.80743 & 0.80345 \\
7272+ \textbf{Middle 20} & 0.54134 & 0.36565 \\
7373+ \textbf{Bottom 20} & 0.84046 & 0.88172 \\
7474+ \textbf{Top 50} & 0.80072 & 0.78979 \\
7575+ \textbf{Bottom 50} & 0.84237 & 0.81355 \\
7676+ \textbf{Random Set \#1} & 0.84243 & 0.80237 \\
7777+ \textbf{Random Set \#2} & 0.56564 & 0.58714 \\
7878+ \textbf{Random Set \#3} & 0.59223 & 0.63662 \\
7979+ \textbf{Random Set \#4} & 0.65091 & 0.58471 \\
8080+ \textbf{Random Set \#5} & 0.86203 & 0.77832
8181+ \\ \hline
8282+ \end{tabular}
8383+ \caption{Table showing the correlation coefficients between height and
8484+ weight using different test sets. All data is rounded to 5 decimal
8585+ places}
8686+ \end{table}
8787+8888+ \begin{figure}[ht]
8989+ \centering
9090+ \label{fig:scipy}
9191+ \includegraphics[width=0.6\textwidth]{pearson.png}
9292+ \includegraphics[width=0.6\textwidth]{spearman.png}
9393+ \caption{The Pearsion (top) and Spearman (bottom) correlations coefficients
9494+ of the data set as computed by the Pandas Python library}
9595+ \end{figure}
9696+9797+ \section{Discussion} \label{sec:discussion}
9898+ The results generally indicate that there is a fairly strong positive
9999+ correlation between the weight and weight of an individual tennis player,
100100+ within the top 200 male players. The population maintains a strong positive
101101+ correlation with both Pearson's and Spearman's correlation coefficients,
102102+ indicating that a relationship may exist. Our population samples show
103103+ promising consistency with this, with 6 seperate samples having values above
104104+ 0.6 with both techniques. The sample taken from the middle 20 players,
105105+ however, shows a relatively weaker correlation compared with the top 20 and
106106+ middle 20, which provides some insight into the distribution of the strongest
107107+ correlated heights and weights amongst the rankings. All five random samples
108108+ of 20 taken from the population indicate however that there does appear to be
109109+ a consistent trend through the population, which corresponds accurately with
110110+ the coefficients on the general population.
111111+112112+113113+\end{document}