Part I: Basic statistics, frequencies, histogram, z-scores.
A standard is merely something against which one measures something.
The Dausokele bridge over the Nett River is 3.81 kilometers from where I start my runs.
A run out to the east side of the bridge has been a standard distance of mine for the past seven years.
The times in the table are in minutes.
__________ What level of measurement is the data?
__________ Find the sample size n for the data.
__________ Find the minimum.
__________ Find the maximum.
__________ Find the range.
__________ Find the midrange.
__________ Find the median.
__________ Find the mode.
__________ Find the sample mean x.
__________ Find the sample standard deviation sx.
__________ Find the sample coefficient of variation CV.
__________ If this data were to be divided into five classes, what would be the width of a single class?
Determine the frequency and calculate the relative frequency using five bins. Record your results in the table provided.
Class upper limits
Frequency (f)
Rel. Freq. p(x)
Sum:
Sketch a histogram chart of the data anywhere it fits, labeling your horizontal axis and vertical axis as appropriate.
____________________ What is the shape of the distribution?
____________________ Use the sample mean x and standard deviation sx calculated above to determine the z-score for a 20 minute run to the far side of the Nett River bridge.
____________________ Is the z-score for a 20 minute run an ordinary or unusual z-score?
Part II: Linear Regression
Date
Time to or from NBFS
2/20/08
24.5
3/19/08
24.0
3/23/08
27.7
5/22/08
23.6
5/25/08
26.8
3/21/08
26.7
10/25/07
23.5
10/2/07
23.6
6/30/07
27.9
6/17/07
27.2
10/17/06
20.0
5/10/06
25.1
5/10/06
28.2
10/8/05
26.8
10/8/05
27.3
10/15/04
27.3
10/11/03
26.3
11/6/03
27.6
12/31/02
31.9
11/12/01
25.3
Again use the data spreadsheet to avoid data entry errors. Note that the dates are simply another number to a spreadsheet. All of the usual functions can be applied.
__________ Does the relationship appear to be linear, non-linear, or random?
__________ Calculate the slope of the linear regression line for the data. Be careful: use format:cells to increase the decimal places to five decimal places and record the slope to five decimal places.
__________ Calculate the y-intercept of the linear regression for the data. Two decimal places are sufficient for the y-intercept.
__________ Is the correlation positive, negative, or neutral?
__________ Am I faster, slower, or staying the same in terms of my time to Nett bridge?
__________ Determine the correlation coefficient r.
__________ What is the strength of the correlation?
__________ Determine the coefficient of determination r² .
__________ Toughie. What is my predicted time to Nett Bridge far side on 1/1/2009? Be careful: use your spreadsheet. Enter the date 1/1/2009 into a cell and calculate the slope-intercept equation using the spreadsheet. Use "point=and-click" to calculate the result. You can NOT get this one right using a calculator. Your answer should "make sense" - it should be a time that it would appear I could actually achieve!
Discuss whether predictions can or cannot be accurately made based on the correlation.