MS 150 Statistics spring 2008 mx • Name:

Pedometer
DaySteps
Sun3611
Mon10477
Tue7343
Wed8399
Thu15880
Fri6943
Sat1080
Sun1518
Mon11102
Tue3687
Wed9443
Thu9038
Fri3966
Sat6500

Part I: Basic statistics, frequencies, histogram, z-scores, normal distribution.

The pedometer data in the table was gathered by the instructor between 20 January and 02 February 2008

  1. __________ What level of measurement is the data?
  2. __________ Find the sample size n for the data.
  3. __________ Find the minimum.
  4. __________ Find the maximum.
  5. __________ Find the range.
  6. __________ Find the median.
  7. __________ Find the mode.
  8. __________ Find the sample mean x.
  9. __________ Find the sample standard deviation sx.
  10. __________ Find the sample coefficient of variation CV.
  11. __________ If this data were to be divided into five bins, what would be the width of a single bin?
  12. Determine the frequency and calculate the relative frequency using five bins. Record your results in the table provided.
Temperature bins (x)Frequency (f)Rel. Freq. p(x)
Sum:  
  1. Sketch a histogram chart of the data anywhere it fits, labeling your horizontal axis and vertical axis as appropriate.
  2. ____________________ What is the shape of the distribution?
  3. p(x < 4040) = ____________________ What is the probability I will walk 4040 steps or less per day?
  4. p(x > 9960) = ____________________ The goal of the 10000 step program is to walk 10000 steps per day. What is the probability that I will walk greater than 9960 steps or more per day?
  5. ____________________ Use the sample mean x and standard deviation sx calculated above to determine the z-score for a day with 10000 steps
  6. ____________________ Is the z-score for a 10000 step day an ordinary or unusual z-score?

Part II: Linear regression

pedometers

Imi HaleWHO
00
20062
278194
319164
655396

Pohnpei public health has been handing out pedometers provided by the Pacific Diabetes Education Program through a grant from Imi Hale. The FSM Department of Health has recently received pedometers from WHO. The WHO pedometers use a different mechanism and algorithm than the Imi Hale pedometers. Trials in which both pedometers were worn at the same time provided the data in the accompanying table.

  1. __________ Does the relationship appear to be linear, non-linear, or random?
  2. __________ Calculate the slope of the linear regression line for the data.
  3. __________ Calculate the y-intercept of the linear regression for the data.
  4. __________ Is the correlation positive, negative, or neutral?
  5. __________ Determine the correlation coefficient r.
  6. __________ What is the strength of the correlation?
  7. __________ Determine the coefficient of determination r² .
  8. __________ If the Imi Hale pedometer has recorded 450 steps, what is the predicted WHO pedometer step count?
  9. __________ If the WHO pedometer has recorded 250 steps, what is the predicted Imi Hale pedometer step count?
Table of basic statistical functions used by OpenOffice
Statistic or ParameterSymbolEquationsOpenOffice
Square root=SQRT(number)
Sample sizen=COUNT(data)
Minimum=MIN(data)
Maximum=MAX(data)
Median=MEDIAN(data)
Mode=MODE(data)
Sample meanx Σx/n =AVERAGE(data)
Sample standard deviationsx=STDEV(data)
Sample Coefficient of VariationCVsx/ x =STDEV(data)/AVERAGE(data)
Calculate a z value from an xz= z-score from a sample mean =STANDARDIZE(x;x;sx)
Linear regression functions used by OpenOffice.org
Slopeb=SLOPE(y data;x data)
Intercepta=INTERCEPT(y data;x data)
Correlationr=CORREL(y data;x data)
Coefficient of Determination =(CORREL(y data;x data))^2

zscores (3K)