0% found this document useful (0 votes)
22 views

1st Midterm - Spring 2019 OSL

The document presents a histogram showing the percentage of men who had heart attacks based on their birth weight, ranging from 5-8.5 lbs. It shows the likelihood of having a heart attack is greatest for lighter babies below 6 lbs, decreases for babies from 6-7 lbs, and increases slightly for the heaviest babies above 7.5 lbs. This suggests that a man's birth weight influences his chances of having a heart attack later in life, with lower and higher birth weights associated with higher risks.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views

1st Midterm - Spring 2019 OSL

The document presents a histogram showing the percentage of men who had heart attacks based on their birth weight, ranging from 5-8.5 lbs. It shows the likelihood of having a heart attack is greatest for lighter babies below 6 lbs, decreases for babies from 6-7 lbs, and increases slightly for the heaviest babies above 7.5 lbs. This suggests that a man's birth weight influences his chances of having a heart attack later in life, with lower and higher birth weights associated with higher risks.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 10

1.

A health magazine reported that a man’s weight at birth has a significant impact on the
chance that the man will suffer a heart attack during his life. A statistician analyzed a data set
for a sample of 798 men, and produced the histogram shown below. Determine how birth
weight influences the chances that a man will have a heart attack.

Percentage of men with heart attack

60.00%

50.00%

40.00%

30.00%

20.00%

10.00%

0.00%
5-5.5 5.5-6 6-6.5 6.5-7 7-7.5 7.5-8 8-8.5 8.5-9
Birth weight

The above pivot table shows counts (as percentages of row) of heart attack versus birth weight,
where birth weight has been grouped into categories. The percentages in each category with
heart attacks have then been plotted versus weight at birth as shown in the histogram. It appears
that the likelihood of a heart attack is greatest for light babies, and then decreases steadily, but
increases slightly for the heaviest babies.
2. The histogram below represents scores achieved by 250 job applicants on a personality profile.

Relative Frequency

0.30

0.20 0.20 0.20


0.20

0.10 0.10 0.10 0.10


0.10

0.00
0 10 20 30 40 50 60 70
56. What percentage of the job applicants scored between 30 and 40?

ANSWER:
10%

57. What percentage of the job applicants scored below 60?

ANSWER:
90%

58. How many job applicants scored between 10 and 30?

ANSWER:
100

59. How many job applicants scored above 50?

ANSWER:
50

60. Seventy percent of the job applicants scored above what value?

ANSWER:
20
61. Half of the job applicants scored below what value?

ANSWER:
30

62. What percentage of the applicants scored at most 50?

ANSWER:
80%

63. How many applicants scored between 10 and 50?

ANSWER:
175

TRUE / FALSE QUESTIONS

64. A frequency table indicates how many observations fall within each category, and a
histogram is its graphical analog.

ANSWER: T

65. Individual observations within each category may be found in a frequency table.

ANSWER: F

66. In the term “frequency table,” frequency refers to the number of data values falling
within each category.

ANSWER: T

67. A frequency table is a listing of the individual observations arranged in ascending or


descending order.
ANSWER: F

68. Both ordinal and nominal variables are categorical.

ANSWER: T

69. A variable is some characteristic of a population, while data are the observed values of a
variable based on a sample.

ANSWER: F

70. Age, height, and weight are examples of numerical data.

ANSWER: T

71. Categorical variables can be coded numerically or left uncoded.

ANSWER: T

72. Data can be categorized as cross-sectional or time series.

ANSWER: T

73. All nominal data may be treated as ordinal data.

ANSWER: F

74. Four different shapes of histograms are commonly observed: symmetric, positively
skewed, negatively skewed, and bimodal.

ANSWER: T

75. A histogram is skewed to the right (or positively skewed) if it has a single peak and the
values of the distribution extend much further to the left of the peak than to the right of
the peak.

ANSWER: F

76. A histogram is skewed to the left (or negatively skewed) if it has a single peak and the
values of the distribution extend much further to the right of the peak than to the left of
the peak.

ANSWER: F

77. A histogram is said to be symmetric if it has a single peak and looks approximately the
same to the left and right of the peak.
ANSWER: T

78. A skewed histogram is one with a long tail extending either to the right or left. The
former is called negatively skewed, and the later is called positively skewed.

ANSWER: F

79. Some histograms have two or more peaks. This is often an indication that the data come
from two or more distinct populations.

ANSWER: T

80. A bimodal histogram is one with two peaks equal in height.

ANSWER: F

81. Creating a histogram can be a tedious task, but an add-in such as StatPro makes it
relatively easy. However, you must be prepared to specify the categories.

ANSWER: T

82. The scatterplot is a graphical technique used to describe the relationship between two
numerical variables.

ANSWER: T
83. If we draw a straight line through the points in a scatterplot and most of the points fall
close to the line, we say that there is a strong positive linear relationship between the two
variables.

ANSWER: F

84. Time series data are often graphically depicted on a line chart, which is a plot of the
variable of interest over time.

ANSWER: T

85. Statisticians often refer to the pivot tables as contingency tables or crosstabs.

ANSWER: T

86. Numerical variables usually represent membership in groups or categories.

ANSWER: F
87. The time required to drive from Iowa City to Lansing is an example of a discrete random
variable.

ANSWER: F

88. The number of car insurance policy holders is an example of a discrete random variable

ANSWER: T

89. A population includes all elements or objects of interest in a study, whereas a sample is a
subset of the population used to gain insights into the characteristics of the population.

ANSWER: T

90. A variable (or field) is an attribute, or measurement, on members of a population,


whereas an observation (or case or record) is a list of all variable values for a single
member of a population.

ANSWER: T

91. A variable is usually listed in a row; an observation is usually listed in a column.

ANSWER: F

92. Phone numbers, Social Security numbers, and zip codes are examples of numerical
variables.

ANSWER: F

93. Cross-sectional data are data on a population at a distinct point in time, whereas time
series data are data collected across time.

ANSWER: T
56. If all values in a data set are negative, the value of the standard deviation may be either
positive or negative.
ANSWER: F

57. Assume that the histogram of a data set is symmetric and bell shaped, with mean of 72
and standard deviation of 10. Then, using the “rules of thumb”, we can say that 95% of
the data values were between 52 and 92.
ANSWER: T

58. If a histogram has a single peak and looks approximately the same to the left and right of
the peak, we should expect no difference in the values of the mean, median, and mode.
ANSWER: T

59. The mean is a measure of central location.


ANSWER: T

60. In a negatively skewed distribution, the mean is larger than the median and the median is
larger than the mode.
ANSWER: F

61. Since the population is always larger than the sample, the population mean is always
larger than the sample mean.
ANSWER: F

62. The length of the box in the boxplot portrays the interquartile range.
ANSWER: T

63. In a positively skewed distribution, the mean is smaller than the median and the median is
smaller than the mode.
ANSWER: F

64. The interquartile range is considered the weakest measure of central location.
ANSWER: F

65. the standard deviation always exceeds that of the variance.


ANSWER: F

66. The difference between the first and third quartiles is called the interquartile range.
ANSWER: T

67. The standard deviation is measured in original units, such as dollars and pounds.
ANSWER: T

68. The median is one of the most frequently used measures of variability.
ANSWER: F
69. Each of the covariance and correlation measures the strength and direction of a linear
relationship between two numerical variables.
ANSWER: T

70. Abby has been keeping track of what she spends to rent movies. The last seven week's
expenditures, in dollars, were 6, 4, 8, 9, 6, 12, and 4. The mean amount Abby spends on
renting movies is $7.
ANSWER: T

71. If two data sets have the same range, the smallest and largest observations in both sets
will be the same
ANSWER: F

72. The variance is a measure of the linear relationship between two variables
ANSWER: F

73. Generally speaking, if two variables are unrelated, the covariance will be a positive or
negative number close to zero
ANSWER: T

74. The correlation between two variables is a unitless quantity that is always between –1 and
+1.
ANSWER: T

75. The variance is the positive square root of the standard deviation.
ANSWER: F

76. It is possible that the data points are close to a curve and have a correlation close to 0,
because correlation is relevant only for measuring linear relationships.
ANSWER: T

77. Expressed in percentiles, the interquartile range is the difference between the 25 th and 75th
percentiles.
ANSWER: T

78. A data sample has a mode 0f 140, a median of 130, and a mean of 120. The distribution
of the data is positively skewed.
ANSWER: F

79. A student scores 85, 75, and 80 on three exams during the semester and 90 on the final
exam. If the final is weighted double and the three others weighted equally, the student's
final average would be 80.
ANSWER: F
80. Suppose that a sample of 10 observations has a standard deviation of 3, then the sum of
the squared deviations from the sample mean is 30.
ANSWER: F

81. The value of the mean times the number of observations equals the sum of all of the data
values.
ANSWER: T

82. The difference between the largest and smallest values in a data set is called the range.
ANSWER: T

83. There are four quartiles that divide the values in a data set into four equal parts.
ANSWER: F

2.7 The take home pay, (£), of 40 manual workers from a company for a particular week was:

482 492 599 512 540 544 520 518


546 640 494 465 512 558 528 584
582 494 550 544 540 594 536 520
560 525 600 490 514 454 654 575
490 560 522 600 570 528 480 563

a) Construct a grouped frequency distribution.


b) Draw a histogram.
c) Draw a frequency polygon.
d) Draw a percentage cumulative frequency ogive.

3 The profits of a group of small hotels in a consortium were organised into the following table:
Range of Profits No. of Hotels
less than £20 000 3
£20 000 and less than £30 000 5
£30 000 and less than £40 000 10
£40 000 and less than £50 000 21
£50 000 and less than £60 000 12
£60 000 and less than £100 000 9
a) Estimate the modal profit from the histogram drawn for question 3 in Supplementary
Exercise 2.
b) Estimate the median from the cumulative frequency diagram drawn the same question.
c) Estimate the quartile values and so calculate the interquartile range of the profits.
d) From the frequency data above calculate the mean profit and the standard deviation for
the profits of the whole hotel group.
. e) Estimate the percentage of the hotels which made more than £75 000 profit.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy