Prob Worksheet 3
Prob Worksheet 3
Worksheet-ch5-9
   First, let's list the sample space of two dice throws. Each outcome is an ordered pair (d1
   ,d2) where d1 is the result of the first die and d2 is the result of the second die. There are
   6×6=36 possible outcomes.
       o   Event E: The sum of the dice is odd. The sum is odd if one die is odd and the
           other is even.
           E={(1,2),(1,4),(1,6),(2,1),(2,3),(2,5),(3,2),(3,4),(3,6),(4,1),(4,3),(4,5),(5,2),(5,4),(5
           ,6),(6,1),(6,3),(6,5)}
       o   Event F: The first die lands on 1. F={(1,1),(1,2),(1,3),(1,4),(1,5),(1,6)}
       o   Event G: The sum is 7. G={(1,6),(2,5),(3,4),(4,3),(5,2),(6,1)}
       o   EF (E and F): The intersection of E and F. The sum is odd AND the first die is 1.
           EF=E∩F={(1,2),(1,4),(1,6)}
       o   E U F (E or F): The union of E and F. The sum is odd OR the first die is 1.
           E∪F=E+F−(E∩F)
           E∪F={(1,1),(1,2),(1,3),(1,4),(1,5),(1,6),(2,1),(2,3),(2,5),(3,2),(3,4),(3,6),(4,1),(4,3)
           ,(4,5),(5,2),(5,4),(5,6),(6,1),(6,3),(6,5)}
       o   FG (F and G): The intersection of F and G. The first die is 1 AND the sum is 7.
           FG=F∩G={(1,6)}
       o   EFc (E and not F): The intersection of E and the complement of F. The sum is
           odd AND the first die is NOT 1. First, find Fc (complement of F): The first die is
           NOT 1. Fc={(d1,d2)∣d1∈{2,3,4,5,6},d2∈{1,2,3,4,5,6}}
           EFc=E∩Fc={(2,1),(2,3),(2,5),(3,2),(3,4),(3,6),(4,1),(4,3),(4,5),(5,2),(5,4),(5,6),(6,
           1),(6,3),(6,5)}
       o   EFG (E and F and G): The intersection of E, F, and G. The sum is odd AND the
           first die is 1 AND the sum is 7. From EF={(1,2),(1,4),(1,6)}. Now intersect with
           G. EFG=(E∩F)∩G={(1,2),(1,4),(1,6)}∩{(1,6),(2,5),(3,4),(4,3),(5,2),(6,1)}
           EFG={(1,6)}
5. A class in probability theory consists of 6 men and 4 women. An exam is given and
   the students are ranked according to their performance. Assuming that no two
   students obtain the same score,
       o a) how many different rankings are possible? There are a total of 6+4=10
          students. Since no two students obtain the same score, every permutation of these
          10 students represents a unique ranking. The number of different rankings is the
          number of permutations of 10 distinct items. Number of rankings =
          10!=10×9×8×7×6×5×4×3×2×1=3,628,800
       o b) If all rankings are considered equally likely, what is the probability that
          women receive the top 4 scores? For women to receive the top 4 scores, the first
          4 positions in the ranking must be occupied by the 4 women. The number of ways
          to arrange the 4 women in the top 4 positions is 4!. The number of ways to
          arrange the remaining 6 men in the remaining 6 positions is 6!. Number of
          favorable rankings = (ways to arrange 4 women in top 4 positions) × (ways to
          arrange 6 men in remaining 6 positions) Number of favorable rankings =
          4!×6!=(4×3×2×1)×(6×5×4×3×2×1) =24×720=17,280
          The total number of possible rankings is 10! (from part a). Probability that women
          receive the top 4 scores = (Number of favorable rankings) / (Total number of
          rankings) P(women get top 4 scores)=10!4!×6!=3,628,80017,280=2101 (This can
          also be simplified as 10×9×8×7×6!4!×6!=10×9×8×74!=504024=2101)
          The event that the system works is the union of these two sets: System works
          outcomes = {(1,1,0,0),(1,1,0,1),(1,1,1,0),(1,1,1,1),(0,0,1,1),(0,1,1,1),(1,0,1,1)}
          (Note: (1,1,1,1) is common to both and is listed only once.)
       o c) Let E be the event that components 1 and 3 are both failed. How many
         outcomes are contained in event E? If components 1 and 3 are both failed, then
         x1=0 and x3=0. The states of components 2 (x2) and 4 (x4) can be either 0 or 1.
         Possible outcomes for x2: 0, 1 (2 options) Possible outcomes for x4: 0, 1 (2
         options) The outcomes in event E are of the form (0,x2,0,x4). Number of
         outcomes in E = 1×2×1×2=4 The outcomes are:
         (0,0,0,0),(0,0,0,1),(0,1,0,0),(0,1,0,1)
7. A group of 5 boys and 10 girls is lined up in random order that is, each of the 15!
   permutations is assumed to be equally likely.
There are a total of 5+10=15 people. The total number of permutations is 15!.
       o   a) What is the probability that the person in the 4th position is a boy? To
           calculate this, imagine fixing a boy in the 4th position. Number of choices for the
           4th position: 5 (any of the 5 boys). Number of ways to arrange the remaining 14
           people in the remaining 14 positions: 14!. Number of favorable permutations =
           5×14! Probability =
           Total number of permutationsNumber of favorable permutations=15!5×14!
           =15×14!5×14!=155=31
       o b) What about the person in the 12th position? This is analogous to part (a).
           The specific position doesn't change the probability, assuming random order.
           Number of choices for the 12th position: 5 (any of the 5 boys). Number of ways
           to arrange the remaining 14 people in the remaining 14 positions: 14!. Probability
           = 15!5×14!=155=31
       o c) What is the probability that a particular boy is in the 3rd position? Let's
           pick a specific boy, say Boy A. Number of choices for the 3rd position: 1 (Boy
           A). Number of ways to arrange the remaining 14 people in the remaining 14
           positions: 14!. Number of favorable permutations = 1×14! Probability = 15!1×14!
           =151
8. A committee of size 5 is to be selected from a group of 6 men and 9 women. If the
   selection is made randomly, what is the probability that the committee consists of 3
   men and 2 women?
   Let F be the event that the student is female. Let C be the event that the student is
   majoring in computer science.
Table of salaries:
   | Wife | Husband < $25,000 | Husband > $25,000 | Total (Husband) | | :--------- | :-----------
   ------ | :----------------- | :-------------- | | < $25,000 | 212 | 198 | 410 | | > $25,000 | 36 | 54 |
   90 | | Total (Wife) | 248 | 252 | 500 |
   Let H<25 be the event that the husband earns less than $25,000. Let H>25 be the event
   that the husband earns more than $25,000. Let W<25 be the event that the wife earns less
   than $25,000. Let W>25 be the event that the wife earns more than $25,000.
       o   a) The probability that the husband earns less than $25,000; Total couples =
           500. Number of couples where husband earns < $25,000 = 212 + 36 = 248.
           P(H<25)=Total couplesNumber of H<25 couples=500248=12562=0.496
       o   b) The probability that the wife earns more than $25,000 given that the
           husband earns more than this amount; We need to find P(W>25 | H>25).
           Number of couples where husband earns > $25,000 = 198 + 54 = 252. (This is the
           new sample space for the conditional probability). Among these 252 couples, the
           number where the wife earns > $25,000 is 54.
           P(W>25 | H>25)=Number of H>25Number of (W>25 and H>25)=25254 To
           simplify 25254: Divide by 2: 12627 Divide by 9: 143 So, P(W>25 | H>25)=143
           ≈0.2143
       o   c) The conditional probability that the wife earns more than $25,000 given
           that the husband earns less than this amount? We need to find
           P(W>25 | H<25). Number of couples where husband earns < $25,000 = 212 + 36
           = 248. (This is the new sample space). Among these 248 couples, the number
           where the wife earns > $25,000 is 36.
           P(W>25 | H<25)=Number of H<25Number of (W>25 and H<25)=24836 To
           simplify 24836: Divide by 4: 629 So, P(W>25 | H<25)=629≈0.1452
11. A man is known to speak the truth in 2 out of 3 times. He throws a balanced die and
    reported that the result is five. What is the probability that it was actually five?
   Let T be the event that the man speaks the truth. P(T)=32. Let F be the event that the man
   lies. P(F)=1−P(T)=1−32=31.
   Let R5 be the event that the man reported the result is five. Let A5 be the event that the
   actual result on the die is five. Let not A5 be the event that the actual result on the die is
   not five.
   For a balanced die, the probability of rolling a five is P(A5)=61. The probability of not
   rolling a five is P(not A5)=1−61=65.
   First, let's find P(R5∣A5): This means the die was actually five, and the man reported
   five. This happens if he tells the truth. P(R5∣A5)=P(T)=32.
   Next, let's find P(R5∣not A5): This means the die was not five, but the man reported five.
   This happens if he lies. If he lies when the die is not five, he reports one of the other 5
   numbers (1, 2, 3, 4, 6). Assuming he randomly picks a false number, the probability he
   reports "five" is 51. So,
   P(R5∣not A5)=P(F)×(probability of reporting 5 given he lies and it’s not 5)=31×51=151.
           Let's check condition 1: For x∈[0,1], x≥0 and (1−x)≥0. Therefore, 6x(1−x)≥0. For
           x outside [0,1], f(x)=0, which is ≥0. So, condition 1 is satisfied.
           Let's check condition 2: ∫−∞∞f(x)dx=∫016x(1−x)dx =∫01(6x−6x2)dx
           =[3x2−2x3]01 =(3(1)2−2(1)3)−(3(0)2−2(0)3) =(3−2)−0=1 So, condition 2 is
           satisfied.
Since both conditions are met, yes, f(x) is a probability density function.
15. The annual rainfall (in inches) in a certain region is normally distributed with μ=40,
    σ=4. What is the probability that in 2 of the next 4 years the rainfall will exceed 50
    inches? Assume that the rainfalls in different years are independent.
   Let R be the annual rainfall. R∼N(μ=40,σ=4). First, find the probability that the rainfall
   in a single year will exceed 50 inches. P(R>50)=P(Z>σ50−μ) P(R>50)=P(Z>450−40
   )=P(Z>410)=P(Z>2.5)
   Now, we are interested in the probability that in 2 of the next 4 years the rainfall will
   exceed 50 inches. This is a binomial probability problem, where: n=4 (number of
   trials/years) k=2 (number of successes/years with rainfall > 50 inches) p=0.0062
   (probability of success in a single trial) The binomial probability formula is P(X=k)=(kn
   )pk(1−p)n−k.
   P(X=2)=(24)(0.0062)2(1−0.0062)4−2 P(X=2)=2!2!4!(0.0062)2(0.9938)2
   P(X=2)=6×(0.00003844)×(0.98763844) P(X=2)≈6×3.844×10−5×0.9876
   P(X=2)≈0.000227
16. If each voter is for Proposition A with probability .7, what is the probability that
    exactly 7 of 10 voters are for this proposition?
   This is a binomial probability problem. n=10 (total number of voters) k=7 (number of
   voters for Proposition A) p=0.7 (probability that a single voter is for Proposition A)
   1−p=1−0.7=0.3 (probability that a single voter is not for Proposition A)
P(X=7)=120×0.0823543×0.027 P(X=7)≈0.2668
17. Suppose that the average number of accidents occurring weekly on a particular
    stretch of a highway equals 3. Calculate the probability that there is at least one
    accident this week.
   This is a Poisson distribution problem, as it deals with the number of events (accidents)
   occurring in a fixed interval of time (weekly) at a given average rate. The average
   number of accidents, λ=3 per week. The Poisson probability mass function is
   P(X=k)=k!e−λλk.
   We want to find the probability of at least one accident this week, i.e., P(X≥1). It's easier
   to calculate P(X≥1) as 1−P(X=0). P(X=0)=0!e−330=1e−3×1=e−3
   e−3≈0.049787 P(X≥1)=1−e−3=1−0.049787=0.950213
18. It is known that bacteria of a certain kind occur in water at a rate 2 bacteria per
    cubic centimeter of water. Assuming that the phenomenon obeys a Poisson
    probability law, what is the probability that a sample of two cube centimeters of
    water will contain
   The rate of bacteria is 2 bacteria per cubic centimeter. We are considering a sample of
   two cubic centimeters. So, the average number of bacteria in a 2 cm$^3$ sample,
   λ=2 bacteria/cm3×2 cm3=4 bacteria. This is a Poisson distribution with λ=4.
   This problem seems to be missing information required to directly calculate the expected
   payment. The problem states that "on the average 10 students receive marks of at least
   75% in one or more of these classes". However, the scholarship is awarded to students
   who achieve at least 75% in all three courses. Without knowing the probability or
   average number of students who achieve marks of at least 75% in all three courses, we
   cannot determine the expected payment.
   If we assume (which is not stated in the problem and would be an inference) that the "10
   students who receive marks of at least 75% in one or more classes" is somehow related to
   the number of students getting 75% in all three classes, it would be an unjustified leap.
   Self-correction: If the problem implicitly means that on average 10 students get at least
   75% in each of the classes independently and we are to infer a proportion, that's not
   clearly stated. If the "10 students receive marks of at least 75% in one or more of these
   classes" meant the intersection (all three), then the answer would be
   10 students×$500/student=$5000. But "one or more" implies the union.
20. If the average number of claims handled daily by an insurance company is 5, what
    proportion of days have less than 3 claims? What is the probability that there will
   be 4 claims in exactly 3 of the next 5 days? Assume that the number of claims on
   different days is independent.
   This involves two parts, both using the Poisson distribution for the number of claims
   daily, and then a binomial distribution for the "days" part.
   Part 1: Proportion of days with less than 3 claims Let X be the number of claims on a
   given day. X∼Poisson(λ=5). We need to find P(X<3)=P(X=0)+P(X=1)+P(X=2).
   P(X=k)=k!e−λλk
   P(X=0)=0!e−550=e−5≈0.006738 P(X=1)=1!e−551=5e−5≈5×0.006738=0.03369
   P(X=2)=2!e−552=225e−5≈12.5×0.006738=0.084225
   Part 2: Probability that there will be 4 claims in exactly 3 of the next 5 days First,
   find the probability of a single day having exactly 4 claims (P(X=4)). P(X=4)=4!e−554
   =24e−5×625 P(X=4)=24625×0.006738≈244.21125≈0.17546875 Let
   p′=P(X=4)≈0.17546875.
   Now, we are looking at the next 5 days. This is a binomial experiment: n=5 (number of
   days) k=3 (number of days with 4 claims) p′≈0.17546875 (probability of success - 4
   claims - on a single day) 1−p′≈1−0.17546875=0.82453125
21. It is known that disks produced by a certain company will be defective with
    probability .01 independently of each other. The company sells the disks in packages
    of 10 and offers a money-back guarantee that at most 1 of the 10 disks is defective.
    What proportion of packages is returned? If someone buys three packages, what is
    the probability that exactly one of them will be returned?
   Let p=0.01 be the probability that a single disk is defective. Let n=10 be the number of
   disks in a package. Let X be the number of defective disks in a package.
   X∼Binomial(n=10,p=0.01).
   P(X≤1)=P(X=0)+P(X=1)≈0.90438+0.09135=0.99573
   P(package returned)=1−P(X≤1)≈1−0.99573=0.00427 So, approximately 0.427% of
   packages are returned.
   Part 2: If someone buys three packages, what is the probability that exactly one of
   them will be returned? This is another binomial problem. Let Y be the number of
   returned packages. N=3 (total packages bought) K=1 (exactly one returned package)
   Preturn=P(package returned)≈0.00427 (from Part 1)
   P(Y=1)=(13)(Preturn)1(1−Preturn)3−1 P(Y=1)=3×(0.00427)×(1−0.00427)2
   P(Y=1)=3×0.00427×(0.99573)2 P(Y=1)≈3×0.00427×0.99147
   P(Y=1)≈0.01281×0.99147≈0.01269
   So, the probability that exactly one of the three packages will be returned is
   approximately 0.01269.
22. Women's shoes are manufactured in sizes 2, 3, 4, ..., 8. Size 5 is suitable for a foot of
    length ranging from 9.25 inches to 9.5 inches. If length of women's foot is normally
    distributed with mean 9.4 inches and standard deviation 0.25 inches, how many
    pairs of size 5 are required out of every 10,000 pairs manufactured?
   Let L be the length of a woman's foot. L∼N(μ=9.4,σ=0.25). Size 5 is suitable for lengths
   between 9.25 and 9.5 inches. We need to find P(9.25<L<9.5).
P(9.25<L<9.5)=0.6554−0.2743=0.3811
   This is the proportion of women who would fit size 5. Out of every 10,000 pairs
   manufactured, the number of pairs required for size 5 would be: Number of pairs =
   0.3811×10,000=3811 pairs.
        o   b) Find the probability that the mean of a sample of size 36 will be within 10
            units of the population mean, that is, between 118 and 138. We need to find
            P(118<Xˉ<138). Since the sample size n=36 is large (typically n≥30), by the
            Central Limit Theorem, the sampling distribution of the sample mean Xˉ can be
            approximated by a normal distribution, regardless of the shape of the population
            distribution. So, Xˉ∼N(μXˉ=128,σXˉ=11/3).
P(118<Xˉ<138)≈0.9968−0.0032=0.9936
24. Suppose the mean length of time that a caller is placed on hold when telephoning a
    customer service center is 23.8 seconds, with standard deviation 4.6 seconds. Find
    the probability that the mean length of time on hold in a sample of 1,200 calls will be
    within 0.5 second of the population mean.
   Population mean μ=23.8 seconds. Population standard deviation σ=4.6 seconds. Sample
   size n=1200 calls.
   We need to find the probability that the sample mean Xˉ is within 0.5 seconds of the
   population mean. This means: 23.8−0.5<Xˉ<23.8+0.5 23.3<Xˉ<24.3
First, calculate the mean and standard deviation of the sample means: E(Xˉ)=μ=23.8 σXˉ
   Since n=1200 is a large sample, by the Central Limit Theorem, the distribution of Xˉ is
   approximately normal. Now, convert the values to Z-scores: For Xˉ=23.3: Z1
   =0.1327923.3−23.8=0.13279−0.5≈−3.765 For Xˉ=24.3: Z2=0.1327924.3−23.8
   =0.132790.5≈3.765
   We need to find P(−3.765<Z<3.765). P(−3.765<Z<3.765)=P(Z<3.765)−P(Z<−3.765)
   Using a Z-table or calculator, P(Z<3.765)≈0.99992
   P(Z<−3.765)=1−P(Z<3.765)≈1−0.99992=0.00008
25. Of a large group of men, 5% are less than 60 inches in height and 40% are between
    60 & 65 inches. Assuming a normal distribution, find the mean and standard
    deviation of heights.
Given information:
      1. P(H<60)=0.05
      2. P(60<H<65)=0.40
   From (1), convert to Z-score: P(Z<σ60−μ)=0.05. Looking up 0.05 in the Z-table (or 1-
   0.05 = 0.95 and finding -Z value), the Z-score for 0.05 is approximately -1.645. So,
   σ60−μ=−1.645 (Equation 1) 60−μ=−1.645σ⇒μ=60+1.645σ
      3. μ=60+1.645σ
      4. μ=65+0.125σ
   So, the mean height is approximately 65.41 inches and the standard deviation is
   approximately 3.29 inches.
26. Let X be normally distributed with mean 3 and variance 9 and Y=1−2X
P(−1<X<2)≈0.3707−0.0918=0.2789.
            So, P(X>3 AND Y<2) means P(X>3 AND X>−0.5). If X>3, it automatically
            satisfies X>−0.5. Therefore, P(X>3 AND Y<2)=P(X>3).
   Given: Population standard deviation σ=0.1 mm Sample size n=50 Sample mean xˉ=5.11
   mm Confidence level = 95%
   Since the population standard deviation σ is known and the sample size n≥30, we use the
   Z-distribution to construct the confidence interval. The formula for a confidence interval
   For a 95% confidence level, α=1−0.95=0.05. So, α/2=0.025. Zα/2=Z0.025. Look up the
   Z-score that leaves 0.025 in the upper tail (or 0.975 in the lower tail). This is Z0.025
   =1.96.
Now, calculate the margin of error: Margin of Error (ME) = Zα/2n σ=1.96×50
28. In a random sample of 250 employed people, 61 said that they bring work home
    with them at least occasionally.
Sample size n=250. Number of successes (people who bring work home) x=61.
      o   a. Give a point estimate of the proportion of all employed people who bring
          work home with them at least occasionally. The point estimate of the
          population proportion (p) is the sample proportion (p^). p^=nx=25061=0.244
      o   b. Construct a 99% confidence interval for that proportion. For a confidence
          interval for a proportion, since np^=61≥10 and n(1−p^)=250−61=189≥10, we can
          use the normal approximation. The formula for a confidence interval for a
=0.000737856 ≈0.02716
          The 99% confidence interval for the proportion of all employed people who bring
          work home is (0.174,0.314) or (17.4%,31.4%).
29. The amount of a particular biochemical substance related to bone breakdown was
    measured in 30 healthy women. The sample mean and standard deviation were 3.3
    nanograms per milliliter (ng/mL) and 1.4 ng/mL. Construct an 80% confidence
    interval for the mean level of this substance in all healthy women.
   Given: Sample size n=30 Sample mean xˉ=3.3 ng/mL Sample standard deviation s=1.4
   ng/mL Confidence level = 80%
   Since the population standard deviation is unknown and the sample size is n=30 (which is
   generally considered large enough for the Central Limit Theorem to apply, but also just at
   the threshold where t-distribution is preferred when σ is unknown), we use the t-
   distribution for constructing the confidence interval. Degrees of freedom
   df=n−1=30−1=29.
   For an 80% confidence level, α=1−0.80=0.20. So, α/2=0.10. We need to find tα/2,df
   =t0.10,29. From a t-distribution table, t0.10,29≈1.311.
The formula for a confidence interval for the population mean when σ is unknown is:
xˉ±tα/2,dfn s
   Calculate the standard error of the mean: SExˉ=n        s=30    1.4 30      ≈5.477 SExˉ
   =5.4771.4≈0.2556
   The 80% confidence interval for the mean level of this substance in all healthy women is
   (2.9649 ng/mL,3.6351 ng/mL).
30. A physical therapist studying muscular strength assumed muscle strength scores are
    normally distributed with σ=12. A sample of 15 individuals demonstrates a mean
    muscular strength score of 84.3. Calculate a 95% confidence interval for μ.
    Interpret the meaning of this confidence interval.
   Given: Population standard deviation σ=12 Sample size n=15 Sample mean xˉ=84.3
   Confidence level = 95%
   Since the population standard deviation σ is known, we use the Z-distribution to construct
   the confidence interval, even though the sample size is small (n<30). The problem also
   states that scores are normally distributed. The formula for a confidence interval for the
   Calculate the standard error of the mean: SExˉ=n        σ=15    12 15       ≈3.873 SExˉ
   =3.87312≈3.098
   Interpretation of the confidence interval: We are 95% confident that the true average
   muscular strength score for all individuals in the population falls between 78.228 and
   90.372. This means that if we were to take many samples and construct a 95% confidence
   interval for each, about 95% of these intervals would contain the true population mean.
31. Last year the government made a claim that the average income of the American
    people was $33,950. However, a sample of 50 people taken recently showed an
    average income of $34,076 with a population standard deviation of $324. Is the
    government's estimate too low? Conduct a significance test to see if the true mean is
    more than the reported average. Use an alpha =0.01.
   This is a hypothesis testing problem for a population mean, with known population
   standard deviation.
   Given: Claimed population mean (from government) μ0=$33,950 Sample size n=50
   Sample mean xˉ=$34,076 Population standard deviation σ=$324 Significance level
   α=0.01
   1. Formulate Hypotheses: Null Hypothesis H0: The true mean income is equal to or less
   than the government's claim. H0:μ≤33,950 Alternative Hypothesis HA: The true mean
   income is more than the government's claim. (This addresses "Is the government's
   estimate too low?") HA:μ>33,950 This is a one-tailed (right-tailed) test.
2. Choose the Test Statistic: Since σ is known and n≥30, we use the Z-test for the mean.
   4. Determine the Critical Value: For a one-tailed (right-tailed) test with α=0.01, we
   need to find Zα=Z0.01. Look up the Z-score that leaves 0.01 in the upper tail (or 0.99 in
   the lower tail). Z0.01≈2.33.
   5. Make a Decision: Compare the calculated Z-statistic to the critical value. Calculated
   Z=2.749 Critical value Zcritical=2.33 Since 2.749>2.33, the calculated Z-statistic falls in
   the rejection region.
   6. Conclusion: Reject the null hypothesis (H0). There is sufficient evidence at the 0.01
   significance level to conclude that the true mean income of the American people is
   significantly more than $33,950, suggesting that the government's estimate was too low.
32. A company sells exercise clothing and equipment on the Internet. To design the
    clothing, the company collects data on the physical characteristics of different types
    of customers. It takes a sample of 24 male runners found a mean weight of 61.79
    kilograms. Assume that the population standard deviation of weight is σ=4.5
    kilograms.
   Given: Sample size n=24 Sample mean xˉ=61.79 kg Population standard deviation σ=4.5
   kg
       o   a) Calculate a 95% confidence interval for the mean weight of all such
           runners: Since σ is known, we use the Z-distribution. (The sample size is small,
           but if the population is normally distributed, which is often assumed in such
           problems, the Z-interval is appropriate). Confidence level = 95%. So α=0.05,
           α/2=0.025. Zα/2=Z0.025=1.96.
           The 95% confidence interval for the mean weight of all such runners is
           (59.99 kg,63.59 kg).
           For a two-tailed test, if the hypothesized mean μ0 falls within the confidence
           interval, we do not reject H0. If it falls outside the confidence interval, we reject
           H0. Our 95% confidence interval is (59.99 kg,63.59 kg). The hypothesized mean
           μ0=61.3 kg. Since 61.3 kg is within the interval (59.99,63.59), we do not reject
           H0 at the 5% significance level.
           2. Choose the Test Statistic: Z-test, since σ is known. Test statistic Z=σ/n
           xˉ−μ0
           3. Calculate the Test Statistic: Z=4.5/24               61.79−61.3 Z=4.5/4.8990.49
           =0.91850.49≈0.5335
           4. Determine the Critical Values: For a two-tailed test with α=0.05, the critical
           values are Zα/2=±Z0.025. Z0.025=1.96. So, the critical values are ±1.96.
           6. Conclusion: Do not reject the null hypothesis (H0). There is not enough
           evidence at the 0.05 significance level to conclude that the true mean weight of
           male runners is different from 61.3 kg. This confirms the answer from part b.
33. Use the following data and fit a linear regression line to predict the price of used
    car. The data is the about Asking price of some used Toyota Corollas advertised on
    newspaper on October 2006.
   Data: | Model Year (X) | Asking Price (Y) | | :------------- | :--------------- | | 2004 | $10950
   | | 2003 | $9400 | | 2001 | $8990 | | 1998 | $5800 | | 1997 | $5850 | | 1994 | $3800 | | 1989 |
   $1500 |
   Let's define a new variable for 'Age of the car' (X) relative to 2006 (or the newest model
   year). It's easier to work with smaller numbers for X, so let X be "Years since 1989".
   Alternatively, let's use "Age" = 2006 - Model Year. | Model Year | Age (X) | Price (Y) | |
   :--------- | :------ | :-------- | | 2004 | 2 | 10950 | | 2003 | 3 | 9400 | | 2001 | 5 | 8990 | | 1998 | 8
   | 5800 | | 1997 | 9 | 5850 | | 1994 | 12 | 3800 | | 1989 | 17 | 1500 |
   ∑X=2+3+5+8+9+12+17=56 ∑Y=10950+9400+8990+5800+5850+3800+1500=46290
   ∑X2=22+32+52+82+92+122+172=4+9+25+64+81+144+289=616
   ∑Y2=109502+94002+89902+58002+58502+38002+15002=119902500+88360000+808
   20100+33640000+34222500+14440000+2250000=373635100
   ∑XY=(2×10950)+(3×9400)+(5×8990)+(8×5800)+(9×5850)+(12×3800)+(17×1500)
   =21900+28200+44950+46400+52650+45600+25500=265200
   a) Write the equation of the fitted line The linear regression equation is Y^=a+bX,
   where: b=n(∑X2)−(∑X)2n(∑XY)−(∑X)(∑Y) a=n∑Y−b(∑X)
   b=7(616)−(56)27(265200)−(56)(46290) b=4312−31361856400−2592240=1176−735840
   ≈−625.714
   a=746290−(−625.714)(56) a=746290+35040=781330≈11618.57
   The equation of the fitted line is Y^=11618.57−625.714X, where X is the age of the car
   (in years) and Y is the predicted asking price.
   b) What is the slope of the line? What does it tell you? The slope of the line is
   b=−625.714. This means that for every additional year of age, the asking price of a used
   Toyota Corolla is predicted to decrease by approximately $625.71.
c) What is the correlation coefficient value? The correlation coefficient r is given by:
r=[n(∑X2)−(∑X)2][n(∑Y2)−(∑Y)2] n(∑XY)−(∑X)(∑Y)
   d) What proportion of the variability of the asking price explained by the age of the
   car? The proportion of variability explained is given by the coefficient of determination,
   R2, which is the square of the correlation coefficient (r2). R2=r2=(−0.9872)2≈0.9746
   So, approximately 97.46% of the variability in the asking price can be explained by the
   age of the car. This indicates a very strong linear relationship.
34. It has been observed that the amount of soil eroded (in Kg) per day (Y) is
    determined by the wind velocity (X) in that day (Km/Hr) Data obtained from the
    ministry of agriculture for a certain area gave the following regression line:
    Y=0.25+0.1123 X
   The given regression line is Y^=0.25+0.1123X. Here, Y^ is the predicted amount of soil
   eroded (in Kg) and X is the wind velocity (in Km/Hr).