Statistics Assignment: Descriptive & Inferential Analysis

Statistics Assignment: Descriptive & Inferential Analysis

Question 1

The height and weight of 10 students were measured and the data is shown in Table 1.

Students12345678910
Height (cm)174166173160162180175177164170
Weight (kg)77607550558077795269

Table 1: Height and Weight of Students

  • Describe the data from Table 1 by calculating the mean, variance and standard deviation of the height and weight. Round all numbers to TWO (2) decimal places.

(5 marks)

  • Use software to generate the boxplots for both height and weight. Interpret the plots and check for outliers.

(5 marks)

  • Use software to generate a scatter plot for weight vs height with the weight on the vertical axis. Thereafter, calculate the correlation coefficient between weight and height.

(5 marks)

  • Show the mathematical proof that if the unit of measurement for height is change from centimetre to feet, the correlation coefficient will not be affected.

(5 marks)

Question 2

  • A factory produces light bulbs, and 95% of the light bulbs pass quality control while 5% are defective. If a quality control inspector randomly selects 10 light bulbs for inspection,
  • What is the probability that exactly 2 of the selected light bulbs are defective?

(5 marks)

  • What is the probability of finding more than 2 defective bulbs in the sample of 10?

(5 marks)

  • The mean score of an entire cohort of students for a course is 67.8 with standard deviation (þÿ) of 11.5. Apply the normal distribution,
  • Find the probability of students scoring higher than 70 marks for this course.

(5 marks)

  • If 25% of the cohort were awarded A grade, what is the lowest possible score for an A grade? Round the number to a whole number.

(5 marks)

Question 3

Table 3 shows the scores for an exam for a sample of 25 students from the Class of 2025 and another sample of 25 students from the Class of 2024.

S/NoClass of 2025Class of 2024
17050
26441
34687
45775
54066
67441
78365
85963
96778
104963
114569
126072
135572
144766
156347
167790
174652
185377
195751
205355
216042
227454
238490
245858
258561

Table 3: Exam Scores for Class of 2025 and 2024

  • Apply a statistical method and calculate the upper and lower limit for a 95% confidence interval for the data from the Class of 2025.

(5 marks)

  • Apply a statistical method and calculate the upper and lower limit for a 95% confidence interval for the data from the Class of 2024.

(5 marks)

  • Interpret the two confidence intervals determine if there is any difference between the scores of 2025 versus 2024.

(5 marks)

  • Using the data from Table 3, generate a 95% confidence interval for 2024 and 2025 using the bootstrapping method. Use numpy.random.seed(88) and repeat the bootstrap process 5000 times. Show the new confidence intervals and the Python codes in your writeup.

(5 marks)

Question 4

Due to work commitments, 20 students did not pass a course in the July 2024 semester. They repeated the same course in January 2025 and their scores in January 2025 and July 2024 are shown in Table 4.

StudentsJan 2025Jul 2024
163.0014.00
272.008.00
366.0021.00
455.0026.00
575.0032.00
640.0024.00
744.009.00
840.0030.00
960.007.00
1075.0015.00
1144.0026.00
1271.0019.00
1371.0013.00
1454.0013.00
1540.0034.00
1654.005.00
1770.0033.00
1867.009.00
1962.0028.00
2048.0028.00

Table 4: Scores From July 2024 and January 2025 Semester

  • Determine whether the data provided in Table 4 represents independent or dependent samples and explain your reasoning.

(5 marks)

  • Apply an appropriate hypothesis test at 5% significance level to test whether students who repeated a course will improve their grades by at least 10 points. Conduct the test in a step-by-step manner.

(10 marks)

  • Interpret the results for this test.

(5 marks)

Question 5

A nutritionist claims that the average sugar content in a popular brand of energy bars is 15 grams. To test this claim, a researcher randomly selects 10 energy bars of this brand and measures their sugar content (in grams). The data is shown in Table 5 and assumes the sugar content in energy bars is normally distributed.

14.215.814.914.816.115.314.815.615.814.9

Table 5: Weigths of 10 Enery Bars

  • Apply a statistical method and calculate the upper and lower limit for a 95% confidence interval for the sugar content. Is the claim of 15 grams reasonable?

(5 marks)

  • Apply an appropriate hypothesis test at 5% significance level to test whether the mean sugar content is equal to 15 grams. Conduct the test in a step-by-step manner.

(10 marks)

  • Interpret the results for this experiment.

(5 marks)

dissertation structure

Want Help Structuring Your Answers!!

✔ Native Singaporean Experts

✔ All Subjects Covered

✔ Professional Guidance

Get Help With Statistics Assignment

Statistics Assignment Answers: Experts Answer on Above Descriptive & Inferential Analysis

Descriptive statistics and correlation

The calculation of mean, variance and standard deviation shows that the mean for height is 170.1 and weight is 69.9. The variance for height is 47.43 nd weight is 96.3 and standard deviation for height is 6.89 and weight is 9.82

Box plot interpretation

With respect to height, the boxplot is identified as symmetrical and there are no outliers identified. The weight shows slightly right skewed, and from this analysis it is evaluated that the height and weight of the majority of the students are within normal variation.

Caterplot and correlation

The scatter plot indicates a moderate positive linear relationship, as it is identified that taller students have higher weights. The correlation Coefficient is calculated is .78 which shows a strong positive correlation. The analysis also indicates that the correlation is not affected by the change in the unit.

Probability and normal distribution

In case of defective light bulb, the n is 10 and p is 0.05. P(X-2) = 0.0116

Normal distribution

The lowest A grade is identified as 76 marks, and 42.5% of students scored above 70.

Want a Full Worked Out Answer with References?

Related answers