Solutions: https://www.paulamoraga.com/course-aramco/99-problems-1probclt-solutions.html
R Syntax
# Choose a file interactively and store its path
pathfile <- file.choose()
# Read CSV file
d <- read.csv(pathfile)
# Read text file.
# Set header = TRUE if the first row of the data
# corresponds to the names of the variables
d <- read.table(pathfile, header = TRUE)
# Vector with values 3, 6, 7
c(3, 6, 7)
# Value of data d in row 3 and column 7
d[3, 7]
# Row 3 of data d
d[3, ]
# Column 7 of data d
d[, 7]
# Rows 3 and 5 of data d
d[c(3, 5), ]
# Columns 7 and 9 of data d
d[, c(7, 9)]
# Data d without rows 3 and 5
d[-c(3, 5), ]
# Data d without columns 7 and 9
d[, -c(7, 9)]
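The indexing rules above can be tried without reading a file. The following is a minimal sketch that assumes a small made-up data frame in place of `d`; the 5 x 9 shape and its values are hypothetical and are not taken from the course datasets.

```r
# Hypothetical 5 x 9 data frame standing in for d (values 1 to 45, filled by column)
d <- as.data.frame(matrix(1:45, nrow = 5, ncol = 9))

d[3, 7]             # single value in row 3, column 7
d[c(3, 5), ]        # rows 3 and 5, all columns
dim(d[-c(3, 5), ])  # dimensions after dropping rows 3 and 5
dim(d[, -c(7, 9)])  # dimensions after dropping columns 7 and 9
```

Negative indices remove the listed rows or columns, so the last two lines keep 3 of the 5 rows and 7 of the 9 columns respectively.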
R
If you do not have R installed, you can search for an “online R compiler” and run the code in your browser.
Datasets
What percent of a standard normal distribution \(N(\mu = 0, \sigma = 1)\) is found in each region? Be sure to draw a graph.
Suppose weights of the checked baggage of airline passengers follow a nearly normal distribution with mean 45 pounds and standard deviation 3.2 pounds. Most airlines charge a fee for baggage that weighs in excess of 50 pounds. Determine what percent of airline passengers incur this fee.
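One possible way to set up the baggage calculation in R is sketched below. It assumes the weights are exactly normal with the stated mean and standard deviation; the variable names (`mu`, `sigma`, `threshold`, `p_fee`) are chosen here for illustration.

```r
# Baggage weight X ~ N(mean = 45, sd = 3.2); a fee applies when X > 50
mu <- 45
sigma <- 3.2
threshold <- 50

# Standardized cutoff: how many standard deviations 50 lies above the mean
z <- (threshold - mu) / sigma

# Upper-tail probability P(X > 50)
p_fee <- pnorm(threshold, mean = mu, sd = sigma, lower.tail = FALSE)

z
round(100 * p_fee, 1)  # percent of passengers whose baggage exceeds 50 pounds
```

The same probability can be obtained from the standard normal distribution as `pnorm(z, lower.tail = FALSE)`, which is how the calculation would be read off a Z table.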
The average daily high temperature in June in LA is 77\(^\circ\)F with a standard deviation of 5\(^\circ\)F. Suppose that the temperatures in June closely follow a normal distribution.
The mean score for the Verbal Reasoning section for all Graduate Record Examination (GRE) takers was 151 with a standard deviation of 7, and the mean score for the Quantitative Reasoning section was 153 with a standard deviation of 7.67. Suppose that both distributions are nearly normal.
The distribution of the number of eggs laid by a certain species of hen during their breeding period has a mean of 35 eggs with a standard deviation of 18.2. Suppose a group of researchers randomly samples 45 hens of this species, counts the number of eggs laid during their breeding period, and records the sample mean. They repeat this 1,000 times, and build a distribution of sample means.
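For the sampling setup described above, the center and spread of the distribution of sample means follow from the population mean, the population standard deviation, and the sample size. A minimal sketch, assuming independent random samples so that the standard error is \(\sigma / \sqrt{n}\) (variable names are illustrative):

```r
# Population parameters and sample size from the problem statement
mu <- 35       # mean number of eggs
sigma <- 18.2  # standard deviation of the number of eggs
n <- 45        # hens per sample

# The sample means are centered at the population mean
mean_of_sample_means <- mu

# Standard error: standard deviation of the sample means
se <- sigma / sqrt(n)

mean_of_sample_means
round(se, 2)
```

Because the standard error divides by \(\sqrt{n}\), the sample means vary much less than the individual egg counts do.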
For each of the following situations, state whether the parameter of interest is a mean or a proportion. It may be helpful to examine whether individual responses are numerical or categorical.
As part of a quality control process for computer chips, an engineer at a factory randomly samples 212 chips during a week of production to test the current rate of chips with severe defects. She finds that 27 of the chips are defective.
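The numbers in this scenario can be summarized with a few lines of R. This is a sketch under the assumption that the quantities of interest are the sample proportion of defective chips and its estimated standard error; the variable names are illustrative.

```r
# Counts from the problem statement
x <- 27   # chips with severe defects
n <- 212  # chips sampled

# Sample proportion of defective chips
p_hat <- x / n

# Estimated standard error of the sample proportion
se_hat <- sqrt(p_hat * (1 - p_hat) / n)

round(p_hat, 3)
round(se_hat, 3)
```

Each chip is either defective or not, so the individual responses are categorical and the summary is a proportion rather than a mean.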