t value and regression

SCHOOL OF MATHEMATICS, STATISTICS AND OPERATIONS RESEARCH
STAT 392

Tutorial – Ratio and Regression Estimation

1. Regression Estimation (from Lohr, Ex 3.6.4)
Foresters want to estimate the average age of trees in a stand. Determining age is cumbersome because one needs to count the tree rings on a core taken from the tree. In general, though, the older the tree, the larger the diameter, and diameter is easy to measure. The foresters measure the diameter of all 1132 trees and find that the population mean is 26.2 cm. They then randomly select 20 trees for age measurement.

Tree, k    Diameter, x_k (cm)    Age, y_k (years)
   1             30.5                  125
   2             29.0                  119
   3             20.1                   83
   4             22.9                   85
   5             26.7                   99
   6             20.1                  117
   7             18.5                   69
   8             25.9                  133
   9             29.7                  154
  10             28.7                  168
  11             14.5                   61
  12             20.3                   80
  13             26.2                  114
  14             30.5                  147
  15             23.4                  122
  16             21.6                  106
  17             17.8                   82
  18             27.2                   88
  19             23.6                   97
  20             20.8                   99

(a) Treating the trees as a simple random sample, estimate the mean age of trees in the stand, Ȳ, with a variance estimate, 95% confidence interval, and RSE. Comment on the quality of the estimate.
(b) Draw a scatterplot of these data (make sure the x and y axes both start at zero). Fit a regression line y = α + βx + ε to the data, and draw it on the plot.
(c) Determine whether ratio estimation using diameter as the auxiliary variable would be beneficial.
[You will need to compute the correlation coefficient of x and y, and their respective coefficients of variation.]
(d) Make a ratio estimate of the mean age of trees in the stand, Ȳ, with a variance estimate, 95% confidence interval, and RSE. Comment on the quality of the estimate.
  i. Fit the zero-intercept regression line y = Rx + ε to the data.
  ii. Add this line to your scatterplot.
  iii. Estimate Ȳ with Ȳ_R = R X̄, where X̄ is the population mean value of x.
  iv. Compute the residuals e_k = y_k − ŷ_k = y_k − R x_k.
  v. Compute the variance of the residuals, s_e^2.
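
The calculations asked for in parts (a) to (d) can be sketched in Python using only the standard library. This is an illustrative sketch under stated assumptions, not the tutorial's official solution. The names (fpc, summarise, ratio_helps and so on) are made up for the example. The 95% interval uses an assumed t quantile of 2.093 for 19 degrees of freedom. R is taken as the classical ratio of sample means, which differs slightly from an ordinary least-squares fit through the origin. The ratio variance uses the simple form (1 − n/N) s_e^2 / n, whereas some texts also multiply by (X̄ / x̄)^2. The "ratio is beneficial" check is the usual rule of thumb r > CV_x / (2 CV_y).

```python
import math
import statistics

N = 1132          # trees in the stand (from the exercise)
X_BAR_POP = 26.2  # population mean diameter in cm (from the exercise)
T_975_19 = 2.093  # assumed t quantile for a 95% interval with 19 df

x = [30.5, 29.0, 20.1, 22.9, 26.7, 20.1, 18.5, 25.9, 29.7, 28.7,
     14.5, 20.3, 26.2, 30.5, 23.4, 21.6, 17.8, 27.2, 23.6, 20.8]
y = [125, 119, 83, 85, 99, 117, 69, 133, 154, 168,
     61, 80, 114, 147, 122, 106, 82, 88, 97, 99]

n = len(y)
fpc = 1 - n / N  # finite population correction


def summarise(estimate, variance):
    """Return estimate, variance, SE, 95% CI and RSE (hypothetical helper)."""
    se = math.sqrt(variance)
    half_width = T_975_19 * se
    return {
        "estimate": estimate,
        "variance": variance,
        "se": se,
        "ci95": (estimate - half_width, estimate + half_width),
        "rse": se / estimate,
    }


# (a) Simple random sample estimate of the mean age.
x_bar = statistics.mean(x)
y_bar = statistics.mean(y)
var_srs = fpc * statistics.variance(y) / n
srs = summarise(y_bar, var_srs)

# (b) Least-squares line y = alpha + beta * x (plotting is left out).
s_x, s_y = statistics.stdev(x), statistics.stdev(y)
s_xy = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y)) / (n - 1)
beta = s_xy / s_x**2
alpha = y_bar - beta * x_bar

# (c) Correlation and coefficients of variation.
r = s_xy / (s_x * s_y)
cv_x, cv_y = s_x / x_bar, s_y / y_bar
ratio_helps = r > cv_x / (2 * cv_y)  # rule of thumb, see lead-in

# (d) Ratio estimate, residuals and residual variance.
R = y_bar / x_bar                      # assumed: ratio of sample means
y_bar_ratio = R * X_BAR_POP
residuals = [b - R * a for a, b in zip(x, y)]
s2_e = statistics.variance(residuals)  # residual mean is zero up to rounding
var_ratio = fpc * s2_e / n             # simple form, see lead-in
ratio = summarise(y_bar_ratio, var_ratio)

print("SRS:  ", srs)
print("OLS:   alpha =", round(alpha, 3), "beta =", round(beta, 3))
print("r =", round(r, 3), "CV_x =", round(cv_x, 3), "CV_y =", round(cv_y, 3))
print("Ratio estimation beneficial:", ratio_helps)
print("Ratio:", ratio)
```

Comparing the two dictionaries printed for the SRS and ratio estimates (standard error, interval width, RSE) gives a direct basis for the "comment on the quality of the estimate" parts of (a) and (d).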
