DOUBLE SAMPLING FOR RATIO ESTIMATOR

                                                                                          ALIFATHIMA ABSANA H

                                                                                                     2048111

INTRODUCTION:                                    

      In many human surveys, information is in the most cases are not obtained from all the units in the survey, even after some callbacks. An estimate obtained from such incomplete data may be misleading especially when the respondents differ from the non-respondents because the estimate can be biased. Hansen and Hurwitz (1946) proposed a technique for adjusting for non-response to address the bias problem. Their idea is to take a subsample from the non-sample respondents to get an estimate for the subpopulation represented by the non-respondents. 

DOUBLE SAMPLING:

     Double Sampling can also be referred to as the Two-phase Sampling. Auxiliary information has always been seems effective in increasing the precision of estimates in survey sampling in which the precision of estimates of the mean of the variable of interest is increased by the presence of highly correlated auxiliary variables. There are situations when auxiliary information is available at the population level and the cost of collecting the variable of interest per unit is affordable, then single-phase sampling is more appropriate. But when prior information on auxiliary variable is lacking, then it is neither practical nor economical to conduct a census for this purpose. Therefore, an appropriate technique employed to get estimates of auxiliary variables on the basis of samples is Double-phase sampling. This technique is used when the cost of obtaining estimates of the variable of interest directly from the population is expensive or impracticable. The theory of Double sampling is presented under the assumption that one of the sample is a subsample of the other. This type of sampling technique is called Double Sampling Technique or Sampling followed by sub-sampling.

RATIO ESTIMATION WITH DOUBLE SAMPLING:

  • yi - variable of interest
  • xi - auxiliary variable
  • n' - number of units in the first sample (which includes the second sample)
  • n - number of units in the second sample

Only in the second samples, both xi and yi values are observed. In the remaining units, (in the first but not the second sample), xi but not yi are observed. Note that observing yi's are expensive whereas observing xi's are not.

If xi and yi are highly linearly correlated and approximately passing through the origin, then the ratio estimate with double sampling may lead to improved estimates. While using the ratio estimate for double sampling, the ratio will be estimated using samples where both (xy) are observed, i.e., the second sample, whereas τx will be estimated by the larger first sample.

The ratio estimator is:



ANALYSIS OF A REAL LIFE APPLICATON USING R:

A forest resource manager is interested in estimating the total number of dead trees in a 400-acre area of heavy infestation. She subdivides the area into 200 plots of equal sizes and uses photo counts to find the number of dead trees in 18 randomly sampled plots. She then randomly samples 8 plots out of these 18 plots and conducts a ground count on these 8 plots. Compute the estimated variance of the ratio estimator. the ratio estimate for the population total.

Let x denote the number of dead trees in the plot by photo count and y the number of dead trees by ground count. The data are given as:

Plot

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

x'

5

7

10

6

7

9

3

6

8

11

5

9

12

13

3

20

15

4

 

Out of these 18 plots, 8 are randomly selected and a ground count is conducted.

Plot

2

3

5

6

12

15

16

17

x

7

10

7

9

9

3

20

15

y

9

13

10

11

10

4

25

17

y-rx

0.3375

0.6250

1.3375

-0.1375

-1.1375

0.2875

0.2500

-1.5625

 

 

 

 

 

 

 

 

 

 


                         

                          

                           

                                       

APPLICATIONS:

      Double sampling can be employed for estimating the number of transmission sources in a Poisson process, for biomass estimates, for correcting helicopter counts of moose, for estimating sizes of animal populations, for estimation related to welfare programs, for evaluating a new medical diagnostic test, for estimating employment rates, for estimation in forestry. The double sampling method can be employed for determining the sample size for estimation related to rare items

ADVANTAGES:

      The potential advantage of using a double sampling plan is that it may reduce the average number of inspections, compared to the equivalent single sampling plan. It means, while the double sampling plan is as effective as the single sampling plan, the former is less expensive to conduct. From a psychological viewpoint, it is desirable to have a second chance Sometimes using a double sampling plan is the only way of chance. Sometimes using a double sampling plan is the only way of convincing upper management to implement a statistically valid sampling plan

DISADVANTAGES:

   Unless curtailment(curtailment refers to reject a lot without complete inspection of the second sample)is used on the second sample, under some circumstances double sampling may require more total inspection than would be required in a single-sampling plan that offers the same protection. So, unless double sampling is used carefully, its potential economic advantage may be lost. The second disadvantage of double sampling is that it is administratively more complex, which may increase the opportunity for the occurrence of inspection errors.

Comments

Popular posts from this blog

Comparing the efficiency of SRSWOR and SRSWR with the help of R Programming

Selection of samples:SRSWR vs SRSWOR(2048114)

pps (probability proportional to size) Systematic Sampling