As we all know that while dealing with a large population data set, it becomes imperative during the data analysis stage to pull a sample and use that as a reference data set to perform data analysis.
Stratified sampling is a one of the approaches for sample extraction. It is very easy to develop a stratified sampling code (I have used SQL) to extract the sample from the underlying population. The question that I have is how can we verify that the extracted sample is indeed a stratified sample? I have tried using Kolmogorov-Smirnov Test but have not been successful in using the results of the test to prove that the sample I extracted was indeed a stratified sample. By not being successful I mean my p value is less than 0.0001 with my null hypothesis being that the sample is a stratified sample. In this situation, I am pretty sure that the sample I pulled is indeed a stratified sample.
What other tests can I perform? Or What additional points should I consider before performing the K-S test?
0 comments:
Post a Comment