Skip to main content

3 steps for p-value ( p value ) analysis


 p-value:
=======
Probability (or the area) at the tail of a bell shaped curve, where,

    center of bell = population mean,
    marker = sample mean
    p-value = area remaining at the tail after deducting the are
 
intuition:
=========
the lower the area ==> the higher the distance between centre and sample mean . So, we can reject hypothesis.

Steps to calculate p-value


1. Assume null hypethesis ( i.e.  Population mean)
====================================
It( actually the opposite of this ) will be the Null Hypothesis

2. Calculate sample statistics
=====================
Now, calculate statistics from available data.

3. Calculate p-value
==================
If p-value is small ==> distance between population mean and sample mean is high ==> Reject Null Hypo

If p-value is big==> distance between population mean and sample mean is small ==>Fail to Reject Null Hypo



         Fig : p-value (source: wiki )


As, we can not accept a null hypothesis() , we try to reject null hypo. So, it is wise to use the hypo opposite to what we are trying to establish as Null Hypo.

Comments

Popular posts from this blog

MATLAB cross validation

// use built-in function samplesize = size( matrix , 1); c = cvpartition(samplesize,  'kfold' , k); % return the indexes on each fold ///// output in matlab console K-fold cross validation partition              N: 10    NumTestSets: 4      TrainSize: 8  7  7  8       TestSize: 2  3  3  2 ////////////////////// for i=1 : k    trainIdxs = find(training(c,i) ); %training(c,i);  // 1 means in train , 0 means in test    testInxs  = find(test(c,i)       ); % test(c,i);       // 1 means in test , 0 means in train    trainMatrix = matrix (  matrix(trainIdxs ), : );    testMatrix  = matrix (  matrix(testIdxs  ), : ); end //// now calculate performance %%  calculate performance of a partition     selectedKfoldSen=[];selectedKfoldSpe=[];selectedKfoldAcc=[];     indexSen=1;indexSpe=1;indexAcc=1;     if ( kfold == (P+N) )% leave one out         sensitivity = sum(cvtp) /( sum(cvtp) + sum(cvfn) )         specificity = sum(cvtn) /( sum(cvfp) + sum(cvtn) )         acc

R tutorial

Install R in linux ============ In CRAN home page, the latest version is not available. So, in fedora, Open the terminal yum list R  --> To check the latest available version of r yum install R --> install R version yum update R --> update current version to latest one 0 find help ============ ?exact topic name (  i.e.   ?mean ) 0.0 INSTALL 3rd party package  ==================== install.packages('mvtnorm' , dependencies = TRUE , lib='/home/alamt/myRlibrary/')   #  install new package BED file parsing (Always use read.delim it is the best) library(MASS) #library(ggplot2) dirRoot="D:/research/F5shortRNA/TestRIKEN/Rscripts/" dirData="D:/research/F5shortRNA/TestRIKEN/" setwd(dirRoot) getwd() myBed="test.bed" fnmBed=paste(dirData, myBed, sep="") # ccdsHh19.bed   tmp.bed ## Read bed use read.delim - it is the  best mybed=read.delim(fnmBed, header = FALSE, sep = "\t", quote = &q