**Interval Estimation by exclusion of data**

Interval Estimation can be done without assuming underlying probability distributions for the data. For example in Statistics we have to usually assume that the error term for the regression model is normally distributed to develop a confidence interval for any regression coefficient ** m** in

**.**

*y=mx+b*The interpretation for interval estimation in SWOP is this: In the discovery of knowledge based on observation effect sizes can be relatively loose or relatively tight. When measuring the gravitational pull of the Earth in physics we may have a relatively tight effect size. In contrast measuring effect sizes in certain topics in the humanities and social sciences we may have a relatively loose effect size. This is just for example. If the interval overlaps the null effect there may be little or no effect to quantify in the study. This interpretation differs from the frequentist interpretation that in 20 such similar studies, 19 studies will have a 95% confidence interval that contains the population effect size. This is a very important difference between the interpretation of SWOP and frequentist confidence intervals.

This method bypasses such a normality assumption by generating intervals based on each point ** (x,y)** and its influence on

**.**

*m*In ** Y_{i} = mX_{i} + b**,

**and**

*Y*_{i}

*X*_{i}are vectors of data with Real Entries.

** m** is the regression coefficient and is calculated via least squares. This is well known as simple linear regression.

** (x,y)_{i}** denotes the

**point of the data.**

*i*_{th}(1)

In equation (1) ** inf_{i}** is the influence of the point

**in calculating**

*i***. This is defined as the following:**

*m*** m_{U}** is the effect size

**calculated for all points in the dataset, where**

*m***denotes the entire dataset.**

*U*** m_{U-i}** is

**calculated without the**

*m***data point.**

*i*_{th}We then sort the data by values of descending ** inf_{i}**. The set of data comprising of the top 15% of

**is labelled as “**

*inf*_{i}**UL**” for upper limit and the bottom 15% of

**is labeled as “**

*inf*_{i }**LL**” or lower limit.

By removing the UL set of data we calculate the total upper influence statistic

The above excludes UL datasubset.

The above exludes LL datasubset.

The confidence intervals by excluding the influence from a 15% of data are:

This method does not rely on the probability distribution of any variable ** X_{i}** or

**. It does not rely on the equation between**

*Y*_{i}**or**

*X*_{i}**. It does not even require for the**

*Y*_{i}*subjects to be independent or uncorrelated. It just requires that m can be calculated reproducibly using some method like least squares.*

**i**Using this method we can generate confidence intervals at different levels from excluding less than 1% to anywhere below 50% of either end of data after sorting for *inf*_{i}.

**Interval Estimation by exclusion of effect**

By summating the positive and negative components of the influence statistic *inf*_{i} we get the Positive Sum Coefficient and the Negative Sum Coefficient.

for i where

for i where

The Upper and Lower Confidence Intervals by a 30\% percentage effect are as below: