Professional Documents
Culture Documents
Introduction Probability, Distributions and Correlation Estimating Under Uncertainty Tight Clastics / Carbonate Assessment Shale Assessment Reservoir Flow Valuation Techniques
Theresnothingismoredangerousthanasampleofone
QuotefromE.C.Capen
Definitions
Event: One of two or more things which can occur What does occur Subjective confidence about the likelihood of an uncertain future event, given trials An orderly portrayal of related data samples selected from a population; Portrayal Types include frequency, % frequency cumulative frequency, cum. % frequency log - cumulative probability (probit) Various shapes available in commercial software for curve fitting and forecasting Outcome: Probability: Distribution:
Frequency Distributions
60 50
Distribution of gross pay intervals, your trend Helpful distribution, lets call the peak frequency the mode. What % of the outcomes are > or = than 80? Distribution not that helpful anymore we need another way of looking at the data
Frequency
40 30 20
mode
10 0
20
20% 0%
40
60
80
100
Feet
Probability <=
Cumulative % Distributions
Distribution of gross pay intervals, your trend Distribution not that helpful anymore we need another way of looking at the data Lets accumulate the outcomes from small to large Lets also convert the frequency to a % of the total population
- 100 % - 80 % - 60 % - 40 % - 20 % -0%
Probability of < =
20
40
60
80
100
Feet
Frequency
40 30 20 10 0
20 40 60
mode
80
100
Feet
Probability <=
Cumulative % Distributions
Distribution of gross pay intervals, your trend
- 100 %
Distribution not that helpful anymore - 80 % we need another way of looking at the data Lets accumulate the - 60 % outcomes from large to small - 40 % Lets also convert the frequency to a % of the total population
Probability of > =
- 20 % -0%
20
40
60
80
100
Feet
Cumulative % Distributions
Distribution of gross pay intervals, your trend
Probability of < =
100 % 80 % 60 % 40 % 20 % 0%-
- 100 % - 80 % - 60 % - 40 % - 20 % -0%
Probability of > =
20
40
60
80
100
Feet
One of the main standards in risk analysis is selecting how you accumulate frequency distributionsin this course, we will accumulate from the large end, referred to as the greater than convention (P10 is a big number), which is consistent with the latest PRMS guidelines. A Percentile is a ?.
Plotting Conventions
In a Greater Than convention: P10 is the larger number P90 is the smaller number
In the LE convention
P10 is the value on the distribution for which there is a 10% probability that a random selection from that distribution will be less than or equal to that value this is a small number P90 is the value on the distribution for which there is a 90% probability that a random selection from that distribution will be less than or equal to that value this is a large number
10
Frequency
1
Rose & Associates, LLP
DieFaceOutcome
11
6
Ch 2 - Probability, Distributions, and Correlations AAPG Cartagena 2D course, Sept. 2013
Note how the Population average is reached with large sample sizes. The Population mean is not an actual die face. The concept of the average comes from observations of numerous repeated trials.
5orless
Count orFrequency
3orless
1orless
Dieface
12
2ormore
Count orFrequency
4ormore
6ormore
Dieface
Count orFrequency
25%
Dieface
13
Die Two
Outcome
Frequency
14
Frequency
100 80 60 40 20 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
Die Two
Outcome
15
Frequency
Frequency
400
600
800
1000
1200
16
Niobrara Shale
Coskey (2011)
17
Linear Scale
Log Scale
A logarithmic x axis telescopes the uncertainty to a manageable scale, and the probit y axis was developed to permit highly asymmetrical distributions to appear as a straight line
18
+3 +2 +1
-1 -2 -3
A logarithmic x axis telescopes the uncertainty to a manageable scale, and the probit y axis was developed to permit highly asymmetrical distributions to appear as a straight line
P99
P98 P95
P01
P02 P05
P90
P80 P70 P60
P10
P20 P30 P40
P50
P40 P30 P20
P50
P60 P70 P80
P10
P05 P02
P90
P95 P98
P01
P99
19
P99
P98 P95
P01
P02 P05
P90
P80 P70 P60
P10
P20 P30 P40
P50
P40 P30 P20
P50
P60 P70 P80
P50
P90
P10
P05 P02
P90
P95 P98
P10
P01
1 10
P99
100
+1
-1
If the x axis was linear then normal distributions would plot as straight lines
A logarithmic x axis telescopes the uncertainty to a manageable scale, and the probit y axis was developed to permit highly asymmetrical distributions to appear as a straight linenote also how the scale balances the area of a standard normal distribution under equal percentile segments
20
Geologically possible; but extremely unlikely Reasonable Maximum Half below, Half above, the median Reasonable Minimum As small as it could be . . . Yet detectable
P01P00? P01
P10
P90
P50
P50
P01 P100-P99? 1
P99
10 100 1,000 10,000
21
Mean:
Where,
The average of all outcomes. 2 Mean = e( + 0.5 ) = natural log of the median (mean of natural logs) = [Ln(P90) + Ln(P10)] / 2 = standard deviation of natural logs = [Ln(P10) Ln(P90)] / (2 x 1.282)
Measures of Dispersion
P10 / P90 2 Variance
SPREAD
Standard
Deviation Skewness Range
n (n 1)(n 2)
_ xi - x
s
22
Geologically possible; but extremely unlikely Reasonable Maximum Half below, Half above, the median Reasonable Minimum As small as it could be . . . Yet detectable
P01
P90
P10
P50
P50
P10
P90
P01
1 10 100 1,000
P99
10,000
Avg rate MCF / D in the Year of Maximum Production Fruitland Coals, SJ Basin (n = 564)
P10 = 900 P50 = 90 P90 = 9 4.5 1.8
From best fit line Statistical Mean = 452 Mode = 4 P10 / P90 = 900 / 9 = 100
23
Which single well metric is best transferable from the single well level to the program level?
P99
P98 P95
P01
P02 P05
P90
P80 P70 P60
P10
P50
P40 P30 P20
P50
P60 P70 P80
P10
P05 P02
P90
P95 P98
P01
0.1 1
P99
10
P90
P50
P10
Which single well metric is best transferable from the single well level to the program level?
Mode ~ 0.4 EUR from a prospect
Mean ~ 1.50
24
To simulate a Program of Single Wells we use a Monte Carlo Analysis simulation approach
Heres how it works
Which single well metric is best transferable from the single well level to the program level?
10x Mode = 4 from the sum of 10 distributions modes = mode of sum? NO
25
Which single well metric is best transferable from the single well level to the program level? MEAN
10x Mode = 4 from the sum of 10 distributions modes = mode of sum? NO
10x Median = 10 from the sum of 10 distributions medians = median of sum? NO 10x Mean = 15 from the sum of 10 distributions means = Mean of sum? YES
Only the mean appears transferable from the well to the program level .The mean of the sum is the sum of the means.
Lognormal Distributions:
Estimates of the population Mean Arithmetic Mean = (sum of n values) / n Statistical Mean =
Mz = e ( + 0.5
2)
= Natural Log of the Median = [Ln(P90) + Ln(P10)] / 2 = Standard Deviation of the Natural Logs = [Ln(P10) Ln(P90)] / (2 x 1.282)
26
P01
P02 P05
P90
P80 P70 P60
The P10 is representative of the mean value of the top 30% of the curve
P10
P20 P30
P50
P40 P30 P20
The P50 is representative of the mean value of the central 40% of the curve
P40
P50
P60 P70 P80
P10
P05 P02
The P90 is representative of the mean value of the lower 30% of the curve
P90
P95 P98
P01
P99
10
100
1,000
10,000
27
Exercise 4: Production Rate Distribution Production Rate (Mcfd): 850, 2500, 570, 1100, 160, 333, 1333
Using the mid-point approach, build a Rate distribution on log probability paper: MCFD Rank %tile a. Arrange all production rates by size,
largest first and assign Rank b. Determine the Percentile Values n = 7; Percentile = (100/n) * (Rank 0.5) c. Tabulate production rates sizes with percentiles (%tile)
1 2 3 4 5 6 7
7.2
28
200
120 150
~2/3 ~1/2 ~1/3
130
100
Patterns of Bias
200
From 100 to 200 120 is 0.30 of the distance from 100 to 200 (about 1/3) 130 is 0.47 of the distance from 100 to 200 (about 1/2) 150 is 0.70 of the distance from 100 to 200 2/3)c (about 2/3)
1,000
29
30
P01
P02 P05
P10
P20 P30 P40
P50
P40 P30 P20
P50
P60 P70 P80
P10
P05 P02
P90
P95 P98
P01
P99
10
Rose & Associates, LLP
100
31
1,000
10,000
Ch 2 - Probability, Distributions, and Correlations AAPG Cartagena 2D course, Sept. 2013
32
2. What is the median & P50 production rate in this trend? 3. How do you know that this trend shows a lognormal distribution? 4. What is the arithmetic mean of this distribution? 5. Calculate the mean of this distribution using Swanson's Rule Swansons Mean = (0.3) * P90 + (0.4) * P50 + (0.3) * P10
Rose & Associates, LLP 33 Ch 2 - Probability, Distributions, and Correlations AAPG Cartagena 2D course, Sept. 2013
Beta Distributions
Beta distributions range from zero to one (0 to 1)
Just about any variable that ranges from zero to one can be represented by a Beta distribution and not require truncation It can take on a variety of shapes including symmetric, right or left skewed or U-shaped It can be uniquely defined by mean and variance or P10 and P90 Beta distributions are ideal for porosity, saturation, recovery efficiency, or N/G
Rose & Associates, LLP 35 Ch 2 - Probability, Distributions, and Correlations AAPG Cartagena 2D course, Sept. 2013
0.2
0.1
0.0 0.00
0.20
0.40
0.60
0.80
1.00
0.4
0.3
0.2
0.1
Right skewed water saturation, oil recovery efficiency, N/G in sand-shale sequences Left skewed gas recovery efficiency, N/G of massive sands
0.0 0.00
0.20
0.40
0.60
0.80
1.00
36 Ch 2 - Probability, Distributions, and Correlations AAPG Cartagena 2D course, Sept. 2013
0.2
0.1
0.0 0.00
0.20
0.40
0.60
0.80
1.00
0.4
0.3
0.2
0.1
Low variance, largely symmetric porosity U-shaped percent oil or gas volume (high probability of one or the other, low probability of both together)
0.0 0.00
0.20
0.40
0.60
0.80
1.00
37 Ch 2 - Probability, Distributions, and Correlations AAPG Cartagena 2D course, Sept. 2013
N/G data from deepwater reservoirs Three distributions with mean and variances that match the data Note the lack of fit with both the lognormal and normal curves The best fit is clearly the Beta distribution
P01 P05 P10 P20 P30 P40 P50 P60 P70 P80 P90 P95 P99 P99.9 1000%
100%
Triangular Distributions
No naturally occurring process is triangularly distributed Artificial distribution, developed as a computational shortcut Particularly harmful as variance increases Ok for narrow Normal Distributions
Max @ P01?
10
Correlation:
Establishing whether - and the extent to which variables interrelate is probably more important than deciding on the distributions Assuming independence when a dependency really exists biases the results, often quite significantly
Correlation:
Correlation coefficient is a measure of the degree of correlation between parameters. A high positive correlation implies that given a high input, that the correlated value will also sample a high value. In this course we will only use linear Correlation Coefficients as a measure of the linear association between variables. Correlation coefficients range from -1.0 to 1.0 Negative coefficients occur when we want to model inverse correlation such as the relationship between interest rates and housing starts
41
0.50
0.75
42
Correlation: coefficients
How does a Correlation Coefficient of 0.66 work?
Variable B
r=
Variable A Sqrt [
n i=1 n i=1
(x-mean)(y-mean) (x-mean)2
n i=1
(y-mean)2]
Possible outcomes of B as a function of A are constrained to 34% of the full distribution of B. Hence the Correlation Coefficient r = 1- 0.34 = 0.66
Rose & Associates, LLP 43 Ch 2 - Probability, Distributions, and Correlations AAPG Cartagena 2D course, Sept. 2013
Correlation: coefficients
Cumulative Probability 100
Cumulative Probability
100
Variable B
xx
x x x
~1/3
44
100
200
500
1000
R=0
Mean
100
Prob or greater % Prob or greater %
100
200
500
1000
45
46
47
48
49
Mean = 80
50
Key Points
Central Limit Theorem Distribution type, shape and variance Lognormal shapes are very common in our business The mean is the best single point representation of a large programs average result Establishing the nature and scope of correlation and dependency can be more critical than distribution type The larger the P10/P90 of inputs, the stronger the effect of your correlations The best distribution is the one you can best defend
Rose & Associates, LLP 51 Ch 2 - Probability, Distributions, and Correlations AAPG Cartagena 2D course, Sept. 2013
52
Recommended Website:
53