Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Pie chart
A pie chart is a segmented circle whose segments
portray the relative frequencies of the categories
of some qualitative variable.In this example, the
variable. Region is proportionallydivided into 4
parts.
Bar chart
A bar chart depicts the frequency or the relative
frequency for each category of the qualitative data
as a bar rising vertically from the horizontal
axis.For example, Adidas‘ sales may be
proportionallycompared for each Region over
these two periods.
Skewed distribution
• Positively skewed -data form a long,
narrow tail to the right.
• Negatively skewed -data form a long,
narrow tail to the left.
Polygon
A polygonis a visual representation of a frequency
or a relative frequency distribution.Plot the class
midpoints on x-axis and associated frequency (or
relative frequency) on y-axis.Neighboring points
are connected with a straight line.
Here is a polygon for the house-price data.
Age = 36
Ogive
An ogiveis a visual representation of a cumulative
frequency or a cumulative relative frequency
distribution.Plot the cumulative frequency (or
cumulative relative frequency) of each class above
the upper limit of the corresponding class.The
neighboring points are then connected. Here is an Scatterplots
ogive for the house-price data. A scatterplot is used to determine if two variables
are related. Each point is a pairing: (x1,y1), (x2,y2),
etc.This scatterplot shows income against
education.
CHAPTER 3
NUMERICAL DECRIPTIVE MEASURES
Investment Decision
As an investment counselor at a large bank,
Rebecca Johnson was asked by an inexperienced
investor to explain the differences between two
top-performing mutual funds:
Curvilinear relationship • Vanguard‘s Precious Metals and Mining
• As x increases, y increases at an increasing fund (Metals)
(or decreasing) rate. • Fidelity‘s Strategic Income Fund (Income)
• As xincreases y decreases, at an increasing The investor has collected sample returns for
(or decreasing) rate. these two funds for years 2000 through 2009.
These data are presented in the next .
Population Mean
This mean does not reflect the typical salary! Consider the sorted data from the introductory
case.
Median
The medianis another measure of central location
that is not affected by outliers.When the data are
arranged in ascending order, the median isthe For the 25th percentile, we locate the position:
middle value if the number of observations is odd,
orthe average of the two middle values if the
number of observations is even.
Consider the sorted salaries of employees at Similarly, for the 75th percentile, we first find:
Acetech (odd number).
Mode Both L25 = 2.75 and L75 = 8.25 are not integers,
The modeis another measure of central location. thusThe 25th percentile is located 75% of the
• The most frequently occurring value in a distance between the second and third
data set observations, and it is
• Used to summarize qualitative data
• A data set can have no mode, one mode
(unimodal), or many modes (multimodal). The 75th percentile is located 25% of the distance
between the eighth and ninth observations, and it
Consider the salary of employees at Acetech is
Using the results obtained from the Metals fund where nis the number of multiperiod growth rates.
data, we can label the box plot with the 5 summary
values: For observations x1, x2, . . . , xn, the average
growth rate Ggis calculated as
Note that IQR = Q3-Q1 =43.48
Detecting outliers
Calculate IQR = 43.48
Calculate 1.5 ×IQR, or 1.5 ×43.48 = 65.22
are outliers if :
Q1 –S > 65.22, or if
L –Q3 > 65.22
There are no outliers in this data set.
The average growth rate using the simplified
The Geometric Mean formula is:
Remember that the arithmetic mean is an additive
average measurement.Ignores the effects of
compounding.The geometric meanis a
multiplicative average that incorporate
compounding. It is used to measure:
• Average investment returns over several
years, Measures of Dispersion
• Average growth rates. Measures of dispersion gauge the variability of a
data set.Measures of dispersion include:
The Geometric Mean 1. Range
For multiperiod returns R1, R2, . . . , Rn,the 2. Mean Absolute Deviation (MAD)
geometric mean return GRis calculated as:
Muhammad Firman (University of Indonesia - Accounting ‘12 ) 116
Masterbook of Business and Industry (MBI)
Measures of Dispersion
Mean Absolute Deviation (MAD)
MAD is an average of the absolute difference of Calculate the coefficient of variation (CV) using the
each observation from the mean. data from the Metals fund and the Income fund
where
Reconsider the example of the lecture class with The square root of the variance yields a standard
280 students with a mean score of 74 and a deviation of $103.13.
standard deviation of 8. Assume that the
distribution is symmetric and bell-shaped. Weighted Mean
Approximately how many students scored within Let w1, w2, . . . , wndenote the weights of the
58 and 90? sample observations x1, x2, . . . , xnsuch that, then
Muhammad Firman (University of Indonesia - Accounting ‘12 ) 118
Masterbook of Business and Industry (MBI)
Let‘ s calculate the covariance and the correlation After collecting data from 600 recent purchases,
coefficient for the Metals (x) and Income (y) funds. Annabel wants to determine weather age
Positive Relationship influences brand choice.
By definition, 0! = 1.
For example, in how many ways can a little-league
coach assign nine players to each of the nine team
positions (pitcher, catcher, first base, etc.)?
Solution:
The Total Probability Rule conditional on two
outcomes
The total probability rule conditional on two The Combination Formula
events, B and Bc, is The number of ways to choose x objects from a
total of n objects, where the order in which the x
or equivalently, objects are listed does not matter, is referred to as
a combinationand is calculated as :
Bayes’ Theorem
A procedure for updating probabilities based on
new information.Prior probability is the original The Permutation Formula
(unconditional) probability (e.g., P(B) ).Posterior The number of ways to choose x objects from a
probability is the updated (conditional) probability total of n objects, where the order in which the x
(e.g., P(B | A) ).Given a set of prior probabilities for objects is listed does matter, is referred to as a
an event andsome new information, the rule for permutationand is calculated as:
updating theprobability of the event is called
Bayes’ theorem.
Random variable
A function that assigns numerical values to the Discrete Probability Distributions
outcomes of a random experiment.Denoted by A discrete probability distribution may be viewed
uppercase letters (e.g., X ).Values of the random as a table, algebraically, or graphically.For
variable are denoted by corresponding lowercase example, consider the experiment of rolling a six-
letters.Corresponding values of the random sided die. A tabular presentation is:
variable: x1, x2, x3, . . .
Random variables may be classified as:
Each outcome has an associated probability of 1/6.
1. Discrete : The random variable assumes a Thus, the pairs of values and their probabilities
countable number of distinct values. form the probability mass function for X.Another
tabular view of a probability distribution is based
Muhammad Firman (University of Indonesia - Accounting ‘12 ) 124
Masterbook of Business and Industry (MBI)
Using this formula we can find Expected Value, Variance, and Standard
Deviation
Summary measures for a random variable include
the
1. Mean (Expected Value)
A probability distribution may be expressed 2. Variance
graphically.The values x of X are placed on the 3. Standard Deviation
horizontal axis and the associated probabilities on
the vertical axis.A line is drawn such that its height
is associated with the probability of x.For example,
here is the graph representing the six-sided die
experiment: E(X) is the long-run average value of the random
variable over infinitely many independent
repetitions of an experiment.For a discrete
random variable Xwith values x1, x2, x3, . . . that
occur with probabilities P(X= xi), the expected
value of Xis
2. Calculate the variance and standard Portfolio is defined as a collection of assets such as
deviation of the annual bonus amount. stocks and bonds.
• Let X and Y represent two random
Solution: Let the random variable X denote the variables of interest, denoting, say, the
bonus amount (in $1,000s) for an employee. returns of two assets.
• Since an investor may have invested in
both assets, we would like to evaluate the
portfolio return formed by a linear
combination of X and Y .
Properties of random variables useful in
evaluating portfolio returns.
Given two random variables X and Y,The expected
value of Xand Yis
Risk Neutrality and Risk Aversion where Cov(X,Y) is the covariance between Xand
Risk averse consumers: Y.For constants a, b, the formulas extend to
• Expect a reward for taking a risk.
• May decline a risky prospect even if it
offers a positive expected gain.
Risk neutral consumers: Expected return, variance, and standard
• Completely ignore risk. deviationof portfolio returns.
• Always accept a prospect that offers a Given a portfolio with two assets, Asset Aand
positive expected gain. Asset B,the expected return of the portfolio E(Rp)
is computed as:
Risk loving consumers:
• May accept a risky prospect even if the
expected gain is negative.
where :
Application of Expected Value to Risk wAand wBare the portfolio weights
Suppose you have a choice of receiving $1,000 in wA+ wB= 1
cash or receiving a beautiful painting from your E(RA) and E(RB) are the expected returns on
grandmother.The actual value of the painting is assetsAand B, respectively.
uncertain. Here is a probability distribution of the
possible worthof the painting. What should you Using the covariance or the correlation coefficient
do? of the two returns, the portfolio variance of return
is:
Portfolio Returns
Investment opportunities often use both:
• Expected return as a measure of reward.
• Variance or standard deviation of return as
a measure of risk.
Muhammad Firman (University of Indonesia - Accounting ‘12 ) 126
Masterbook of Business and Industry (MBI)
CHAPTER 6
What is the probability that a customer visits the CONTINUOUS PROBABILITY DISTRIBUTIONS
chain five times in a 5-day period?
Demand for Salmon
Akiko Hamaguchi, manager of a small sushi
restaurant, Little Ginza, in Phoenix, Arizona, has to
The Hypergeometric Probability Distribution estimate the daily amount of salmon
A binomial random variable X is defined as the needed.Akiko has estimated the daily
number of successes in the n trials of a Bernoulli consumption of salmon to be normally distributed
process, and according to a Bernoulli process, with a mean of 12 pounds and a standard
those trials are deviation of 3.2 pounds.Buying 20 lbs of salmon
1. Independent every day has resulted in too much
2. The probability of success does not change wastage.Therefore, Akiko will buy salmon that
from trial to trial. meets the daily demand of customers on 90% of
the days.
In contrast, the hypergeometric probability
distribution is appropriate in applications where Based on this information, Akiko would like to:
we cannot assume the trials are independent.Use
Muhammad Firman (University of Indonesia - Accounting ‘12 ) 128
Masterbook of Business and Industry (MBI)
1. Symmetric
2. Bell-shaped
Closely approximates the probability distribution
of a wide range of random variables, such as the
• Heights and weights of newborn babies
• Scores on SAT
• Cumulative debt of college graduates
Serves as the cornerstone of statistical inference.
Characteristics of the Normal Distribution
Symmetricabout its mean
Mean = Median = Mode
Asymptotic—that is, the tails get closer and closer
to the horizontal axis, but never touch it
Example: Finding Probabilities for a Standard • A return of 10% is one standard deviation
Normal Random Variable Z. above the mean, or
One month after the marketing period ends, Anne survey. These respondents may be atypical
surveys 50 of her iced-coffee customers and finds: of the population as a whole.
• 46% were women.
• 34% were teenage girls. Sampling Methods
• They spent an average of $4.26 on the Simple random sample is a sample of n
drink. observations which has the same probability of
being selected from the population as any other
Anne wants to use this survey information to sample of n observations.Most statistical methods
calculate the probability that: presume simple random samples.However, in
• Customers spend an average of $4.26 or some situations other sampling methods have an
more on iced coffee. advantage over simple random samples.
• 46% or more of iced-coffee customers are
women. Example: In 1961, students invested 24 hours per
• 34% or more of iced-coffee customers are week in their academic pursuits, whereas today‘s
teenage girls. students study an average of 14 hours per week.A
dean at a large university in California wonders if
Sampling this trend is reflective of the students at her
Population—consists of all items of interest in a university. The university has 20,000 students and
statistical problem. the dean would like a sample of 100. Use Excel to
• Population Parameter is unknown. draw a simple random sample of 100 students.
Sample—a subset of the population. In Excel, choose Formulas > Insert function >
• Sample Statisticis calculated from sample RANDBETWEEN and input the values shown here.
and used to make inferences about the
population.
Bias—the tendency of a sample statistic to
systematically over-or underestimate a population
parameter.
Classic Case of a ―Bad‖ Sample: The Literary
Digest Debacle of 1936
• During the1936 presidential election, the
Literary Digest predicted a landslide victory
for Alf Landon over Franklin D. Roosevelt
(FDR) with only a 1% margin of error.
• They were wrong! FDR won in a landslide
election. Stratified Random Sampling
• The Literary Digest had committed Divide the population into mutually exclusive and
selection biasby randomly sampling from collectively exhaustive groups, called
their own subscriber/ membership lists, strata.Randomly select observations from each
etc. stratum, which are proportional to the stratum‘s
• In addition, with only a 24% response rate, size.
the Literary Digest had a great deal of non-
response bias.LO 7.2 Explain common Advantages:
sample biases. 1. Guarantees that the each population
subdivision is represented in the sample.
Selection bias 2. Parameter estimates have greater
a systematic exclusion of certain groups from precision than those estimated from
consideration for the sample. simple random sampling.
• The Literary Digest committed selection
bias by excluding a large portion of the Cluster Sampling
population (e.g., lower income voters). Divide population into mutually exclusive and
collectively exhaustive groups, called
Nonresponse bias clusters.Randomly select clusters.
a systematic difference in preferences between Sample every observation in those randomly
respondents and non-respondents to a survey or a selected clusters.
poll.
• The Literary Digesthad only a 24% Advantages and disadvantages:
response rate. This indicates that only 1. Less expensive than other sampling
those who cared a great deal about the methods.
election took the time to respond to the 2. Less precision than simple random
sampling or stratified sampling.
Muhammad Firman (University of Indonesia - Accounting ‘12 ) 134
Masterbook of Business and Industry (MBI)
Acceptance Sampling
Used at the completion of a production process or Control Charts for Quantitative Data
service.If a particular product does not conform to Upper controllimit (UCL):
certain specifications, then it is either discarded or
repaired.
Disadvantages Lower controllimit (LCL):
1. It is costly to discard or repair a product.
2. The detection of all defective products is
not guaranteed.LO 7.9
Detection Approach
Inspection occurs during the production process in
order to detect any nonconformance to
specifications.Goal is to determine whether the
production process should be continued or
adjusted before producing a large number of
defects.
Types of variation:
1. Chance variation. Process is in control—all pointsfall within the
2. Assignable variation. control limits.
Types of Variation Control Charts for Qualitative Data
Chance variation (common variation) is:
• Caused by a number of randomly occurring • chart (fraction defective or percent
events that are part of the production defective chart).
process. • Tracks proportion of defects in a
• Not controllable by the individual worker production process.
or machine. • Relies on central limit theorem for normal
• Expected, so not a source of alarm as long approximation for the sampling
as its magnitude is tolerable and the end distribution of the sample proportion.
product meets specifications. • Centerline—the mean when the process is
under control.
Assignable variation (special cause variation) is: • Upper control limit—set at +3 from the
• Caused by specific events or factors that centerline.
can usually be indentified and eliminated. Points falling above the upper control limit are
• Identified and corrected or removed. considered to be out of control.
Lower control limit—set at −3 from the
Control Charts centerline.
A plot of calculated statistics of the production Points falling below the lower control limit are
process over time.Production process is in considered to be out of control.
control‖ if the calculated statistics fall in an
expected range.Production process is out of Upper controllimit (UCL):
control‖ if calculated statistics reveal an
undesirable trend.
1. For quantitative data— chart.
2. For qualitative data— chart Lower controllimit (LCL):
Since
Confidence Intervals:
We get • 90%, a = 0.10, a/2 = 0.05, za/2 = z.05 =
1.645.
• 95%, a = 0.05, a/2 = 0.025, za/2 = z.025 =
1.96.
• 99%, a = 0.01, a/2 = 0.005, za/2 = z.005 =
Which, after algebraically manipulating, isequal to 2.575.
or, with 95% confidence, the mean weight of information from the sample of Granola Crunch
allcereal boxes falls between 1.008 and cereal boxes.
1.032pounds.
Interpreting a Confidence Interval This confidence interval width has increased from
Interpreting a confidence interval requires care. 0.024 to 2(0.015) = 0.030.
Incorrect: The probability that falls in
theinterval is 0.95.
Example:
Correct: If numerous samples of size n are
drawnfrom a given population, then 95% of the
intervalsformed by the formula will contain
.Since there are many possible samples, wewill be
right 95% of the time, thus giving us95%
confidence.
The Width of a Confidence Interval
Margin of Error
’
Confidence Interval Width
The t Distribution
The width of the confidence interval isinfluenced If repeated samples of size n are taken from
by the: anormal population with a finite variance, then
- Sample size n. thestatistic T follows the t distributionwith (n - 1)
- Standard deviation . degrees of freedom, df.
- Confidence level 100(1 - a)%.
a 2zn a 2 2z n
1.) For a given confidence level 100(1 -a)% and Degrees of freedom determinethe extent of the
sample size n, the width of the interval is wider, broadness of the tails of thedistribution; the fewer
the greater the population standard deviation . the degrees of freedom,the broader the tails.
Example: Let the standard deviation of the Summary of the tdfDistribution
population of cereal boxes of Granola Crunchbe • Bell-shaped and symmetric around 0 with
0.05 instead of 0.03. Compute a 95% confidence asymptotic tails (the tails get closer and
interval based on the same sample closer to the horizontal axis, but never
information.This confidence interval width has touch it).
increased from 0.024 to 2(0.020) = 0.040. • Has slightly broader tails than the
zdistribution.
2.) .For a given confidence level 100(1 - a)% and • Consists of a family of distributions where
population standard deviation , the width of the the actual shape of each one depends on
interval is wider, the smaller the sample size n. the df. As dfincreases, the tdfdistribution
becomes similar to the zdistribution; it is
Example: Instead of 25 observations, let the identical to the zdistribution when
sample be based on 16 cereal boxes of Granola dfapproaches infinity.
Crunch. Compute a 95% confidence interval using
a sample mean of 1.02 pounds and a population The tdfDistribution with Various Degrees of
standard deviation of 0.03. Freedom
Note that the tdfvalues change with the degrees Confidence Interval of the Population
of freedom. Further, as df increases, the Let the parameter p represent the proportion of
tdfdistribution begins to resemble the z successes in the population, where success is
distribution. defined by a particular outcome. is the point
estimator of the population proportion p.By the
A 100(1 - a)% confidence interval of thepopulation central limit theorem, can be approximated by a
mean when the population standarddeviation normal distribution for large samples (i.e., np >5
and n(1 -p) >5).Thus, a 100(1-a)% confidence
is not known is computed as interval of thepopulation proportion is
or equivalently,
or
where s is the sample standard deviation.
Example: Recall that Jared Beane wants to
estimate the mean mpg of all ultra-green cars. Use
the sample information to construct a 90% where is used to estimate the
confidence interval of the population mean. populationparameter p.
Assume that mpg follows a normal distribution.
Example: Recall that Jared Beane wants to
Solution: Since the population standard deviation estimate the proportion of all ultra-green cars that
is not known, the sample standard deviation has obtain over 100 mpg. Use the sample information
to be computed from the sample. As a result, the to construct a 90% confidence interval of the
90% confidence interval is population proportion.
Solution: Note that In addition, the normality
assumption is met since np >5 and n(1-p) >5. Thus,
Using Excel to construct confidence intervals.
The easiest way to estimate the mean when the
population standard deviation is unknown is as
follows:
Selecting a Useful Sample Size
1. Open the MPG data file. Precisionin interval estimates is implied by a low
2. From the menu choose Data > Data margin of error.The larger nreduces the margin of
Analysis > DescriptiveStatistics > OK. error for the interval estimates.How large should
the sample size be for a given margin of error?
Muhammad Firman (University of Indonesia - Accounting ‘12 ) 142
Masterbook of Business and Industry (MBI)
Selecting n to Estimate
Consider a confidence interval for m with aknown
and let D denote the desired margin oferror.
Since Where is a reasonable estimate of p in the
planning stage.
we may rearrange to get Example: Recall that Jared Beane wants to
construct a 90% confidence interval of the
proportion of all ultra-green cars that obtain over
100 mpg.Jared does not want the margin of error
to be more than 0.10.How large a sample does
If is unknown, estimate it with . Jared need for his analysis of the population
proportion?
For a desired margin of error D, the minimum
sample size n required to estimate a 100(1-a)%
confidence interval of the population mean is
CHAPTER 9
HYPOTHESIS TESTING
Where is a reasonable estimate of in the
planning stage.
Undergraduate Study Habits
Example: Recall that Jared Beane wants to Are today‘s college students studying hard or
construct a 90% confidence interval of the mean hardly studying?
mpg of all ultra-green cars.Suppose Jared would
like to constrain the margin of error to within 2 A recent study asserts that over the past five
mpg. Further, the lowest mpg in the population is decades the number of hours that the average
76 mpg and the highest is 118 mpg.How large a college student studies each week has been
sample does Jared need to compute the 90% steadily dropping (The Boston Globe, July 4,
confidence interval of the population mean? 2010).In 1961, students invested 24 hours per
week in their academic pursuits, whereas today‘s
students study an average of 14 hours per week.
As dean of a large university in California, Susan
Knight wonders if the study trend is reflective of
Selecting n to Estimate p students at her university.Susan randomly
Consider a confidence interval for p and let selected 35 students to ask about their average
Ddenote the desired margin of error.Since study time per week. Using these results, Susan
wants to
1. Determine if the mean study time of
students at her university is below the
1961 national average of 24 hours per
week.
Where is the sample proportionwe may 2. Determine if the mean study time of
rearrange to get students at her university differs from
today‘s national average of 14 hours per
week.
Introduction to Hypothesis Testing
Hypothesis tests resolve conflicts between two
Since comes from a sample, we must use a competing opinions (hypotheses).In a hypothesis
reasonable estimate of p, that is . test, define
• H0, the null hypothesis, the presumed
For a desired margin of error D, the minimum default state of nature or status quo.
sample size n required to estimate a 100(1-a)% • HA, the alternative hypothesis, a
confidence interval of the population proportion p contradiction of the default state of nature
is or status quo.
In statistics we use sample information to make
inferences regarding the unknown population
parameters of interest.We conduct hypothesis 3.Include some form of the equality sign in H0and
tests to determine if sample evidence contradicts use HAto establish a claim.
H0.On the basis of sample information, we either :
―Reject the null hypothesis‖, Sample evidence Example: A trade group predicts that back-to-
isinconsistent with H0. school spending will average $606.40 per family
―Do not reject the null hypothesis, Sample this year. A different economic model is needed if
evidence is not inconsistent with H0, We do not the prediction is wrong.
have enough evidence to accept‖ H0. • Parameter of interest is msince we are
interested in the average back-to-school
Defining the Null Hypothesis and Alternative spending.
Hypothesis • Since we want to determine if the
General guidelines: population mean differs from $606.4 (i.e,
• Null hypothesis, H0, states the status quo. ≠), it is a two-tail test.
• Alternative hypothesis, HA, states
whatever we wish to establish (i.e.,
contests the status quo).
Use the following signs in hypothesis tess Example: A television research analyst wishes to
test a claim that more than 50% of the households
will tune in for a TV episode. Specify the null and
the alternative hypotheses to test the claim.
• Parameter of interest is psince we are
interested in the proportion of households.
Note that H0always contains the equality.‖ • Since the analyst wants to determine
whether p> 0.50,it is a one-tail test.
One-Tailed versus Two-Tailed Hypothesis Tests
Two-Tailed Test
Reject H0on either side of the hypothesized value Type I and Type II Errors
of the population parameter. Type I Error: Committedwhen we reject H0when
H0is actually true.Occurs with probability a. ais
For example: chosen apriori.
Type II Error: Committed when we do not reject
The ≠‖ symbol in HAindicates that both tail areas H0and H0is actually false.Occurs with probability
of the distribution will be used to make the β. Power of the test = 1-β
decision regarding the rejection of H0.
For a given sample size n, a decrease in αwill
One-Tailed Test increase βand vice versa.Both αand βdecrease as
Reject H0only on one side of the hypothesized n increases.
value of the population parameter.
This table illustrates the decisions that may be
For example: made when hypothesis testing:
Decision Null Null
Hypothesis is hpothesis is
true false
Note that the inequality in HAdetermines which
tail area will be used to make the decision Reject the null Type I error Correct
regarding the rejection of H0. hpothesis decision
Three Steps to Formulate Hypotheses
1.Identify the relevant population parameter of
interest (e.g., or p). Don’t reject Correct Type II error
2.Determine whether it is a one-or a two-tailed the null decision
test hpothesis
Correct Decisions:
• Reject H0when H0is false.
• Do not reject H0when H0is true.
Incorrect Decisions:
Decision Rule:
Unstandardized Normal Distribution:
Standardized Normal Distribution:
Determining the critical value(s) depending on the
Step 3. Now compute the p-value: specification of the competing hypotheses.
Note that since ,this is a right-tail test.
Decision rule:
Step 1
Test Statistic for When is Unknown . When
the population standard deviation is unknown, the
test statistic fortesting the population mean is Step 2. Compute the value of the test statistic as:
assumedto follow the tdf distribution with (n -
1)degrees of freedom (df). The value of tdf is
computed as:
Step 3. Compute the p-value.
Since t34=1.94 > 0, the p-value for a two-tailed
test is2P(T34 > t34). Referencing the tdf table for
df = 34, wefind that the exact probability P(T34 >
Example The Critical value approach 1.94) cannot bedetermined.
Consider the following:
Look up t34= 1.94 in the t-table to find the p-value.
Step 1. State the hypotheses:
Thus,
Note that t34= 1.94 lies between 1.691 and
Step 2. Because n = 35 (i.e, n > 30), 2.032.Thus, 0.025 < P(T34>1.94) < 0.05. However,
we canassume that the sample mean is because this is a two-tail test, we multiply by two
normallydistributed and thus compute the value to get 0.05 < p-value < 0.10.
of the teststatistic as
0.05 < p-value < 0.10
α= 0.05.
Step 4.State the conclusion and interpret the
results.
Since the p-value satisfies 0.05 < p-value < 0.10, it
must be greater than a= 0.05.Thus, we do not
reject H0and conclude that the mean study time
Step 3.Based on HA,this is a left-tail test of students at the university is not statistically
For α= 0.05 and n-1 = 34 df, thecritical value different from today‘s national average of 14
ista,df= t0.05,34= 1.691 hours per week.
(-1.691 due to symmetry).
Muhammad Firman (University of Indonesia - Accounting ‘12 ) 148
Masterbook of Business and Industry (MBI)
Compute the test statistic using = 67/180 = 0.3722: Sampling distribution of is assumed tobe
normally distributed. A linear combination of
normally distributed randomvariables is also
normally distributed. If underlying distribution is
not normal, then by the centrallimit theorem, the
Step 3.Compute the p-value. sampling distribution of isapproximately
Based on HA: p< 0.4, this is a left-tailed test. normal only if both n1 > 30 and n2 > 30.
Compute the p-value as: P(Z<z) = P(Z <-0.76) =
0.2236. . Let the significance level a= 0.10. 1. If are known, a 100(1 -
a)%confidence interval of the difference
betweentwo population means is given by
Where
where and are the mean and
standarddeviation, respectively, of the n
sampledifferences, and d0 is a given
And hypothesizedmean difference.
Features of the F distribution
Inferences about the ratio of two
3. If are unknown and cannot be populationvariances are based on the ratio of
assumedequal, then thetest statistic is assumed thecorresponding sample variances . These
tofollow the tdf distribution andits value is inferences are based on a new
calculated as: distribution: the F distribution.. iIt is common to
use the notation F(df1,df2) whenreferring to the F
distribution.The distribution of the ratio of the
sample variances is the
F(df1,df2)distribution.Since the
F(df1,df2)distribution is a family of distributions,
Where each one is defined by two degrees of freedom
parameters, one for the numerator and one for
Muhammad Firman (University of Indonesia - Accounting ‘12 ) 150
Masterbook of Business and Industry (MBI)
Calculating MSE