ECON 5360 Class Notes
Qualitative Dependent Variable Models
Here we consider models where the dependent variable is discrete in nature.
1 Linear Probability Model
Consider the linear probability (LP) model

$$y_i = \beta'x_i + \varepsilon_i$$

where $E(\varepsilon_i) = 0$. The conditional expectation

$$E(y_i \mid x_i) = \beta'x_i$$

is interpreted as the probability of an event occurring given $x_i$. There are a couple of drawbacks to the LP model that limit its use:
1. Heteroscedasticity. Given that $y_i \in \{0, 1\}$, the error term can take on only two values, with probabilities

$$\begin{array}{cc}
\varepsilon_i & f(\varepsilon_i) \\ \hline
1 - \beta'x_i & \beta'x_i \\
-\beta'x_i & 1 - \beta'x_i
\end{array}$$
so that the variance is

$$\operatorname{var}(\varepsilon_i) = \beta'x_i(1 - \beta'x_i)^2 + (1 - \beta'x_i)(\beta'x_i)^2 = \beta'x_i(1 - \beta'x_i) = E(y_i)[1 - E(y_i)].$$
2. Predictions outside $[0, 1]$. The predicted probabilities from the LP model, $\hat{y}_i = \hat{\beta}'x_i$, can be less than zero or greater than one.
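Both drawbacks are easy to see in a small simulation. The sketch below is a minimal illustration with made-up data (the coefficient values and sample size are arbitrary assumptions): it fits the LP model by OLS to a simulated binary outcome and checks the fitted probabilities against the $[0, 1]$ bounds.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated data: binary outcome driven by a single regressor
# (illustrative setup; coefficients 0.5 and 2.0 are assumptions).
n = 200
x = rng.normal(size=n)
p_true = 1 / (1 + np.exp(-(0.5 + 2.0 * x)))       # true probabilities
y = (rng.uniform(size=n) < p_true).astype(float)

# Linear probability model: OLS of y on a constant and x.
X = np.column_stack([np.ones(n), x])
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
y_hat = X @ beta_hat

# Fitted "probabilities" are not confined to [0, 1].
print("min fitted prob:", y_hat.min())
print("max fitted prob:", y_hat.max())
print("share outside [0,1]:", np.mean((y_hat < 0) | (y_hat > 1)))
```

With a steep enough underlying relationship, the fitted line exits the unit interval at extreme values of $x$, illustrating drawback 2 directly.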
2 Binomial Probit and Logit Models
The drawbacks of the LP model are solved by letting the probability of an event (i.e., $y = 1$) be given by a well-defined cumulative distribution function

$$\operatorname{Prob}(y_i = 1 \mid x) = \int_{-\infty}^{x'\beta} f(t)\,dt = F(x'\beta). \qquad (1)$$

In this manner, the predicted probabilities will always be bounded between zero and one. If $F(x'\beta)$ is the cdf for a standard normal random variable, we get the probit model. If

$$F(x'\beta) = \frac{e^{x'\beta}}{1 + e^{x'\beta}},$$

then we get the logit model. Estimates from the logit and probit models often give similar results. The logit model is less computationally intense because $F(x'\beta)$ has a closed form; however, the logistic pdf $f(\cdot)$ has fatter tails than the standard normal pdf. Because $y_i \in \{0, 1\}$ is discrete, while (1) implies continuity, we replace $y_i$ with the latent variable $y_i^*$. This produces

$$y_i^* = \beta'x_i + \varepsilon_i.$$
$y_i^*$ can be interpreted as an unobservable index function that measures individual $i$'s propensity to choose $y = 1$. For example, $y_i^*$ could be the net benefits (benefits less costs) of selecting option A. Alternatively, $y_i^*$ could be interpreted as the difference in utility derived from choosing option A less the utility of choosing option B. Therefore, we assume

$$\begin{array}{ll}
\text{if } y_i^* > 0 & \text{then } y_i = 1 \\
\text{if } y_i^* \le 0 & \text{then } y_i = 0.
\end{array}$$

The choice of zero as a threshold is innocuous if the vector $x_i$ includes a constant term.
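The latent-index setup above can be simulated directly. This sketch uses the probit case (standard normal errors) with assumed, purely illustrative coefficient values:

```python
import numpy as np

rng = np.random.default_rng(1)

# Latent-index data-generating process: y* = beta'x + eps with eps ~ N(0, 1),
# and y = 1 exactly when the latent index is positive.
n = 1000
x = np.column_stack([np.ones(n), rng.normal(size=n)])
beta = np.array([0.3, 1.0])              # illustrative (assumed) coefficients

y_star = x @ beta + rng.normal(size=n)   # unobserved index
y = (y_star > 0).astype(int)             # observed binary choice

print("observed share of y = 1:", y.mean())
```

By construction, only the sign of $y_i^*$ is observed through $y_i$; the index itself is never seen by the econometrician.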
2.1 Estimation
The parameters of the model are estimated via maximum likelihood. The relevant probability can be written
as
$$\operatorname{Prob}(y_i = 1 \mid x) = \operatorname{Prob}(y_i^* > 0 \mid x) = \operatorname{Prob}(\beta'x_i + \varepsilon_i > 0 \mid x) = \operatorname{Prob}(\varepsilon_i > -\beta'x_i \mid x).$$
Assuming a symmetric, mean-zero pdf for $\varepsilon_i$, we have

$$\operatorname{Prob}(\varepsilon_i > -\beta'x_i \mid x) = \operatorname{Prob}(\varepsilon_i < \beta'x_i \mid x).$$
It will be convenient to standardize $\varepsilon_i$, which gives

$$\operatorname{Prob}\left(\frac{\varepsilon_i}{\sigma} < \left(\frac{\beta}{\sigma}\right)'x_i \,\middle|\, x\right) = \Phi\left(\left(\frac{\beta}{\sigma}\right)'x_i\right),$$

where $\Phi(\cdot)$ and $\sigma$ are the cdf and standard deviation for $\varepsilon_i$, respectively. Therefore, the parameters are only identifiable up to the scalar $\sigma$, which is commonly set to unity (i.e., $\sigma = 1$). The likelihood function is given by
$$L = \prod_{i=1}^{n} \Phi_i^{y_i} (1 - \Phi_i)^{1 - y_i}$$
and the log-likelihood function is given by

$$\ln L(\beta) = \sum_{i=1}^{n} \left\{ y_i \ln(\Phi_i) + (1 - y_i) \ln(1 - \Phi_i) \right\}. \qquad (2)$$
Maximization of (2) will require nonlinear optimization methods, such as Newton's algorithm.
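As a concrete example, Newton's algorithm has a particularly clean form in the logit case: the gradient of (2) is $X'(y - p)$ and the Hessian is $-X'WX$ with $W = \operatorname{diag}(p_i(1 - p_i))$. The sketch below (simulated data; the coefficient values are arbitrary assumptions) iterates Newton steps to the ML estimates.

```python
import numpy as np

rng = np.random.default_rng(2)

# Newton's algorithm for the logit log-likelihood (2): with
# p_i = e^{x'b} / (1 + e^{x'b}), the gradient is X'(y - p) and the
# Hessian is -X'WX, W = diag(p_i(1 - p_i)).
def logit_mle(X, y, tol=1e-8, max_iter=50):
    beta = np.zeros(X.shape[1])
    for _ in range(max_iter):
        p = 1 / (1 + np.exp(-X @ beta))
        grad = X.T @ (y - p)
        hess = -X.T @ (X * (p * (1 - p))[:, None])
        step = np.linalg.solve(hess, grad)
        beta = beta - step                 # Newton update
        if np.max(np.abs(step)) < tol:
            break
    return beta

# Simulated check: recover the (assumed) coefficients.
n = 5000
X = np.column_stack([np.ones(n), rng.normal(size=n)])
beta_true = np.array([0.5, -1.0])
p = 1 / (1 + np.exp(-X @ beta_true))
y = (rng.uniform(size=n) < p).astype(float)

beta_hat = logit_mle(X, y)
print("beta_hat:", beta_hat)   # should be close to (0.5, -1.0)
```

For the probit case the gradient and Hessian involve the normal pdf and cdf instead, but the same iteration applies.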
2.2 Marginal Effects
The estimated coefficients, $\hat{\beta}_{ML}$, are problematic in two senses:

1. The true $\beta$s are not identified. Recall that all we can really estimate is $\beta/\sigma$.

2. Aside from problem #1, we know that

$$\hat{\beta}_k = \frac{\partial y_i^*}{\partial x_{i,k}}.$$

Because $y_i^*$ is an unobservable index function, it is difficult to interpret this derivative.
A simple solution is to calculate

$$\hat{\gamma}_{i,k} = \frac{\partial \operatorname{Prob}(y_i = 1)}{\partial x_{i,k}} = \phi\left(\left(\frac{\beta}{\sigma}\right)'x_i\right)\frac{\beta_k}{\sigma} \qquad (3)$$

where $\phi(\cdot)$ is the pdf for $\varepsilon_i$. The advantage of the estimated marginal effect, $\hat{\gamma}_{i,k}$, is that it only depends on $\beta/\sigma$ (so that it is identifiable) and it is easy to interpret. Note that $\hat{\gamma}_{i,k}$ depends on the entire vectors $x_i$ and $\beta$. The standard errors for $\hat{\gamma}_{i,k}$ can be calculated using the delta method, which is based on a first-order Taylor approximation. We have
$$\operatorname{asy.var}(\hat{\gamma}) = \left(\frac{\partial \hat{\gamma}}{\partial \hat{\beta}'}\right) V \left(\frac{\partial \hat{\gamma}}{\partial \hat{\beta}'}\right)'$$

where $V$ is the variance-covariance matrix for $\hat{\beta}_{ML}$.
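In the logit case, where the pdf satisfies $f(z) = F(z)(1 - F(z))$, both (3) and its delta-method standard error have closed forms. A minimal sketch, assuming illustrative values for $\hat{\beta}_{ML}$, its variance-covariance matrix $V$, and a representative $x$ (all made up for the example; the function name is hypothetical):

```python
import numpy as np

# Marginal effect (3) and its delta-method standard error for the logit
# case. Inputs beta_hat and V (ML estimates and their variance-covariance
# matrix) are assumed given.
def logit_marginal_effects(beta_hat, V, x):
    F = 1 / (1 + np.exp(-(x @ beta_hat)))   # Prob(y = 1 | x)
    f = F * (1 - F)                          # logistic pdf at beta'x
    gamma = f * beta_hat                     # marginal effects at x

    # Jacobian of gamma w.r.t. beta: f*I + f*(1 - 2F) * beta x'
    k = beta_hat.size
    J = f * np.eye(k) + f * (1 - 2 * F) * np.outer(beta_hat, x)
    se = np.sqrt(np.diag(J @ V @ J.T))       # delta-method standard errors
    return gamma, se

# Illustrative numbers (assumed, not estimated from real data).
beta_hat = np.array([0.5, -1.0])
V = np.array([[0.02, 0.00],
              [0.00, 0.03]])
x_bar = np.array([1.0, 0.2])                 # a representative x vector

gamma, se = logit_marginal_effects(beta_hat, V, x_bar)
print("marginal effects:", gamma)
print("delta-method s.e.:", se)
```

Note that the Jacobian term $f(1 - 2F)\,\beta x'$ captures how the pdf itself shifts as $\beta$ changes, which is why the marginal effect depends on the entire vectors $x_i$ and $\beta$.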