3.1. Poisson regression modelIn the

3.1. Poisson regression model

In the Poisson regression model, the i-th observation of the dependent variable yi is modeled as a random Poisson variable with mean λi:

equation(1)
View the MathML source
Turn MathJax on

The most important property of the Poisson regression model is the equality of the mean and variance of the distribution, which is the main shortcoming of this model and blocks the “overdispersion” of accident data. In the Poisson model, the conditional variance is equal to the conditional mean:

equation(2)
View the MathML source
Turn MathJax on

The log-likelihood of the Poisson regression model is given by

equation(3)
View the MathML source
Turn MathJax on

To estimate the regression coefficients by the maximum likelihood method, the derivative of the log-likelihood relative to the vector of coefficients, β, is set equal to zero:

equation(4)
View the MathML source
Turn MathJax on

The estimation of the regression coefficients in the Poisson regression model is not obtained with a direct equation, but the Newton-Raphson iteration procedure is used for estimating the unknown parameters of the model. To perform the estimation process, the corresponding iteration algorithm is analyzed by SAS statistical software to obtain the calculated coefficients. The predicted value of ŷi is the conditional mean or average number of accidents given xi, which is also denoted as λi, that is, the mean of the random variable yi, which follows a Poisson distribution. This value is typically not an integer number, whereas the observed value yi is a count (Anselin, 2002).

3.2. Negative binomial regression model

In the negative binomial (NB) model yi, the i-th observation of the dependent variable has the following probability distribution function:

equation(5)
View the MathML source
Turn MathJax on

The conditional mean of yi, for the vector of observed independent variables, xi, is given by

equation(6)
View the MathML source
Turn MathJax on

The NB distribution does not have the disadvantage of the Poisson distribution because there is no equality of the mean and variance of the distribution. The relationship between the mean and variance of an NB distribution is

equation(7)
View the MathML source
Turn MathJax on

The variance of the NB distribution is always greater than the mean. Thus, it fits the data with a variance greater than the mean. α = 1/r is the dispersion parameter. For accident data with low dispersion, the Poisson distribution gives plausible results, but, if overdispersion exists in the data, the variance of data will be greater than the mean. In such cases, the NB distribution is preferred. In the NB regression model, the dispersion parameter should be estimated in addition to regression parameters. r must be positive, so r* = lnr is estimated instead, which can take on any value. The log-likelihood function of the NB model for the i-th observation is obtained from the following equation:

equation(8)
View the MathML source
Turn MathJax on

To estimate β and r*, as in the Poisson model, the iteration procedure of Newton-Raphson is applied ( Agresti, 2002). The related iteration algorithm is written in SAS to obtain the unknown parameters. In the NB regression model, the predicted value of ŷi is the conditional mean or average number of accidents given xi. This is also denoted as μi, that is, the mean of the random variable yi, which follows the NB distribution.

3.1. Poisson regression model

In the Poisson regression model, the i-th observation of the dependent variable yi is modeled as a random Poisson variable with mean λi:

equation(1)
View the MathML source
Turn MathJax on

The most important property of the Poisson regression model is the equality of the mean and variance of the distribution, which is the main shortcoming of this model and blocks the “overdispersion” of accident data. In the Poisson model, the conditional variance is equal to the conditional mean:

equation(2)
View the MathML source
Turn MathJax on

The log-likelihood of the Poisson regression model is given by

equation(3)
View the MathML source
Turn MathJax on

To estimate the regression coefficients by the maximum likelihood method, the derivative of the log-likelihood relative to the vector of coefficients, β, is set equal to zero:

equation(4)
View the MathML source
Turn MathJax on

The estimation of the regression coefficients in the Poisson regression model is not obtained with a direct equation, but the Newton-Raphson iteration procedure is used for estimating the unknown parameters of the model. To perform the estimation process, the corresponding iteration algorithm is analyzed by SAS statistical software to obtain the calculated coefficients. The predicted value of ŷi is the conditional mean or average number of accidents given xi, which is also denoted as λi, that is, the mean of the random variable yi, which follows a Poisson distribution. This value is typically not an integer number, whereas the observed value yi is a count (Anselin, 2002).

3.2. Negative binomial regression model

In the negative binomial (NB) model yi, the i-th observation of the dependent variable has the following probability distribution function:

equation(5)
View the MathML source
Turn MathJax on

The conditional mean of yi, for the vector of observed independent variables, xi, is given by

equation(6)
View the MathML source
Turn MathJax on

The NB distribution does not have the disadvantage of the Poisson distribution because there is no equality of the mean and variance of the distribution. The relationship between the mean and variance of an NB distribution is

equation(7)
View the MathML source
Turn MathJax on

The variance of the NB distribution is always greater than the mean. Thus, it fits the data with a variance greater than the mean. α = 1/r is the dispersion parameter. For accident data with low dispersion, the Poisson distribution gives plausible results, but, if overdispersion exists in the data, the variance of data will be greater than the mean. In such cases, the NB distribution is preferred. In the NB regression model, the dispersion parameter should be estimated in addition to regression parameters. r must be positive, so r* = lnr is estimated instead, which can take on any value. The log-likelihood function of the NB model for the i-th observation is obtained from the following equation:

equation(8)
View the MathML source
Turn MathJax on

To estimate β and r*, as in the Poisson model, the iteration procedure of Newton-Raphson is applied ( Agresti, 2002). The related iteration algorithm is written in SAS to obtain the unknown parameters. In the NB regression model, the predicted value of ŷi is the conditional mean or average number of accidents given xi. This is also denoted as μi, that is, the mean of the random variable yi, which follows the NB distribution.

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

3.1. แบบจำลองถดถอย Poissonในแบบจำลองถดถอยของ Poisson ยี่ตัวแปรสังเกต i th จำลองเป็นตัวแปรสุ่มของ Poisson กับ λi หมายความว่า:equation(1)ดูแหล่งข้อมูล MathMLเปิด MathJaxคุณสมบัติสำคัญที่สุดของแบบจำลองถดถอย Poisson เป็นความเสมอภาคของค่าเฉลี่ยและความแปรปรวนของการกระจาย ซึ่งเป็นข้อบกพร่องหลักของรุ่นนี้และบล็อก "overdispersion" ของข้อมูลอุบัติเหตุ ในรูปแบบ Poisson ผลต่างตามเงื่อนไขเท่ากับเฉลี่ยแบบมีเงื่อนไข:equation(2)ดูแหล่งข้อมูล MathMLเปิด MathJaxล็อกโอกาสของรูปแบบการถดถอยของ Poisson ถูกกำหนดโดยequation(3)ดูแหล่งข้อมูล MathMLเปิด MathJaxการประมาณสัมประสิทธิ์การถดถอย โดยวิธีสูงสุดโอกาส อนุพันธ์ของล็อกโอกาสสัมพันธ์กับเวกเตอร์ของสัมประสิทธิ์ β ตั้งค่าเท่ากับศูนย์:equation(4)ดูแหล่งข้อมูล MathMLเปิด MathJaxไม่ได้รับการประมาณค่าสัมประสิทธิ์ถดถอยในแบบจำลองถดถอย Poisson กับสมการทางตรง แต่ใช้กระบวนการซ้ำ Raphson นิวตันสำหรับการประมาณพารามิเตอร์ของแบบจำลองที่ไม่รู้จัก เพื่อดำเนินการกระบวนการการประเมิน วิธีการเกิดซ้ำที่สอดคล้องกันการวิเคราะห์ โดยการคำนวณสัมประสิทธิ์ซอฟต์แวร์ทางสถิติ SAS ค่าของ ŷi คือ หมายถึงเงื่อนไขหรือจำนวนเฉลี่ย xi ซึ่งจะเขียนแทนเป็น λi นั่นคือ ความหมายของยี่แปรสุ่ม การแจกแจงปัวซองที่ ได้รับอุบัติเหตุ ค่านี้มักจะไม่ได้หมายเลขจำนวนเต็ม ในขณะที่ยี่สังเกตค่าเป็นจำนวนนับ (Anselin, 2002)3.2. ติดลบแบบจำลองการถดถอยทวินามในการลบทวินาม (NB) รุ่นยี่ สังเกต i th ของตัวแปรมีฟังก์ชันแจกแจงความน่าเป็นดังต่อไปนี้:equation(5)ดูแหล่งข้อมูล MathMLเปิด MathJaxหมายถึงเงื่อนไขของยี่ สำหรับเวกเตอร์ของตัวแปรอิสระสังเกต xi ถูกกำหนดโดยequation(6)ดูแหล่งข้อมูล MathMLเปิด MathJaxการกระจายของ NB ไม่มีข้อเสียของการแจกแจงปัวซองเนื่องจากมีความไม่เสมอภาคของค่าเฉลี่ยและความแปรปรวนของการกระจาย ความสัมพันธ์ระหว่างค่าเฉลี่ยและความแปรปรวนของการกระจาย NB เป็นequation(7)ดูแหล่งข้อมูล MathMLเปิด MathJaxความแปรปรวนของการกระจาย NB อยู่เสมอมากกว่าหมายถึง ดังนั้น มันเหมาะกับข้อมูล ด้วยผลต่างมากกว่าหมายถึงการ Α = 1/r คือ พารามิเตอร์ของการกระจาย อุบัติเหตุข้อมูลมีการกระจายตัวต่ำ การแจกแจงปัวซองให้ผลลัพธ์ที่เป็นไปได้ แต่ ถ้า overdispersion ที่มีอยู่ในข้อมูล ค่าความแปรปรวนของข้อมูลจะมากกว่าหมายถึง ในกรณีดังกล่าว การกระจาย NB เป็นที่ต้อง ในแบบจำลองถดถอย NB พารามิเตอร์การกระจายควรประเมินนอกจากพารามิเตอร์การถดถอย r ต้องเป็นค่าบวก ดังนั้น r * = lnr ประมาณแทน ซึ่งสามารถใช้ค่าใด ๆ ฟังก์ชันล็อกโอกาสของรุ่น NB สำหรับการสังเกต i th จะได้รับจากสมการต่อไปนี้:equation(8)ดูแหล่งข้อมูล MathMLเปิด MathJaxประเมินβและ r * เป็นในรูปแบบ Poisson ขั้นตอนการเกิดซ้ำของนิวตัน Raphson ใช้ (Agresti, 2002) ขั้นตอนวิธีการเกิดซ้ำที่เกี่ยวข้องเขียนไว้ใน SAS การรับพารามิเตอร์ที่ไม่รู้จัก ในแบบจำลองถดถอย NB ค่าของ ŷi คือ หมายถึงเงื่อนไขหรือเฉลี่ยจำนวนอุบัติเหตุที่ได้รับสิ นี้จะเขียนแทนเป็น μi นั่นคือ ความหมายของยี่แปรสุ่ม ซึ่งกระจาย NB

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

3.1 ตัวแบบการถดถอยปัวซองในรูปแบบการถดถอยปัวซองสังเกต I-TH ของ Yi ตัวแปรตามเป็นแบบจำลองเป็นตัวแปร Poisson สุ่มที่มีλiหมายถึง: สมการ (1) ดูแหล่งที่มา MathML เปิด MathJax ในคุณสมบัติที่สำคัญที่สุดของรูปแบบการถดถอยปัวซอง คือความเท่าเทียมกันของค่าเฉลี่ยและความแปรปรวนของการกระจายซึ่งเป็นข้อบกพร่องหลักของรุ่นนี้และบล็อก "overdispersion" ของข้อมูลอุบัติเหตุ ในรูปแบบ Poisson, ความแปรปรวนเงื่อนไขเท่ากับค่าเฉลี่ยเงื่อนไข: สมการ (2) ดูแหล่งที่มา MathML เปิด MathJax บนเข้าสู่ระบบความน่าจะเป็นของรูปแบบการถดถอยปัวซองจะได้รับจากสมการ (3) ดูแหล่งที่มา MathML เปิด MathJax ในการ ประมาณการค่าสัมประสิทธิ์การถดถอยโดยวิธีโอกาสสูงสุดที่มาของการเข้าสู่ระบบความน่าจะเทียบกับเวกเตอร์ของค่าสัมประสิทธิ์ที่βถูกตั้งค่าเท่ากับศูนย์: สมการ (4) ดูแหล่งที่มา MathML เปิด MathJax ในการประมาณค่าสัมประสิทธิ์การถดถอยใน รูปแบบการถดถอยปัวซองไม่ได้รับกับสมการโดยตรง แต่ Newton-Raphson ซ้ำขั้นตอนที่ถูกนำมาใช้ในการประมาณค่าพารามิเตอร์ที่ไม่รู้จักของรูปแบบ เพื่อดำเนินการขั้นตอนการประเมินขั้นตอนวิธีการทำซ้ำที่สอดคล้องกันคือการวิเคราะห์ทางสถิติโดยซอฟต์แวร์ SAS ที่จะได้รับการคำนวณค่าสัมประสิทธิ์ มูลค่าที่คาดการณ์ของ Yi เป็นค่าเฉลี่ยเงื่อนไขหรือค่าเฉลี่ยของจำนวนการเกิดอุบัติเหตุได้รับจินซึ่งเป็นที่แสดงยังเป็นλiที่เป็นค่าเฉลี่ยของตัวแปรสุ่ม Yi ซึ่งต่อไปนี้การกระจาย Poisson ค่านี้เป็นมักจะไม่เป็นจำนวนเต็มขณะที่ค่า Yi สังเกตคือการนับ (Anselin, 2002). 3.2 เชิงลบแบบการถดถอยทวินามในเชิงลบทวินาม (NB) รุ่นยี่สังเกต I-TH ของตัวแปรตามมีฟังก์ชันการแจกแจงความน่าจะเป็นดังต่อไปนี้: สมการ (5) ดูแหล่งที่มา MathML เปิด MathJax บนหมายถึงเงื่อนไขของ Yi สำหรับเวกเตอร์ ของตัวแปรสังเกตอิสระจินจะได้รับจากสมการ (6) ดูแหล่งที่มา MathML เปิด MathJax ในการกระจาย NB ไม่ได้มีข้อเสียของการกระจาย Poisson เนื่องจากมีความเท่าเทียมกันไม่มีของค่าเฉลี่ยและความแปรปรวนของการกระจาย ความสัมพันธ์ระหว่างค่าเฉลี่ยและความแปรปรวนของการกระจาย NB เป็นสมการ (7) ดูแหล่งที่มา MathML เปิด MathJax บนแปรปรวนของการกระจาย NB อยู่เสมอมากกว่าค่าเฉลี่ย ดังนั้นจึงเหมาะกับข้อมูลที่มีความแปรปรวนสูงกว่าค่าเฉลี่ย α = 1 / R พารามิเตอร์กระจาย สำหรับข้อมูลการเกิดอุบัติเหตุที่มีการกระจายตัวในระดับต่ำการกระจาย Poisson ให้ผลลัพธ์ที่เป็นไปได้ แต่ถ้า overdispersion มีอยู่ในข้อมูลความแปรปรวนของข้อมูลที่จะสูงกว่าค่าเฉลี่ย ในกรณีเช่นนี้การกระจาย NB เป็นที่ต้องการ ในรูปแบบการถดถอย NB พารามิเตอร์การกระจายตัวที่ควรจะประมาณนอกเหนือไปจากการถดถอยพารามิเตอร์ R ต้องเป็นบวกดังนั้น R * = LNR เป็นที่คาดกันแทนซึ่งสามารถใช้ในค่าใด ๆ ฟังก์ชั่นการเข้าสู่ระบบน่าจะเป็นของรุ่น NB สำหรับการสังเกต I-TH จะได้รับจากสมการต่อไปนี้: สมการ (8) ดูแหล่งที่มา MathML เปิด MathJax ในการประเมินβและ R * เช่นเดียวกับในรูปแบบปัวซองซ้ำขั้นตอนของการ Newton-Raphson ถูกนำไปใช้ (อาเกรสติ, 2002) ขั้นตอนวิธีการทำซ้ำที่เกี่ยวข้องกับการเขียนใน SAS ที่จะได้รับค่าพารามิเตอร์ที่ไม่รู้จัก ในรูปแบบการถดถอย NB ค่าคาดการณ์ของ Yi เป็นเงื่อนไขค่าเฉลี่ยหรือค่าเฉลี่ยของการเกิดอุบัติเหตุได้รับ Xi นี้จะแสดงเป็นμiที่เป็นค่าเฉลี่ยของตัวแปรสุ่ม Yi ซึ่งต่อไปนี้การกระจาย NB

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

3.1 . รูปแบบถดถอยปัวซอในแบบจำลองการถดถอยปัวซง , i-th การสังเกตของยีตัวแปรแบบตัวแปรสุ่มปัวซงกับหมายถึงλฉัน :สมการ ( 1 )ดู MathML แหล่งเปิด mathjax บนคุณสมบัติที่สำคัญที่สุดของปัวซงสมการถดถอยแบบจำลองคือความเสมอภาคของค่าเฉลี่ยและความแปรปรวนของการกระจาย ซึ่งเป็นจุดอ่อนหลักของรุ่นนี้และบล็อก " overdispersion " ของข้อมูลอุบัติเหตุ ในพารามิเตอร์โมเดล แปรปรวนเงื่อนไขเท่ากับค่าเฉลี่ย เงื่อนไข :สมการ ( 2 )ดู MathML แหล่งเปิด mathjax บนบันทึกความน่าจะเป็นของการถดถอยปัวชงแบบที่จะได้รับโดยสมการ ( 3 )ดู MathML แหล่งเปิด mathjax บนการประเมินสัมประสิทธิ์ถดถอยโดยวิธีความควรจะเป็นสูงสุด จาก บันทึกโอกาสเมื่อเทียบกับเวกเตอร์ของสัมประสิทธิ์บีตาเป็นชุดเท่ากับศูนย์ :สมการ ( 1 )ดู MathML แหล่งเปิด mathjax บนการประมาณค่าของสัมประสิทธิ์ถดถอยในแบบจำลองการถดถอยปัวซอได้มาด้วยสมการโดยตรง แต่นิวตันราฟสันทำซ้ำขั้นตอนที่ใช้สำหรับประมาณค่าพารามิเตอร์ที่ไม่รู้จักของแบบจำลอง แสดงขั้นตอนการทำซ้ำที่สอดคล้องกันขั้นตอนวิธีวิเคราะห์โดยสถิติซอฟต์แวร์ SAS ได้รับการคำนวณค่าสัมประสิทธิ์ การทำนายค่าŷผมคือหมายถึงเงื่อนไขหรือจำนวนการเกิดอุบัติเหตุให้ ซี ซึ่งจะแสดงเป็นλผม นั่นคือ ค่าเฉลี่ยของตัวแปรสุ่ม อี , ข้อมูลการแจกแจงปัวซง . มูลค่านี้มักจะไม่ได้เป็นเลขจำนวนเต็ม และสังเกตค่าอีคือการนับ ( anselin , 2002 )3.2 . การวิเคราะห์การถดถอยทวินามลบในทางลบแบบทวินาม ( NB ) รุ่นอี , i-th การสังเกตของตัวแปรฟังก์ชันการแจกแจงความน่าจะเป็นดังนี้สมการ ( 5 )ดู MathML แหล่งเปิด mathjax บนหมายถึงเงื่อนไขของยี เพื่อสังเกตเวกเตอร์ของตัวแปรอิสระ , Xi , จะได้รับโดยสมการที่ ( 6 )ดู MathML แหล่งเปิด mathjax บนส่วน NB กระจายไม่มีข้อเสียของการแจกแจงปัวซงเพราะไม่มีความเสมอภาคของค่าเฉลี่ยและความแปรปรวนของการแจกแจง ความสัมพันธ์ระหว่างค่าเฉลี่ยและความแปรปรวนของ NB กระจายสมการที่ ( 7 )ดู MathML แหล่งเปิด mathjax บนความแปรปรวนของ NB กระจายเสมอมากกว่าค่าเฉลี่ย ดังนั้น จึงเหมาะกับข้อมูลที่มีความแปรปรวนมากขึ้นกว่าค่าเฉลี่ย α = 1 / R คือการแพร่กระจายของพารามิเตอร์ สำหรับข้อมูลอุบัติเหตุ ด้วยการกระจายต่ำ การแจกแจงปัวซงให้ผลที่น่าเชื่อถือ แต่ถ้า overdispersion ที่มีอยู่ในข้อมูล ความแปรปรวนของข้อมูลจะมากกว่าค่าเฉลี่ย ในบางกรณี , NB แบบที่ต้องการ ใน NB ตัวแบบการถดถอย , การประมาณพารามิเตอร์ควรนอกจากพารามิเตอร์การถดถอย R ต้องเป็นบวก ดังนั้น R * = ในปริมาณประมาณแทน ซึ่งสามารถใช้ค่าใด ๆ บันทึกโอกาสการทำงานของรุ่น NB สำหรับ i-th สังเกตได้จากสมการดังต่อไปนี้สมการ ( 8 )ดู MathML แหล่งเปิด mathjax บนประมาณการบีตาและ R * ในพารามิเตอร์โมเดล ทำซ้ำขั้นตอนของนิวตันราฟสันที่ใช้ ( agresti , 2002 ) ที่เกี่ยวข้องกับการวิธีการเขียนใน SAS เพื่อให้ได้ค่าพารามิเตอร์ที่ไม่รู้จัก ใน NB ตัวแบบการถดถอย , คาดการณ์มูลค่าของŷผมคือหมายถึงเงื่อนไขหรือจำนวนการเกิดอุบัติเหตุให้ Xi นี้จะแสดงเป็นμผม นั่นคือ ค่าเฉลี่ยของตัวแปรสุ่มยีซึ่งตามนบี การกระจายสินค้า

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.