Analysis of Covariance
Background
The analysis of covariance, a special case of regression analysis (Chapters 2-4), involves a continuous dependent variable and both categorical and continuous independent variables. The central purpose of an analysis of covariance is to compensate for influences on the dependent variable that interfere with direct comparisons among a series of categories. Situations arise, for example, where mean values differ to some extent because of the influence of one or more extraneous continuous variables.
The analysis of covariance seeks to remove the influence of these variables, called covariables, that bias direct comparisons among a series of categories. For example, the mean birth weights of a sample of infants differ when classified by mother's smoking habits (never smoked, past smokers, and present smokers), but the three groups also differ by maternal age (covariable). The contributions to the observed differences caused by maternal age, though of little interest, bias the evaluation of possible effects of maternal smoking exposure on birth weight. The analysis adjusts the mean birth weights among the three smoking categories so that comparisons are "free" from the influences of maternal age. The word free should not be taken too literally. Statistical adjustment, which is at the center of the analysis of covariance, depends on the validity (or, at least, the goodness-of-fit) of a specific statistical model. When this underlying model is appropriate, it is possible to remove effectively the influence of a covariable by regression techniques. Differences observed among the adjusted mean values are then no longer influenced by the effects of one or more extraneous variables. Similar to regression analysis, goodness-of-fit is addressed as part of the analysis. The following sections concentrate on the simplest analysis of covariance, followed by a less detailed description of more general and more complicated approaches.
Criterion
The basic analysis of covariance combines simple linear regression (Chapter 2) with one-way analysis of variance (reviewed in Chapter 1). The data are collected in pairs (a covariable and a dependent variable) and classified into a series of categories (illustrated in the accompanying table). The symbol x represents a continuous covariable and the sampled value of the dependent variable is denoted, as before, by y. The notation $x_{ij}$, $y_{ij}$ indicates the $j^{th}$ observation from the $i^{th}$ group. Furthermore, the covariable x is related to the dependent variable y and, on the average, differs among the g categories. The direct comparison of the sample mean values, therefore, reflects any effects from the categorical classification as well as influences from the covariable, making direct evaluation of the role of the categorical variable difficult without compensating for the influence of the covariable. If the variable x is unrelated to the dependent variable or does not differ among the g categories, then the comparison of the mean values is not affected by x and an analysis of covariance is unnecessary.
An analysis of covariance can be viewed as a sequential investigation of the fit of three nested statistical models. As before, the criterion to examine the goodness-of-fit of these competing models is an F-statistic, used to contrast residual sums of squares.
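To make this criterion concrete, the following minimal Python sketch contrasts the residual sums of squares of two nested models with an F-statistic. The function name and arguments are illustrative, not from the text; it assumes numpy-style inputs and that scipy is available.

```python
import numpy as np
from scipy import stats

def nested_f_test(res_reduced, res_full, df_reduced, df_full):
    """F-statistic contrasting the residual sums of squares of two nested
    models, where the reduced model is a special case of the full model."""
    numerator = (res_reduced - res_full) / (df_reduced - df_full)
    denominator = res_full / df_full
    f = numerator / denominator
    p = stats.f.sf(f, df_reduced - df_full, df_full)  # right-tail p-value
    return f, p
```

A small p-value indicates that the restriction imposed by the reduced model fits the data substantially worse than the full model.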
Model I: Interaction
The key to an analysis of covariance is the feasibility of a specific statistical model, called for simplicity model II. The effectiveness of model II is evaluated by a comparison with a more general statistical structure called model I (depicted in Figure 5.1). Model I postulates that the covariable is related to the dependent variable by a series of simple linear regression equations that differ among some or all of the g groups. The property that the relationship between the covariable and the dependent variable differs among groups is an interaction. Like the previous description of an interaction (expression 4.26), the relationship between variables x and Y depends on the value of a third variable (categories).
Specifically, a series of straight lines with different intercepts ($a_i$) and different slopes ($b_i$) describes the sample data within each group. In symbols, the expected value of the dependent variable for each group is given by the simple linear equation

$$y_{ij} = a_i + b_i x_{ij},$$

where $i$ indicates group membership and $j$ indicates the specific observation within the group. This statistical model allows the parameters of each regression line ($a_i$ and $b_i$) to differ among the $g$ categories ($i = 1, 2, \ldots, g$).
A simple linear regression analysis applied to the $n_i$ observed pairs ($x_{ij}$, $y_{ij}$) in each of the $g$ categories yields estimates of $a_i$ and $b_i$ denoted, as before, $\hat{a}_i$ and $\hat{b}_i$. The process involves $g$ separate regression analyses, requires $2g$ estimates, and produces $g$ estimated linear regression lines. Furthermore, each estimated regression line certainly fails to fit the data perfectly, and the lack of fit is measured by the residual sum of squares within each group. The residual sum of squares for the $i^{th}$ category is the sum of squared deviations of the points on the estimated regression line from each corresponding observation or, for category $i$,

$$Res_i = \sum_j (y_{ij} - \hat{y}_{ij})^2, \quad \text{where } \hat{y}_{ij} = \hat{a}_i + \hat{b}_i x_{ij},$$

is the residual sum of squares for a specific group. This residual sum of squares divided by the variance of the dependent variable has a chi-square distribution with $n_i - 2$ degrees of freedom when the requirements hold for a regression analysis (independence, normality, equal variance, and linearity). The estimation and evaluation of each of these $g$ regression equations is identical to the simple linear regression analysis described in Chapter 2; the process is just repeated $g$ times, once for each group.
An overall assessment of model I comes from summarizing the total fit of the $g$ regression equations by

$$Res(I) = \sum_{i=1}^{g} Res_i = \sum_i \sum_j (y_{ij} - \hat{y}_{ij})^2.$$

The value $Res(I)$ divided by the variance of $Y$ also has a chi-square distribution under model I with $N - 2g$ degrees of freedom, where $N = \sum n_i$ is the total number of observations. It is necessary to make $2g$ independent estimates to establish the $N$ estimated values under model I, yielding $N - 2g$ degrees of freedom.
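A minimal sketch of the model I computation, assuming the data are held in numpy arrays x, y, and group (hypothetical names): each category gets its own simple linear regression, and the within-group residual sums of squares are pooled into Res(I).

```python
import numpy as np

def fit_model_one(x, y, group):
    """Fit a separate simple linear regression within each of the g groups
    (model I) and return the pooled residual sum of squares Res(I)."""
    res_one = 0.0
    for g_label in np.unique(group):
        xg, yg = x[group == g_label], y[group == g_label]
        b, a = np.polyfit(xg, yg, 1)  # slope b_i and intercept a_i for group i
        res_one += np.sum((yg - (a + b * xg)) ** 2)
    return res_one
```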
Model II: No Interaction
Model II allows the expected values among the $g$ categories to differ, but the relationship between the covariable $x$ and the dependent variable $Y$ within each category is the same, namely linear with equal slopes (Figure 5.2). In symbols, model II is

$$y_{ij} = a_i + b' x_{ij}.$$
Model II is a special case of model I and provides the specific structure to statistically remove the influence of the covariable. A set of $g + 1$ parameters describes model II: the intercepts $a_i$ differ among categories, but the lines have the same slope, denoted $b'$. Because the relationship between the independent and dependent variable is the same in each group, no interaction exists (no-interaction model). In other words, the relationship between variables x and y does not depend on the values of a third variable (categories).
The fit of model II is also measured by a residual sum of squares. For each of the $g$ groups, the estimated linear model for the $i^{th}$ group is $\hat{y}_{ij} = \hat{a}_i + \hat{b}' x_{ij}$, producing the summary measure of fit of the $g$ regression equations

$$Res(II) = \sum_i \sum_j (y_{ij} - \hat{y}_{ij})^2$$

or, similar to model I, the sum over the $g$ categories of the within-group residual sums of squares.
The quantity represented by $Res(II)$ divided by the variance of $Y$ has a chi-square distribution under model II with $N - (g + 1)$ degrees of freedom. Because $g + 1$ estimates are required to produce the $N$ estimated values $\hat{y}_{ij}$, the degrees of freedom are $N - (g + 1)$. The residual sum of squares is related to a chi-square distribution when, in addition to the four usual regression analysis requirements, the slopes of the regression lines within each group are identical. Note that the intercept values $\hat{a}_i$ generated by model II are not equal to the $\hat{a}_i$-values used to define model I.
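A companion sketch for model II, under the same hypothetical data layout: the design matrix carries one indicator column per group (the g intercepts) plus a single column for x (the common slope b'), and Res(II) is the residual sum of squares of the least-squares fit.

```python
import numpy as np

def fit_model_two(x, y, group):
    """Fit model II: one intercept per group plus a single common slope b'.
    Returns the residual sum of squares Res(II)."""
    labels = np.unique(group)
    dummies = (group[:, None] == labels[None, :]).astype(float)  # g intercept columns
    design = np.column_stack([dummies, x])                       # plus common-slope column
    coef, _, _, _ = np.linalg.lstsq(design, y, rcond=None)       # g + 1 estimates
    fitted = design @ coef
    return np.sum((y - fitted) ** 2)
```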
Because model II is a restriction of model I, an F-statistic serves to assess whether model II is consistent with the data. The formal F-statistic is

$$F = \frac{[Res(II) - Res(I)] / (g - 1)}{Res(I) / (N - 2g)},$$

which has an F-distribution with $g - 1$ and $N - 2g$ degrees of freedom when the $g$ slopes are equal (no interaction).
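Putting the two residual sums of squares together, a sketch of the interaction test, reusing the hypothetical fit_model_one and fit_model_two functions from the earlier sketches:

```python
import numpy as np
from scipy import stats

# x, y, and group are hypothetical numpy arrays of equal length N
res_one = fit_model_one(x, y, group)   # Res(I): separate slopes per group
res_two = fit_model_two(x, y, group)   # Res(II): single common slope
g, N = len(np.unique(group)), len(y)

f_stat = ((res_two - res_one) / (g - 1)) / (res_one / (N - 2 * g))
p_value = stats.f.sf(f_stat, g - 1, N - 2 * g)  # small p suggests an interaction
```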
If the value of F is unlikely to have occurred by chance, then model II is not a tractable representation of the data and adjustment by linear regression techniques is not meaningful. The rejection of model II indicates a statistical interaction: the relationship between x and Y differs among some or all of the g categories. Geometrically, a large difference between the fit of model I and model II indicates that the relationship between x and Y is not adequately represented by a set of parallel straight lines. A direct consequence of rejecting model II (accepting model I) is that a comparison of the mean values depends on the value of x (Figure 5.1). A comparison of the mean values at one choice of x produces a difference $\bar{y}_i - \bar{y}_{i'}$ that differs for the same comparison at other choices of x; therefore, no general comparison can be made among the categories free of the influence of the covariable x.
When an F-statistic indicates that model II differs from model I by no more than chance variation, the no-interaction model provides a basis to remove the influence of the covariable x, yielding a set of adjusted mean values. Model II requires the differences between mean values to be the same for any choice of x. Adjusted mean values based on model II, free from the influence of the x-variate, are then directly compared to detect possible differences associated with the g categories. Clearly, the validity or the close fit of model II is crucial because the computation of the adjusted mean values depends on the requirement that the regression lines be parallel within all groups compared. Failure to reject model II does not unequivocally imply that the model represents the relationships within the data. It simply means that, in the face of no evidence to the contrary, the analysis proceeds as if the no-interaction model accurately describes the data.
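The adjusted mean values themselves are typically computed by evaluating each group's model II line at the overall mean of the covariable, that is, $\bar{y}_i - \hat{b}'(\bar{x}_i - \bar{x})$; this standard form is assumed here rather than quoted from the text. A minimal sketch, assuming the common slope estimate is already in hand:

```python
import numpy as np

def adjusted_means(x, y, group, b_common):
    """Adjusted mean of y for each group under model II: the observed group
    mean shifted along the common slope to the overall mean of x."""
    x_bar = x.mean()
    return {g_label: y[group == g_label].mean()
                     - b_common * (x[group == g_label].mean() - x_bar)
            for g_label in np.unique(group)}
```

Because the lines are parallel under model II, the differences among these adjusted means are the same at every value of x, which is precisely what makes the comparison free of the covariable.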