This study aimed to gain knowledge

This study aimed to gain knowledge of students' beliefs and difficulties in understanding p-values, and to use this knowledge to develop improved teaching programs. This study took place over four consecutive teaching semesters of a one-semester tertiary statistics unit. The study was cyclical, in that the results of each semester were used to inform the instructional design for the following semester. Over the semesters, the following instructional techniques were introduced computer simulation, the introduction of hypothetical probabilistic reasoning using a familiar context, and the use of alternative representations. The students were also encouraged to write about their work. As the interventions progressed a higher proportion of students successfully defined and used p-values in Null Hypothesis Testing procedures.
1. INTRODUCTION
This study examined students' problems in understanding p-values, and the results of an intervention that aimed to improve this understanding. Null Hypothesis Testing (NHT) is one of the main techniques in inferential statistics, yet previous research has shown that the concept of the p- value can be problematic for students (Batanero, 2008: Gliner, Leech & Morgan, 2002; Nickerson 2000)
P-values have come about from the desire to estimate the likelihood that a sample was drawn from a population with a specified value for the population parameter. When a venous blood sample has been taken correctly, the sample will be like the blood in the rest of the venous system. In most sampling situations, however, it is extremely unlikely that a sample will be exactly representative of the population. If another sample were taken, it too is unlikely to be exactly representative of the population, and in addition, unlikely to be exactly like the first sample. Despite this, researchers know that the sample will in some way tend to resemble the population, and that it is still possible to make conclusions about the population, even if it is not possible to be absolutely certain about the accuracy conclusions.
One way around the problem of uncertainty is to perform NHT. With this process, a proposition (the null hypothesis) is made about a population parameter. A sample is then collected, the relevant sample statistic calculated, and a judgment is made as to how likely the sample statistic (or one even more extreme) would be if the proposition about the parameter were true. In the NHT process, this judgment is made by calculating a conditional probability, the probability of obtaining the sample with the given or more extreme statistic, if the population has the parameter proposed in the null hypothesis. It is this probability that is known as the p-value. One way to interpret this P-value is to compare it to a pre-set value. If this p-value is below the pre-set value, it is concluded that it is unlikely that the sample came from a population with the stated null hypothesis and the null hypothesis is rejected. If this p-value is above the pre-set value, then it is concluded that the sample could have come from a population with the proposed value and one fails to reject the null hypothesis.
Previous research shows that students of statistics can have problems understanding this process, and this lack of understanding can be undetected by their instructors because the students may follow the procedures accurately (Garfield & Ahlgren, 1988) It is only when questions are asked that require students to describe their reasoning that this lack of understanding is detected. The aim of this study was to gain knowledge of students' beliefs and difficulties in understanding p-values, and to use this knowledge to develop teaching programs to enhance student understandings of this concept. The research questions were: What are students' understandings of p-values? What misconceptions may they hold? And can teaching methods be developed to improve students' understandings?

1.1. LITERATURE REVIEW
A null hypothesis test starts with the statement of the null hypothesis containing the proposed value of the population parameter. Previous research shows that students may believe that this hypothesis refers to both the sample and the population, and are therefore confused about NHT form the very start the process (Sotos, Vanhoof,Ven den Noortgate,2007). It has also been found that students may carry out the procedures for NHT correctly, but then may misinterpret the results through lack of understanding of what rejecting and failing to reject the null hypothesis really indicates. This problem was investigated by Haller and Krauss (2002) who conducted a survey of staff and students, some of whom were statistics instructors, from the psychology departments of six universities. In this survey, an example of an independent samples t-test was given where the p-value was 0.01. Approximately 26% of the participants (including a small number of statistics methodology instructors) agreed with the statement: "You have found the probability of the null hypothesis being true." Approximately 69% of the participants (including approximately one third of the statistics methodology instructors) agreed with the statement: "You know, if you decide to reject the hypothesis, the probability that you are making the wrong decision." Those who agreed with this statement did not seem to be aware of the conditional nature of the probability the p-value represents. That is, the p-value is the probability of making the wrong decision if the null hypothesis is true.
The belief that the p-value is the probability that the null hypothesis is true appears to be a commonly held misconception. A related misconception is that 1-P is the probability that the alternative hypothesis is true. It may also be believed that rejecting a null hypothesis proves the underlying theory that predicted the rejection. It may also be believed that a low the value for the p-value suggests that the results are replicable (Nickerson, 2000)
1.2. WHY USE P-VALUES?
The use of the null-hypothesis test is widespread and p-values are reported widely in the literature. The way a p-value is used differs and is the subject of debate (cumming, 2010: Gliner, Leech, & Morgan, 2002: Hubbard & Lindsay, 2008). One way p values can be used, attributed to Neyman and Pearson, is that a pre-existing level of significance is chosen, and the null hypothesis is rejected if the p-value is less than this level of significance. This form of analysis leads to the possible calculation of Type I and Type II error rates. An alternative (advocated by Fisher) is to look at the level of support a
p-value gives to a null hypothesis. As the p-value decreases, the level of support given for the null hypothesis is also considered to decrease (Wagenmakers, 2007). Recently, however, the question has been asked: should p-values be used at all?
One tenet of a scientific experiment is that it should be replicable. Therefore, it would seem not unreasonable to assume that if an experiment should be repeatable then the p-value would also be replicable. Cumming (2010) has shown that in fact p-values vary much more from sample to sample than many researchers realise. Hubbard and Lindsay (2008) show that p-values can vary even with the same data, depending on the method of analysis chosen by the researcher and on whether the researcher has chosen a one- or two-tailed test.
Another problem with p-values is that they do not indicate the effect size. A small study with a large effect size can yield the same p-value as a large study with small effect size (Hubbard & Lindsay, 2008: Wagenmakers, 2007). In addition, there is concern about the validity of the way p-values are calculated. Assuming the null hypothesis is true, a p-value is the probability of the observed data and the probability of more extreme data, yet these more extreme data are not actually observed. It is questionable whether decisions should be made on unobserved data (Hubbard & Lindsay, 2008).
It is for these reasons that it has been suggested that the results of scientific experiments should instead be presented as confidence interval estimates of the parameters. Confidence intervals have the advantage that they are in the same units as the point estimate, and make it easier for the reader to determine if an effect is important, rather than just if it is statistically significant. of even more consequence is that confidence intervals give an idea of the precision of an estimate via the width of the interval. In addition, the width of the interval gives an idea of what the infinite set of possible results may look like (Cumming, 2010; Wagenmakers, 2007). The contrast between the variation in p- values and the variation in confidence intervals is graphically and amusingly illustrated by the "Dance of the p-values

1.1. LITERATURE REVIEW
 A null hypothesis test starts with the statement of the null hypothesis containing the proposed value of the population parameter. Previous research shows that students may believe that this hypothesis refers to both the sample and the population, and are therefore confused about NHT form the very start the process (Sotos, Vanhoof,Ven den Noortgate,2007). It has also been found that students may carry out the procedures for NHT correctly, but then may misinterpret the results through lack of understanding of what rejecting and failing to reject the null hypothesis really indicates. This problem was investigated by Haller and Krauss (2002) who conducted a survey of staff and students, some of whom were statistics instructors, from the psychology departments of six universities. In this survey, an example of an independent samples t-test was given where the p-value was 0.01. Approximately 26% of the participants (including a small number of statistics methodology instructors) agreed with the statement: "You have found the probability of the null hypothesis being true." Approximately 69% of the participants (including approximately one third of the statistics methodology instructors) agreed with the statement: "You know, if you decide to reject the hypothesis, the probability that you are making the wrong decision." Those who agreed with this statement did not seem to be aware of the conditional nature of the probability the p-value represents. That is, the p-value is the probability of making the wrong decision if the null hypothesis is true. 
The belief that the p-value is the probability that the null hypothesis is true appears to be a commonly held misconception. A related misconception is that 1-P is the probability that the alternative hypothesis is true. It may also be believed that rejecting a null hypothesis proves the underlying theory that predicted the rejection. It may also be believed that a low the value for the p-value suggests that the results are replicable (Nickerson, 2000)
1.2. WHY USE P-VALUES? 
The use of the null-hypothesis test is widespread and p-values are reported widely in the literature. The way a p-value is used differs and is the subject of debate (cumming, 2010: Gliner, Leech, & Morgan, 2002: Hubbard & Lindsay, 2008). One way p values can be used, attributed to Neyman and Pearson, is that a pre-existing level of significance is chosen, and the null hypothesis is rejected if the p-value is less than this level of significance. This form of analysis leads to the possible calculation of Type I and Type II error rates. An alternative (advocated by Fisher) is to look at the level of support a 
p-value gives to a null hypothesis. As the p-value decreases, the level of support given for the null hypothesis is also considered to decrease (Wagenmakers, 2007). Recently, however, the question has been asked: should p-values be used at all? 
One tenet of a scientific experiment is that it should be replicable. Therefore, it would seem not unreasonable to assume that if an experiment should be repeatable then the p-value would also be replicable. Cumming (2010) has shown that in fact p-values vary much more from sample to sample than many researchers realise. Hubbard and Lindsay (2008) show that p-values can vary even with the same data, depending on the method of analysis chosen by the researcher and on whether the researcher has chosen a one- or two-tailed test. 
Another problem with p-values is that they do not indicate the effect size. A small study with a large effect size can yield the same p-value as a large study with small effect size (Hubbard & Lindsay, 2008: Wagenmakers, 2007). In addition, there is concern about the validity of the way p-values are calculated. Assuming the null hypothesis is true, a p-value is the probability of the observed data and the probability of more extreme data, yet these more extreme data are not actually observed. It is questionable whether decisions should be made on unobserved data (Hubbard & Lindsay, 2008).
It is for these reasons that it has been suggested that the results of scientific experiments should instead be presented as confidence interval estimates of the parameters. Confidence intervals have the advantage that they are in the same units as the point estimate, and make it easier for the reader to determine if an effect is important, rather than just if it is statistically significant. of even more consequence is that confidence intervals give an idea of the precision of an estimate via the width of the interval. In addition, the width of the interval gives an idea of what the infinite set of possible results may look like (Cumming, 2010; Wagenmakers, 2007). The contrast between the variation in p- values and the variation in confidence intervals is graphically and amusingly illustrated by the "Dance of the p-values

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

การศึกษานี้มีวัตถุประสงค์เพื่อรับรู้ความเชื่อและความยากลำบากในการทำความเข้าใจเกี่ยวกับค่า p นักเรียน และการใช้ความรู้นี้ไปพัฒนาปรับปรุงการสอนโปรแกรม การศึกษานี้เอาสถานภาคสอนติดต่อกัน 4 หน่วยสถิติระดับมหาวิทยาลัยภาคการศึกษาหนึ่ง การศึกษาเป็นห่วง ที่ผลลัพธ์ของแต่ละภาคการศึกษาที่ใช้ในการแจ้งการออกแบบการเรียนการสอนในภาคเรียนต่อไปนี้ ผ่านภาค เทคนิคการจัดการเรียนการสอนต่อไปนี้ได้นำคอมพิวเตอร์จำลอง การแนะนำของสมมุติ probabilistic ใช้เหตุผลใช้บริบทที่คุ้นเคย และการใช้ของอื่นแทน นักเรียนให้เขียนเกี่ยวกับงานของเขายังได้ เป็นงานหน้าไปเพียงใด สัดส่วนสูงของนักเรียนกำหนดเรียบร้อยแล้ว และใช้ค่า p ในขั้นตอนการทดสอบสมมติฐานเป็น Null1. บทนำ การศึกษานี้ตรวจสอบปัญหานักเรียนในความเข้าใจค่า p และผลของการแทรกแซงที่มุ่งความเข้าใจนี้ การทดสอบสมมติฐานของ null (NHT) เป็นหนึ่งในเทคนิคหลักสถิติเพียงน้อยนิด ยังวิจัยก่อนหน้านี้ได้แสดงให้เห็นว่า แนวคิดของค่า p จะมีปัญหาสำหรับนักเรียน (Batanero, 2008: Gliner ปลิง และ มอร์แกน 2002 Nickerson 2000) ค่า P มาจากความปรารถนาที่จะประเมินโอกาสที่ตัวอย่างมาจากประชากรกับค่าที่ระบุสำหรับพารามิเตอร์ของประชากร เมื่อนำตัวอย่างเลือดดำถูกต้อง จะมีตัวอย่างเช่นเลือดในส่วนเหลือของระบบดำ ในสถานการณ์ส่วนใหญ่สุ่มตัวอย่าง อย่างไรก็ตาม ก็ไม่มากน่าตัวอย่างจะเป็นเหมือนตัวแทนของประชากร ถ้าถูกดำเนินการอย่างอื่น ก็เกินไปไม่น่าจะเหมือนตัวแทน ของประชากร และนอกจาก นี้ น่าจะเหมือนกับตัวอย่างแรก แม้นี้ นักวิจัยทราบว่า ตัวอย่างจะในบางวิธีมีแนวโน้มคล้ายประชากร และว่า จะยังคงสามารถให้ข้อสรุปเกี่ยวกับประชากร แม้ว่าไม่สามารถจะแน่ใจเกี่ยวกับบทสรุปของความถูกต้องวิธีหนึ่งปัญหาของความไม่แน่นอนคือการ ทำ NHT ด้วยกระบวนการนี้ ข้อเสนอ (สมมติฐานว่าง) ทำเกี่ยวกับพารามิเตอร์ของประชากร ตัวอย่างแล้วรวบ รวม คำนวณสถิติตัวอย่างที่เกี่ยวข้อง และทำการพิพากษาเป็นตัวอย่างแนวโน้มวิธีสถิติ (หรือยิ่งมากหนึ่ง) จะถ้าเสนอเกี่ยวกับพารามิเตอร์แท้จริง ระหว่าง NHT คำพิพากษานี้จะทำ โดยการคำนวณแบบมีเงื่อนไขความน่าเป็น ความน่าเป็นของการได้รับตัวอย่าง ด้วยสถิติกำหนด หรือมากขึ้น ถ้าประชากรมีพารามิเตอร์ที่นำเสนอในสมมติฐานว่าง เป็นค่าที่เรียกว่าค่า p วิธีหนึ่งในการแปลความหมายค่า P นี้จะเปรียบเทียบกับค่าที่ตั้งไว้ล่วงหน้า ถ้าค่า p นี้จะต่ำกว่าค่าที่ตั้งไว้ล่วงหน้า จะมีสรุปว่า ไม่น่าให้ตัวอย่างมาจากประชากรด้วยสมมติฐานว่างระบุ และปฏิเสธสมมติฐานว่าง ถ้าค่า p นี้จะสูงกว่าค่าที่ตั้งไว้ล่วงหน้า แล้วก็จะสรุปว่า มาจากประชากรกับค่านำเสนอตัวอย่าง และหนึ่งไม่สามารถปฏิเสธสมมติฐานว่างงานวิจัยก่อนหน้านี้แสดงว่า สถิตินักเรียนมีความเข้าใจกระบวนการนี้ และการขาดความเข้าใจสามารถ undetected โดยผู้สอนของพวกเขาเนื่องจากนักเรียนอาจปฏิบัติตามได้อย่างถูกต้อง (การ์ฟิลด์ & Ahlgren, 1988) มีเฉพาะ เมื่อคำถามขอที่ต้องให้นักเรียนอธิบายเหตุผลของการขาดความเข้าใจจะพบ จุดมุ่งหมายของการศึกษานี้ได้รับความรู้ความเชื่อและความยากลำบากในการทำความเข้าใจกับค่า p นักเรียน และ การใช้ความรู้นี้เพื่อพัฒนาโปรแกรมการสอนเพื่อเสริมสร้างนักเรียนเปลี่ยนความเข้าใจของแนวคิดนี้ คำถามวิจัย: เปลี่ยนความเข้าใจของนักเรียนค่า p คืออะไร พวกเขาอาจถือว่าความเข้าใจผิด และสามารถสอนได้รับการพัฒนาปรับปรุงเปลี่ยนความเข้าใจของนักเรียน 1.1 การทบทวนวรรณกรรม การทดสอบสมมติฐานว่างเริ่มต้น ด้วยยอดของสมมติฐานว่างที่ประกอบด้วยการเสนอค่าของพารามิเตอร์ของประชากร แสดงงานวิจัยก่อนหน้านี้ที่นักเรียนอาจเชื่อว่า สมมติฐานนี้อ้างอิงถึงตัวอย่างและประชากร และดังนั้นได้สับสนว่า NHT ฟอร์มเริ่มต้นมากกระบวนการ (Sotos, Vanhoof, Ven den Noortgate, 2007) ก็ยังพบว่า นักศึกษาอาจดำเนินการตามขั้นตอนสำหรับ NHT ได้อย่างถูกต้อง แต่แล้ว อาจคนตีความผิดผลลัพธ์ through lack of เข้าใจว่าการปฏิเสธและการปฏิเสธสมมติฐานว่างจริง ๆ แสดง ปัญหานี้ถูกสอบสวนโดย Haller Krauss (2002) ที่ทำการสำรวจของเจ้าหน้าที่และนักศึกษา บางคนถูกสอนสถิติ จากแผนกจิตวิทยาของมหาวิทยาลัย 6 ในแบบสำรวจนี้ ตัวอย่าง t-ทดสอบตัวอย่างอิสระได้รับซึ่งค่า p 0.01 ประมาณ 26% ของผู้เข้าร่วม (รวมถึงจำนวนเล็ก ๆ ของผู้สอนในระเบียบวิธีสถิติ) ตกลงกับคำสั่ง: "คุณได้พบความเป็นไปได้ของสมมติฐานว่างที่เป็นจริง" ประมาณ 69% ของผู้เข้าร่วม (รวมประมาณหนึ่งในสามของผู้สอนในระเบียบวิธีสถิติ) ตกลงกับคำสั่ง: "คุณทราบ ถ้าคุณตัดสินใจจะปฏิเสธสมมติฐาน ความเป็นไปได้ว่า คุณทำการตัดสินใจผิด" ผู้ที่ตกลงกับคำสั่งนี้ไม่ได้ดูเหมือนตระหนักถึงลักษณะเงื่อนไขความน่าเป็นค่า p แทน นั่นคือ ค่า p คือ ความน่าเป็นของการตัดสินใจไม่ถูกต้องถ้าสมมติฐานว่างเป็นจริง ความเชื่อที่ว่า ค่า p คือ ความน่าเป็นว่า สมมติฐานว่างเป็นจริงดูเหมือนจะ เป็นความเข้าใจผิดทั่วไปจัดขึ้น ความเข้าใจผิดที่เกี่ยวข้องคือ 1-P ที่มีความเป็นไปได้ว่าสมมติฐานทางเลือกจริง มันอาจยังจะเชื่อว่า ปฏิเสธสมมติฐานว่างพิสูจน์ทฤษฎีพื้นฐานที่ปฏิเสธการคาดการณ์ มันอาจยังจะเชื่อว่า ต่ำค่าสำหรับค่า p แนะนำผลการจำลองแบบ (Nickerson, 2000)1.2 ใช้ค่า P ทำไม การใช้การทดสอบสมมติฐานค่า null เป็นที่แพร่หลาย และค่า p มีรายงานอย่างกว้างขวางในวรรณคดี วิธีใช้ p-ค่าแตกต่างกัน และเป็นเรื่องของการอภิปราย (cumming, 2010: Gliner ปลิง และ มอร์แกน 2002: Hubbard และลินด์เซย์ 2008) สามารถใช้ค่า บันทึก Neyman และ Pearson, p เที่ยวคือ เลือกความสำคัญของระดับที่มีอยู่ก่อน และถูกปฏิเสธสมมติฐานเป็น null ถ้าค่า p น้อยกว่านี้ความสำคัญของระดับ แบบฟอร์มนี้การวิเคราะห์นำไปคำนวณได้ชนิดฉันและอัตราผิดพลาดชนิด II ทางเลือก (advocated โดย Fisher) คือการ ดูระดับของการสนับสนุนการ ค่า p ให้สมมติฐานว่าง เป็นค่า p ลด จะได้มีพิจารณาระดับของการสนับสนุนสำหรับสมมติฐานว่างลดลง (Wagenmakers, 2007) เมื่อเร็ว ๆ นี้ อย่างไร ตามมีคำถาม: ควรใช้ค่า p ที่ ทฤษฎีหนึ่งของการทดลองทางวิทยาศาสตร์ได้ว่า มันควรถูกจำลองแบบ ดังนั้น จึง ดูเหมือนไม่ unreasonable สมมติว่า ถ้าการทดลองควรจะซ้ำ แล้วค่า p ยังจะสามารถจำลองแบบ Cumming (2010) ได้แสดงว่า ในความเป็นจริงค่า p แตกต่างกันมากจากตัวอย่างกับตัวอย่างกว่านักวิจัยจำนวนมากตระหนักถึง Hubbard และลินด์เซย์ (2008) แสดงที่แตกต่างกันแม้จะ มีข้อมูลเดียวกัน ขึ้นอยู่กับวิธีการวิเคราะห์ที่เลือก โดยนักวิจัย และว่านักวิจัยได้เลือกทดสอบหางเดียว หรือสองค่า p ปัญหาอื่น มีค่า p คือ ว่า พวกเขาไม่ระบุขนาดผล การศึกษาขนาดเล็ก มีขนาดผลใหญ่สามารถผลผลิต p-ค่าเดียวกันกับการศึกษาขนาดใหญ่มีขนาดผลเล็ก (Hubbard และลินด์เซย์ 2008: Wagenmakers, 2007) นอกจากนี้ มีปัญหาเกี่ยวกับวิธีคำนวณค่า p สมมติว่า สมมติฐานว่างเป็นจริง ค่า p คือ ความน่าเป็นของข้อมูลที่สังเกตและความเป็นไปได้ของข้อมูลมาก ได้จริงจะไม่สามารถทำการตรวจสอบข้อมูลเหล่านี้มากขึ้น ได้แก้แค้นคืนว่าการตัดสินใจควรมีข้อมูล unobserved (Hubbard และลินด์เซย์ 2008)It is for these reasons that it has been suggested that the results of scientific experiments should instead be presented as confidence interval estimates of the parameters. Confidence intervals have the advantage that they are in the same units as the point estimate, and make it easier for the reader to determine if an effect is important, rather than just if it is statistically significant. of even more consequence is that confidence intervals give an idea of the precision of an estimate via the width of the interval. In addition, the width of the interval gives an idea of what the infinite set of possible results may look like (Cumming, 2010; Wagenmakers, 2007). The contrast between the variation in p- values and the variation in confidence intervals is graphically and amusingly illustrated by the "Dance of the p-values

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

การศึกษาครั้งนี้มีวัตถุประสงค์เพื่อได้รับความรู้ความเชื่อของนักเรียนและความยากลำบากในการทำความเข้าใจพีค่านิยมและการใช้ความรู้นี้ในการพัฒนาโปรแกรมการเรียนการสอนที่ดีขึ้น การศึกษาครั้งนี้เกิดขึ้นในช่วงการเรียนการสอนภาคการศึกษาที่สี่ติดต่อกันในระดับอุดมศึกษาของหนึ่งภาคการศึกษาหน่วยสถิติ การศึกษาเป็นวัฏจักรในการที่ผลของแต่ละภาคการศึกษาที่ถูกนำมาใช้เพื่อแจ้งการออกแบบการเรียนการสอนสำหรับภาคการศึกษาต่อไป กว่าภาคการศึกษาที่ต่อไปนี้เทคนิคการสอนที่ถูกนำมาจำลองคอมพิวเตอร์เบื้องต้นของความน่าจะเป็นเหตุผลที่สมมุติใช้บริบทที่คุ้นเคยและการใช้งานของการเป็นตัวแทนทางเลือก นักเรียนยังได้รับการสนับสนุนที่จะเขียนเกี่ยวกับการทำงานของพวกเขา ในฐานะที่เป็นความก้าวหน้าการแทรกแซงสัดส่วนที่สูงขึ้นของนักเรียนประสบความสำเร็จในการกำหนดและใช้ค่าพีในขั้นตอนการทดสอบสมมติฐาน.
1
บทนำการศึกษาครั้งนี้การตรวจสอบปัญหาของนักเรียนในการทำความเข้าใจพีค่านิยมและผลของการแทรกแซงที่มีวัตถุประสงค์เพื่อปรับปรุงความเข้าใจนี้ การทดสอบสมมติฐาน (NHT) เป็นหนึ่งในเทคนิคหลักในสถิติอนุมานที่ยังวิจัยก่อนหน้านี้ได้แสดงให้เห็นว่าแนวคิดของค่า p-สามารถจะเป็นปัญหาสำหรับนักเรียน (Batanero 2008: Gliner, ปลิงและมอร์แกน, 2002; Nickerson 2000)
P-ค่าเกี่ยวกับการได้มาจากความปรารถนาที่จะประเมินโอกาสที่กลุ่มตัวอย่างถูกดึงออกมาจากประชากรที่มีค่าที่ระบุสำหรับพารามิเตอร์ประชากร เมื่อตัวอย่างเลือดดำได้รับการดำเนินการอย่างถูกต้องตัวอย่างจะเป็นเช่นเลือดในส่วนที่เหลือของระบบหลอดเลือดดำที่ ในส่วนสถานการณ์การสุ่มตัวอย่าง แต่มันไม่น่าเป็นอย่างยิ่งที่กลุ่มตัวอย่างจะเป็นตัวแทนตรงของประชากร ถ้าอีกตัวอย่างถูกนำตัวมันก็ไม่น่าจะเป็นตัวแทนตรงของประชากรและนอกจากนี้ไม่น่าจะเป็นเหมือนตัวอย่างแรก อย่างไรก็ตามเรื่องนี้นักวิจัยรู้ว่ากลุ่มตัวอย่างจะในทางใดทางหนึ่งมีแนวโน้มที่จะมีลักษณะคล้ายกับประชากรและที่มันยังคงเป็นไปได้ที่จะให้ข้อสรุปเกี่ยวกับประชากรถึงแม้ว่ามันจะเป็นไปไม่ได้ที่จะเป็นบางอย่างเกี่ยวกับข้อสรุปที่ถูกต้อง.
วิธีการหนึ่งรอบ ปัญหาของความไม่แน่นอนคือการดำเนินการ NHT กับกระบวนการนี้เป็นเรื่อง (สมมติฐาน) จะทำเกี่ยวกับพารามิเตอร์ของประชากร ตัวอย่างที่เก็บรวบรวมแล้วสถิติที่เกี่ยวข้องตัวอย่างการคำนวณและการตัดสินจะทำเป็นวิธีการที่มีแนวโน้มสถิติตัวอย่าง (หรือใครมากขึ้น) จะเป็นเรื่องที่เกี่ยวกับถ้าพารามิเตอร์เป็นจริง ในกระบวนการ NHT ตัดสินนี้จะทำโดยการคำนวณความน่าจะเป็นเงื่อนไขของการได้รับความน่าจะเป็นตัวอย่างที่มีสถิติที่กำหนดหรือมากขึ้นถ้ามีประชากรที่มีพารามิเตอร์เสนอในสมมติฐาน มันเป็นความน่าจะเป็นที่เป็นที่รู้จักกัน p-value วิธีหนึ่งที่จะแปลความหมายนี้ P-value คือการเปรียบเทียบกับค่าที่ตั้งไว้ล่วงหน้า ในกรณีนี้ p-value ต่ำกว่าค่าที่ตั้งไว้ล่วงหน้าก็จะได้ข้อสรุปว่ามันไม่น่าเป็นตัวอย่างที่มาจากประชากรที่มีการระบุไว้สมมติฐานและสมมติฐานถูกปฏิเสธ ในกรณีนี้ p-value สูงกว่าค่าที่ตั้งไว้ล่วงหน้าแล้วมันจะได้ข้อสรุปว่ากลุ่มตัวอย่างจะได้มาจากประชากรที่มีมูลค่าที่เสนอและล้มเหลวที่จะปฏิเสธสมมติฐาน.
วิจัยก่อนหน้านี้แสดงให้เห็นว่านักเรียนของสถิติสามารถมีความเข้าใจในปัญหา กระบวนการนี้และการขาดความเข้าใจนี้สามารถตรวจไม่พบโดยอาจารย์ของพวกเขาเพราะนักเรียนอาจปฏิบัติตามขั้นตอนอย่างถูกต้อง (การ์ฟิลด์และ Ahlgren, 1988) มันเป็นเพียงเมื่อคำถามจะถามที่ต้องการให้นักเรียนที่จะอธิบายเหตุผลของพวกเขาที่ขาดความเข้าใจนี้มีการตรวจพบ . จุดมุ่งหมายของการศึกษาครั้งนี้คือการได้รับความรู้ความเชื่อของนักเรียนและความยากลำบากในการทำความเข้าใจพีค่านิยมและการใช้ความรู้นี้ในการพัฒนาโปรแกรมการเรียนการสอนเพื่อเสริมสร้างความเข้าใจของนักเรียนของแนวคิดนี้ คำถามการวิจัยคือ: อะไรคือความเข้าใจของนักเรียนของค่าพี? ความเข้าใจผิดสิ่งที่พวกเขาอาจจะมี? และสามารถวิธีการสอนที่ได้รับการพัฒนาเพื่อปรับปรุงความเข้าใจของนักเรียน? 1.1 ทบทวนวรรณกรรมสมมติฐานการทดสอบเริ่มต้นด้วยคำสั่งของสมมติฐานที่มีค่าพารามิเตอร์เสนอของประชากร วิจัยก่อนหน้านี้แสดงให้เห็นว่านักเรียนจะได้เชื่อว่าสมมติฐานนี้หมายถึงทั้งตัวอย่างและประชากรและจึงสับสนเกี่ยวกับ NHT รูปแบบเริ่มต้นมากกระบวนการ (Sotos, Vanhoof, เวนถ้ำ Noortgate 2007) ก็ยังได้รับพบว่านักเรียนอาจจะดำเนินการขั้นตอนการ NHT ได้อย่างถูกต้อง แต่แล้วอาจตีความผิดผลที่ผ่านการขาดความเข้าใจในสิ่งที่ปฏิเสธและความล้มเหลวที่จะปฏิเสธสมมติฐานจริงๆบ่งชี้ ปัญหานี้ได้รับการตรวจสอบโดยฮาลเลอร์และอูส (2002) ผู้ดำเนินการสำรวจความคิดเห็นของพนักงานและนักเรียนบางคนเป็นอาจารย์สถิติจากหน่วยงานจิตวิทยาของมหาวิทยาลัยหก ในการสำรวจครั้งนี้เป็นตัวอย่างของกลุ่มที่เป็นอิสระ t-test ได้รับที่เป็น p-value 0.01 ประมาณ 26% ของผู้เข้าร่วม (รวมทั้งขนาดเล็กจำนวนสถิติผู้สอนวิธีการ) เห็นด้วยกับคำที่ว่า "คุณก็จะพบน่าจะเป็นของสมมติฐานเป็นจริง." ประมาณ 69% ของผู้เข้าร่วม (รวมถึงประมาณหนึ่งในสามของสถิติผู้สอนวิธีการ) เห็นด้วยกับคำที่ว่า "คุณจะรู้ว่าถ้าคุณตัดสินใจที่จะปฏิเสธสมมติฐานที่น่าจะเป็นที่จะทำให้คุณตัดสินใจที่ผิดพลาด." บรรดาผู้ที่เห็นด้วยกับคำสั่งนี้ไม่ได้ดูเหมือนจะตระหนักถึงธรรมชาติของความน่าจะเป็นเงื่อนไขที่ p-value แสดงให้เห็นถึง นั่นคือ p-value คือน่าจะเป็นของการตัดสินใจที่ผิดถ้าสมมติฐานเป็นจริง. the เชื่อว่า p-value คือน่าจะเป็นที่สมมติฐานเป็นจริงดูเหมือนจะเป็นความเข้าใจผิดที่ยึดถือร่วมกัน ความเข้าใจผิดที่เกี่ยวข้องคือ 1-P คือความน่าจะเป็นทางเลือกที่สมมติฐานเป็นจริง นอกจากนี้ยังอาจจะเชื่อว่าปฏิเสธสมมติฐานพิสูจน์ทฤษฎีพื้นฐานที่คาดการณ์การปฏิเสธ มันก็อาจจะเป็นที่เชื่อกันว่ามีมูลค่าต่ำสำหรับ p-value แสดงให้เห็นว่าผลที่จำลองแบบ (Nickerson, 2000) 1.2 ทำไมต้องใช้ค่า P-? การใช้การทดสอบโมฆะสมมติฐานที่เป็นที่แพร่หลายและค่าพีจะมีการรายงานกันอย่างแพร่หลายในวรรณกรรม วิธี p-value จะใช้แตกต่างกันและเป็นเรื่องของการอภิปราย (คัมมิง, 2010: Gliner, ทากและมอร์แกน, 2002: ฮับบาร์ดและ Lindsay, 2008) วิธีการหนึ่งที่ค่าพีสามารถนำมาใช้ประกอบกับ Neyman และเพียร์สันก็คือระดับที่มีอยู่ก่อนอย่างมีนัยสำคัญได้รับการแต่งตั้งและสมมติฐานถูกปฏิเสธถ้า p-value น้อยกว่าระดับนัยสำคัญนี้ รูปแบบของการวิเคราะห์นี้จะนำไปสู่การคำนวณเป็นไปได้ของ Type I และ Type II อัตราความผิดพลาด ทางเลือก (สนับสนุนโดยฟิชเชอร์) คือดูที่ระดับของการสนับสนุนที่p-value ให้กับสมมติฐาน null ในฐานะที่เป็น p-value ลดระดับของการสนับสนุนได้รับสำหรับสมมติฐานจะถือว่ายังลดลง (Wagenmakers 2007) เมื่อเร็ว ๆ นี้ แต่คำถามที่ได้รับการถาม: ควรค่าพีนำมาใช้ในทุกหนึ่งทฤษฎีของการทดลองทางวิทยาศาสตร์ก็คือว่ามันควรจะจำลองแบบ ดังนั้นก็จะดูเหมือนไม่มีเหตุผลที่จะคิดว่าถ้าการทดลองควรจะทำซ้ำแล้ว p-value นอกจากนี้ยังจะจำลองแบบ คัมมิง (2010) แสดงให้เห็นว่าในความเป็นจริงค่าพีแตกต่างกันมากขึ้นจากกลุ่มตัวอย่างที่จะลิ้มลองกว่านักวิจัยหลายคนตระหนักถึง ฮับบาร์ดและลินด์ซี (2008) แสดงให้เห็นว่าค่าพีสามารถแตกต่างกันแม้จะมีข้อมูลที่เหมือนกันขึ้นอยู่กับวิธีการวิเคราะห์ที่ถูกเลือกโดยนักวิจัยและอยู่กับว่านักวิจัยได้เลือกการทดสอบหนึ่งหรือสองด้าน. ปัญหากับ p- อีก ค่าคือการที่พวกเขาไม่ได้ระบุขนาดผลกระทบ การศึกษาขนาดเล็กที่มีขนาดผลขนาดใหญ่สามารถให้ผลผลิต p-value เช่นเดียวกับการศึกษาขนาดใหญ่ที่มีขนาดผลขนาดเล็ก (ฮับบาร์ดและ Lindsay 2008: Wagenmakers 2007) นอกจากนี้ยังมีความกังวลเกี่ยวกับความถูกต้องของวิธี P-ค่าที่คำนวณ สมมติว่าสมมติฐานเป็นจริงเป็น p-value คือความน่าจะเป็นที่สังเกตของข้อมูลและความน่าจะเป็นของข้อมูลมากขึ้น แต่ข้อมูลเหล่านี้มากขึ้นจะไม่ได้สังเกตจริง มันเป็นที่น่าสงสัยว่าการตัดสินใจควรจะทำกับข้อมูลที่ไม่มีใครสังเกต (ฮับบาร์ดและ Lindsay, 2008). จะด้วยเหตุผลเหล่านี้ที่จะได้รับการชี้ให้เห็นว่าผลของการทดลองทางวิทยาศาสตร์ควรจะนำเสนอเป็นประมาณการความเชื่อมั่นของพารามิเตอร์ ช่วงความเชื่อมั่นมีความได้เปรียบที่พวกเขาอยู่ในหน่วยเดียวกับประมาณการจุดและทำให้มันง่ายขึ้นสำหรับผู้อ่านที่จะตรวจสอบว่าผลเป็นสิ่งที่สำคัญมากกว่าแค่ว่ามันเป็นนัยสำคัญทางสถิติ ผลมากยิ่งขึ้นก็คือว่าช่วงความเชื่อมั่นให้ความคิดของความแม่นยำของประมาณการที่ผ่านความกว้างของช่วง นอกจากนี้ความกว้างของช่วงเวลาที่ให้ความคิดของสิ่งที่ไม่มีที่สิ้นสุดของผลลัพธ์ที่เป็นไปได้อาจมีลักษณะเช่น (คัมมิง, 2010; Wagenmakers 2007) ความแตกต่างระหว่างการเปลี่ยนแปลงในค่า p- และการเปลี่ยนแปลงในช่วงความเชื่อมั่นที่มีกราฟิกและสนุกสนานแสดงโดย "การเต้นรำของค่าพีที่

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

การศึกษานี้มีวัตถุประสงค์เพื่อเพิ่มความรู้ของนักเรียน ' ความเชื่อและความยากในความเข้าใจ p-values และใช้ความรู้นี้ไปพัฒนาปรับปรุงโปรแกรมการสอน การศึกษานี้ใช้เวลาติดต่อกันสี่สอนเทอมของเทอมหนึ่งสถิติระบบหน่วย การศึกษาวัฏจักร ,ในผลลัพธ์ของแต่ละภาคการศึกษาเพื่อใช้แจ้งออกแบบการเรียนการสอนในภาคการศึกษาต่อไป ผ่านเทอม , ต่อไปนี้การแนะนำเทคนิคการจำลองคอมพิวเตอร์เบื้องต้นของสมมุติความน่าจะเป็นเหตุผลโดยใช้บริบทที่คุ้นเคย และใช้แนวทางอื่น นักเรียน ส่งเสริมให้เขียนเกี่ยวกับการทำงานของพวกเขาเป็นมาตรการขึ้น สัดส่วนที่สูงของนักเรียนเรียบร้อยแล้ว กำหนดและใช้ p-values ในขั้นตอนการทดสอบสมมติฐานโมฆะ .
1 บทนำ
นี้เพื่อศึกษาปัญหาของนักเรียนในการทำความเข้าใจ p-values และผลลัพธ์ของการแทรกแซงที่มีวัตถุประสงค์เพื่อปรับปรุงความเข้าใจนี้ การทดสอบสมมติฐานว่าง ( nht ) เป็นหนึ่งในเทคนิคหลักสถิติเชิงอนุมานแต่งานวิจัยก่อนหน้านี้ได้แสดงให้เห็นว่าแนวคิดของ p สามารถเป็นปัญหาสำหรับนักศึกษา ( batanero 2008 : gliner , ปลิง&มอร์แกน , 2002 ; นิเคอร์สัน 2000 )
p-values ได้มาจากความปรารถนาที่จะประเมินโอกาสที่จำนวนสุ่มจากประชากรที่มีกําหนดค่าพารามิเตอร์ของประชากร . เมื่อเลือดจากหลอดเลือดดำตัวอย่างได้ถูกต้องตัวอย่างจะเป็นเหมือนเลือดในส่วนที่เหลือของระบบหลอดเลือดดำ . ในกลุ่มตัวอย่างส่วนใหญ่สถานการณ์ อย่างไรก็ตาม เป็นไปได้ยากมากที่ ตัวอย่างจะเป็นเหมือนตัวแทนของประชากร ถ้าอย่างอื่นถ่าย มันก็ไม่น่าจะเป็นเหมือนตัวแทนของประชากร และนอกจากนี้ ไม่น่าจะเหมือนกับตัวอย่างก่อน แม้นี้นักวิจัยทราบว่าตัวอย่างในบางวิธีที่มักจะมีลักษณะประชากร และมันยังสามารถให้ข้อสรุปเกี่ยวกับประชากร แม้ว่ามันเป็นไปไม่ได้เพื่อให้แน่ใจเกี่ยวกับความถูกต้องของบทสรุป
วิธีหนึ่งปัญหาของความไม่แน่นอนคือการ nht . ด้วยกระบวนการนี้ ข้อเสนอ ( สมมติฐานโมฆะ ) ทำเกี่ยวกับประชากรพารามิเตอร์

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.