4 Experimental results4.1 Quick sor

4 Experimental results
4.1 Quick sort. We rst analyze the dependence of
the version of Quick sort shown in Figure 1 on the
number of inversions in the input.
Figure 4 shows our data for the AMD Athlon ar-
chitecture. The number of comparisons is independent
of the number of inversions in the input, as expected.
For the number of element swaps, the plot is very
close to linear when considering the input sequence with
small di's. Since the x-axis shows log(Inv), this is in very
good correspondence with the bound O(n log(Inv=n))
of Theorem 1.1 (recall that n is xed in the plot). For
the input sequence with large di's, the plot is dierent.
This is a sign of the slack in the analysis (for this type
of input) noted after the proof of Theorem 1.1. We
will demonstrate below that this curve is in very good
correspondence with the version of the bound given by
Equation (2.2). The plots for the number of branch mis-
predictions and for the running time clearly show that
they are correlated with the number of element swaps.
For the number of branch mispredictions, this is ex-
plained by the fact that an element swap is performed
after the two while loops stop, and hence corresponds
to two branch mispredictions. For the running time,
it seems reasonable to infer that branch mispredictions
are a dominant part of the running time of Quicksort on
this type of architecture. Finally, the number of data
cache misses seems independent of the presortedness of
the input sequence, in correspondence with the fact that
for all element swaps, the data to be manipulated is al-
ready in the cache and therefore the element swaps do
not generate additional cache misses.
Figure 5 show the same plots for the P4 architec-
ture, except that we were not able to obtain data for
L2 data cache misses. We note that the plots follow the
same trends as in Figure 4. The number of comparisons
and the number of element swaps are approximately the
same, but the running time is aected by up to a factor
of 1:8 on the P4, while only by up to a factor of 1:42 on
the Athlon. One reason for this behavior is the number
of branch mispredictions, which is slightly smaller for
the Athlon. Also, the length of the pipeline, shorter for
Athlon, makes the branch mispredictions more costly
on a P4 than on an Athlon.
Similar observations on the resemblance between
the data for the two architectures apply to all our
experiments. For this reason, and because of the
extra data for L2 that we have for Athlon, we for
the remaining plots restrict ourselves to the Athlon
architecture.
We now turn to the variants of Quicksort. Figure 6
shows the number comparisons, the number of element
swaps, the number of branch mispredictions, the run-
ning time, and the L2 data cache misses for the version
of Quicksort that chooses as pivot the median of three
random elements in the input sequence. We note that
the plots have a behavior similar to the ones for the ver-
sion of Quicksort shown in Figure 4. However, some im-
provements are noticed. The three-median pivot Quick-
sort performs around 25% less comparisons, due to the
better choice of the pivot. This immediately triggers a
slight improvement in the number of data cache misses.
However, the number of branch mispredictions increases
due to the extra branches required to compute the me-
dian of three elements. The number of element swaps
remains approximately the same.
Figure 7 shows the same plots for the determin-
istic version of Quicksort that chooses the middle el-
ement as pivot. In this case we note that the num-
ber of comparisons does depend on the presortedness
of the input. This is because for small disorder, the
middle element is very close to the median and there-
fore the number of comparisons is close to n log n, as
opposed to 1:4n log n expected for the randomized
Quicksort [10]. The good pivot choice for small disorder
in the input also triggers a smaller number of compar-
isons and branch mispredictions. However, for large dis-
order, the number of comparisons is larger compared to
randomized median-of-three Quicksort due to bad pivot
choices. Also, the running time is aected by up to a
factor of two by the disorder in the input.
Figure 8 and Figure 9 show that when varying the
input size n, the behavior of the plots remains the same
for randomized Quicksort. Hence, our ndings do not
seem to be tied to the particular choice of n = 2 106.
Finally, in Figure 10 we demonstrate that the
number of element swaps is very closely related to
Pn
i=1 log di, cf. the comment after the proof of Theo-
rem 1.1. Hence the reason for the non-linear shape of
the previous plots for input sequences with large di's
seems to be the slack introduced (for this type of input)
after Equation (2.2) in the proof of Theorem 1.1. As
in the other cases, the running time and the number
of branch mispredictions follow the same trend as the
number of swaps.

4 Experimental results
4.1 Quick sort. We rst analyze the dependence of
the version of Quick sort shown in Figure 1 on the
number of inversions in the input.
Figure 4 shows our data for the AMD Athlon ar-
chitecture. The number of comparisons is independent
of the number of inversions in the input, as expected.
For the number of element swaps, the plot is very
close to linear when considering the input sequence with
small di's. Since the x-axis shows log(Inv), this is in very
good correspondence with the bound O(n log(Inv=n))
of Theorem 1.1 (recall that n is xed in the plot). For
the input sequence with large di's, the plot is dierent.
This is a sign of the slack in the analysis (for this type
of input) noted after the proof of Theorem 1.1. We
will demonstrate below that this curve is in very good
correspondence with the version of the bound given by
Equation (2.2). The plots for the number of branch mis-
predictions and for the running time clearly show that
they are correlated with the number of element swaps.
For the number of branch mispredictions, this is ex-
plained by the fact that an element swap is performed
after the two while loops stop, and hence corresponds
to two branch mispredictions. For the running time,
it seems reasonable to infer that branch mispredictions
are a dominant part of the running time of Quicksort on
this type of architecture. Finally, the number of data
cache misses seems independent of the presortedness of
the input sequence, in correspondence with the fact that
for all element swaps, the data to be manipulated is al-
ready in the cache and therefore the element swaps do
not generate additional cache misses.
Figure 5 show the same plots for the P4 architec-
ture, except that we were not able to obtain data for
L2 data cache misses. We note that the plots follow the
same trends as in Figure 4. The number of comparisons
and the number of element swaps are approximately the
same, but the running time is aected by up to a factor
of 1:8 on the P4, while only by up to a factor of 1:42 on
the Athlon. One reason for this behavior is the number
of branch mispredictions, which is slightly smaller for
the Athlon. Also, the length of the pipeline, shorter for
Athlon, makes the branch mispredictions more costly
on a P4 than on an Athlon.
Similar observations on the resemblance between
the data for the two architectures apply to all our
experiments. For this reason, and because of the
extra data for L2 that we have for Athlon, we for
the remaining plots restrict ourselves to the Athlon
architecture.
We now turn to the variants of Quicksort. Figure 6
shows the number comparisons, the number of element
swaps, the number of branch mispredictions, the run-
ning time, and the L2 data cache misses for the version
of Quicksort that chooses as pivot the median of three
random elements in the input sequence. We note that
the plots have a behavior similar to the ones for the ver-
sion of Quicksort shown in Figure 4. However, some im-
provements are noticed. The three-median pivot Quick-
sort performs around 25% less comparisons, due to the
better choice of the pivot. This immediately triggers a
slight improvement in the number of data cache misses.
However, the number of branch mispredictions increases
due to the extra branches required to compute the me-
dian of three elements. The number of element swaps
remains approximately the same.
Figure 7 shows the same plots for the determin-
istic version of Quicksort that chooses the middle el-
ement as pivot. In this case we note that the num-
ber of comparisons does depend on the presortedness
of the input. This is because for small disorder, the
middle element is very close to the median and there-
fore the number of comparisons is close to n log n, as
opposed to  1:4n log n expected for the randomized
Quicksort [10]. The good pivot choice for small disorder
in the input also triggers a smaller number of compar-
isons and branch mispredictions. However, for large dis-
order, the number of comparisons is larger compared to
randomized median-of-three Quicksort due to bad pivot
choices. Also, the running time is aected by up to a
factor of two by the disorder in the input.
Figure 8 and Figure 9 show that when varying the
input size n, the behavior of the plots remains the same
for randomized Quicksort. Hence, our ndings do not
seem to be tied to the particular choice of n = 2  106.
Finally, in Figure 10 we demonstrate that the
number of element swaps is very closely related to
Pn
i=1 log di, cf. the comment after the proof of Theo-
rem 1.1. Hence the reason for the non-linear shape of
the previous plots for input sequences with large di's
seems to be the slack introduced (for this type of input)
after Equation (2.2) in the proof of Theorem 1.1. As
in the other cases, the running time and the number
of branch mispredictions follow the same trend as the
number of swaps.

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

ผลการทดลอง 44.1 การเรียงลำดับอย่างรวดเร็ว เรา rst วิเคราะห์การพึ่งพาของรุ่นเรียงลำดับด่วนที่แสดงในรูปที่ 1 ในการจำนวน inversions ในการป้อนข้อมูลรูปที่ 4 แสดงข้อมูลสำหรับตัว AMD Athlon architecture จำนวนเปรียบเทียบเป็นอิสระของจำนวน inversions ในการป้อนข้อมูล ตามที่คาดไว้จำนวนองค์ประกอบ swaps พล็อตจะมากปิดถึงเส้นเมื่อพิจารณาลำดับการป้อนข้อมูลด้วยเล็กดีของ เนื่องจากแกน x แสดง log(Inv) นี้เป็นมากติดต่อดี ด้วย O ผูก (n log(Inv=n))ของทฤษฎีบท 1.1 (นึกว่า n คือ xed ในพล็อต) สำหรับลำดับที่เข้าใหญ่นอของ พล็อตเป็น erent ดิเป็นเครื่องหมายของกระตุ้นในการวิเคราะห์ (สำหรับชนิดนี้ของอินพุต) กล่าวหลังจากหลักฐานของ 1.1 ทฤษฎีบท เราแสดงด้านล่างโค้งนี้ว่าดีมากโต้ตอบกับรุ่นของขอบเขตที่กำหนดสมการ (2.2) โครงการจำนวนสาขา mis-คาดคะเน และสำหรับเวลาทำงานอย่างชัดเจนแสดงว่าพวกเขามี correlated กับจำนวน swaps องค์ประกอบจำนวนสาขา mispredictions นี่คือเก่าplained ความจริงที่ว่าองค์ประกอบสลับเป็นดำเนินการหลังจากที่ทั้งสองขณะหยุดการวนรอบ และดังนั้น สอดคล้องไปสองสาขา mispredictions สำหรับเวลาทำงานเหมือนสมรู้สาขาที่ mispredictionsเป็นส่วนหลักของเวลา Quicksort ทำงานบนชนิดนี้ของสถาปัตยกรรม ในที่สุด จำนวนข้อมูลแคพุ่งเหมือนขึ้นอยู่กับ presortedness ของลำดับที่ป้อนเข้า ในการติดต่อกับความจริงที่ในทุกองค์ประกอบ swaps ข้อมูลที่จะจัดการคือ อัล-พร้อมในแคช และทำ swaps องค์ประกอบดังนั้นไม่สร้างพุ่งแคชเพิ่มเติมรูปที่ 5 แสดงผืนเดียวกันสำหรับการ P4 architecture ยกเว้นว่าเราไม่สามารถได้รับข้อมูลข้อมูล L2 แคชพุ่ง เราทราบว่า โครงการตามแนวโน้มเดียวกันกับในรูปที่ 4 จำนวนเปรียบเทียบจำนวนองค์ประกอบ swaps มีประมาณนี้เหมือนกัน แต่เวลาทำงานเป็น ected โดยขึ้นอยู่กับตัว1:8 ใน P4 ขณะเท่านั้นโดยขึ้นอยู่กับปัจจัยที่ 1:42 บนAthlon เหตุผลหนึ่งสำหรับการทำงานนี้คือ หมายเลขสาขา mispredictions ซึ่งเป็นเล็กสำหรับAthlon ยัง ความยาวของท่อ สั้นสำหรับAthlon ทำ mispredictions สาขามากบนเป็น P4 กว่าบนตัว Athlonสังเกตคล้ายกับความคล้ายคลึงระหว่างข้อมูลสำหรับสถาปัตยกรรมสองใช้ทั้งหมดของเราการทดลอง ด้วยเหตุนี้ และเนื่องจากการL2 ที่เรามีสำหรับ Athlon เราสำหรับข้อมูลเพิ่มเติมผืนที่เหลือจำกัดตนเอง Athlonสถาปัตยกรรมเราตอนนี้เปลี่ยนไปตัวแปร Quicksort รูปที่ 6แสดงการเปรียบเทียบหมายเลข หมายเลของค์ประกอบswaps จำนวนสาขา mispredictions รัน-หนิง และเวลาพุ่งแคข้อมูล L2 สำหรับรุ่นของ Quicksort ที่เลือกเป็นสาระสำคัญมัธยฐานของสามองค์ประกอบที่สุ่มในลำดับการป้อนข้อมูล เราสังเกตว่าผืนนี้มีลักษณะคล้ายกับที่สำหรับหนอน-นของ Quicksort ที่แสดงในรูปที่ 4 อย่างไรก็ตาม im บาง -provements จะสังเกตเห็น สาระสำคัญ 3 ค่ามัธยฐานด่วน-เรียงลำดับทำการประมาณ 25% เปรียบเทียบ เนื่องในทางเลือกที่ดีกว่าของ pivot ทริกเกอร์นี้ทันทีปรับปรุงเล็กน้อยในจำนวนข้อมูลแคพุ่งอย่างไรก็ตาม การเพิ่มขึ้นของจำนวนสาขา mispredictionsเนื่องจากต้องคำนวณมีสาขาเพิ่ม-เตียนของทั้งสามองค์ประกอบ จำนวนองค์ประกอบ swapsนั้นประมาณรูปที่ 7 แสดงผืนเดียวสำหรับ determin-รุ่น Quicksort ที่เลือกกลางเอล - isticement เป็นสาระสำคัญ ในกรณีนี้ เราหมายเหตุว่า num -ber ของเปรียบเทียบขึ้นอยู่กับ presortedness ที่ของการป้อนข้อมูล ทั้งนี้เนื่องจากสำหรับโรคขนาดเล็ก การองค์กลางมีมากมัธยฐานและมี-fore จำนวนเปรียบเทียบเป็นใกล้กับล็อก n nตรงข้ามกับ 1:4n ล็อก n ที่คาดไว้สำหรับแบบ randomizedQuicksort [10] เลือกสาระสำคัญดีสำหรับโรคเล็ก ๆในการป้อนข้อมูลทริกเล็กจำนวน compar-mispredictions isons และสาขา อย่างไรก็ตาม สำหรับขนาดใหญ่หรือไม่??สั่งซื้อ การเปรียบเทียบจำนวนมีขนาดใหญ่เมื่อเทียบกับrandomized Quicksort มัธยฐานของสามเนื่องจากสาระสำคัญไม่ถูกต้องตัวเลือก นอกจากนี้ เวลาทำงานเป็น ected โดยถึงคูณสองโดยโรคในการป้อนข้อมูลรูปที่ 8 และรูปที่ 9 แสดงว่าเมื่อแตกต่างกันป้อนข้อมูลขนาด n ลักษณะการทำงานของโครงการยังคงเหมือนเดิมสำหรับ randomized Quicksort ดังนั้น ndings ของเราไม่ดูเหมือนจะผูกพันการเลือกเฉพาะ n = 2 106สุดท้าย ในรูปที่ 10 เราแสดงให้เห็นถึงที่หมายจำนวน swaps องค์ประกอบเกี่ยวข้องอย่างใกล้ชิดกับพีเอ็นฉัน = 1 ระบบ di, cf. คิดหลังจากหลักฐานที-rem 1.1 ดังนั้นเหตุผลที่รูปร่างไม่ใช่เชิงเส้นของโครงการก่อนหน้าสำหรับป้อนลำดับของนอที่ใหญ่ดูเหมือนจะ กระตุ้นแนะนำ (สำหรับป้อนข้อมูลชนิดนี้)จากสมการ (2.2) ในหลักฐานของ 1.1 ทฤษฎีบท เป็นในกรณีอื่น ๆ เวลาทำงาน และหมายเลขของ mispredictions สาขาตามแนวโน้มเดียวกันเป็นการจำนวน swaps

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

4 ผลการทดลอง
4.1 การจัดเรียงอย่างรวดเร็ว เราแรกวิเคราะห์พึ่งพาอาศัยกันของรุ่นของการจัดเรียงอย่างรวดเร็วแสดงในรูปที่ 1 ในจำนวนinversions ในการป้อนข้อมูล. รูปที่ 4 แสดงให้เห็นว่าข้อมูลของเราสำหรับ AMD Athlon ar- chitecture จำนวนของการเปรียบเทียบความเป็นอิสระของจำนวน inversions ในการป้อนข้อมูลเป็นไปตามคาด. สำหรับจำนวนของสัญญาแลกเปลี่ยนองค์ประกอบพล็อตเป็นอย่างมากใกล้เคียงกับเมื่อพิจารณาการเชิงเส้นลำดับการป้อนข้อมูลด้วยdi ขนาดเล็ก ตั้งแต่แกน x แสดงให้เห็นถึงการเข้าสู่ระบบ (Inv) นี้มากอยู่ในการติดต่อที่ดีกับขอบเขต O (log n (Inv = n)) ของทฤษฎีบท 1.1 (n จำที่ xed ในพล็อต) สำหรับลำดับการป้อนข้อมูลที่มีขนาดใหญ่ของดิพล็อตเป็นดิ? ต่างกัน. นี้เป็นสัญญาณของการหย่อนในการวิเคราะห์ที่ (สำหรับประเภทนี้ของท่าน) ตั้งข้อสังเกตหลังจากที่พิสูจน์ทฤษฎีบท 1.1 เราจะแสดงให้เห็นด้านล่างที่โค้งนี้เป็นสิ่งที่ดีมากในการติดต่อกับรุ่นที่ถูกผูกไว้ที่กำหนดโดยสมการ(2.2) แปลงสำหรับจำนวนของสาขาศาสนาการคาดการณ์และเวลาการทำงานอย่างชัดเจนแสดงให้เห็นว่าพวกเขาจะมีความสัมพันธ์กับจำนวนของสัญญาแลกเปลี่ยนองค์ประกอบ. สำหรับจำนวน mispredictions สาขานี้เป็นอดีตplained โดยความจริงที่ว่าแลกเปลี่ยนองค์ประกอบที่จะดำเนินการหลังจากที่ในขณะที่ทั้งสองหยุดลูปและด้วยเหตุนี้สอดคล้องสอง mispredictions สาขา สำหรับเวลาทำงาน, ดูเหมือนว่าเหมาะสมเพื่อสรุป mispredictions สาขาที่เป็นส่วนที่โดดเด่นของเวลาการทำงานของQuicksort ในรูปแบบของสถาปัตยกรรมนี้ ในที่สุดจำนวนของข้อมูลที่คิดถึงแคชดูเหมือนเป็นอิสระจาก presortedness ของลำดับการป้อนข้อมูลในการติดต่อกับความจริงที่สำหรับแลกเปลี่ยนองค์ประกอบทุกข้อมูลที่จะจัดการเป็นอัลพร้อมในแคชและดังนั้นจึงแลกเปลี่ยนองค์ประกอบไม่ได้สร้างเพิ่มเติมคิดถึงแคช. รูปที่ 5 แสดงแปลงเดียวกันสำหรับ P4 สถาปัตยกรรมture ยกเว้นว่าเราไม่สามารถที่จะได้รับข้อมูลสำหรับการคิดถึงแคชข้อมูลL2 เราทราบว่าแปลงตามแนวโน้มเดียวกับในรูปที่ 4 จำนวนของการเปรียบเทียบและจำนวนของสัญญาแลกเปลี่ยนองค์ประกอบที่มีประมาณเดียวกันแต่เวลาทำงานเป็น ected ได้ถึงปัจจัยที่1: 8 P4 ที่ ในขณะที่เพียงได้ถึงปัจจัยของ 01:42 ในAthlon เหตุผลหนึ่งที่ทำให้การทำงานนี้คือจำนวนของ mispredictions สาขาซึ่งมีขนาดเล็กกว่าเล็กน้อยสำหรับ Athlon นอกจากนี้ความยาวของท่อสั้นสำหรับAthlon ทำให้ mispredictions สาขาค่าใช้จ่ายเพิ่มเติมในP4 กว่า Athlon. สังเกตที่คล้ายกันในความคล้ายคลึงระหว่างข้อมูลสำหรับทั้งสองนำไปใช้กับสถาปัตยกรรมของเราทดลอง ด้วยเหตุนี้และเพราะข้อมูลเพิ่มเติมสำหรับ L2 ที่เรามีสำหรับ Athlon เราสำหรับแปลงที่เหลือจะจำกัด ตัวเอง Athlon สถาปัตยกรรม. ตอนนี้เราหันไปสายพันธุ์ของ Quicksort รูปที่ 6 แสดงให้เห็นถึงการเปรียบเทียบตัวเลขจำนวนขององค์ประกอบแลกเปลี่ยนจำนวน mispredictions สาขา Run- เวลาหนิงและคิดถึงแคชข้อมูล L2 สำหรับรุ่นของQuicksort ที่เลือกเป็นเดือยเฉลี่ยของสามองค์ประกอบสุ่มในลำดับการป้อนข้อมูล. เราทราบว่าแปลงมีพฤติกรรมคล้ายกับคนสำหรับ Ver- ไซออนของ Quicksort แสดงในรูปที่ 4 อย่างไรก็ตามญบางprovements จะสังเกตเห็น เดือยสามเฉลี่ย Quick- เรียงทำการเปรียบเทียบประมาณ 25% น้อยเนื่องจากทางเลือกที่ดีของการหมุน นี้ทันทีเรียกปรับปรุงเล็กน้อยในจำนวนของข้อมูลที่คิดถึงแคช. อย่างไรก็ตามจำนวนสาขาเพิ่มขึ้น mispredictions เนื่องจากสาขาพิเศษที่จำเป็นในการคำนวณเลืDian สามองค์ประกอบ จำนวนของสัญญาแลกเปลี่ยนองค์ประกอบยังคงประมาณเดียวกัน. รูปที่ 7 แสดงให้เห็นแปลงเดียวกันสำหรับ determin- รุ่น istic ของ Quicksort ที่เลือกตรงกลาง EL- ement เป็นสาระสำคัญ ในกรณีนี้เราทราบว่าจานวนเบอร์ของการเปรียบเทียบไม่ขึ้นอยู่กับ presortedness ของท่าน เพราะนี่คือความผิดปกติเล็ก ๆองค์ประกอบกลางอยู่ใกล้กับค่าเฉลี่ยและรองเพื่อขอแบ่งข้างหน้าจำนวนของการเปรียบเทียบอยู่ใกล้กับ n ล็อก n เป็นตรงข้ามกับ? 1: เข้าสู่ระบบ 4N คาดว่า n สำหรับสุ่มQuicksort [10] ทางเลือกที่ดีสำหรับการหมุนความผิดปกติเล็ก ๆในการป้อนข้อมูลนอกจากนี้ยังก่อให้เกิดการขนาดเล็กจำนวน compar- isons และ mispredictions สาขา อย่างไรก็ตามสำหรับการปรากฏขนาดใหญ่เพื่อที่จำนวนของการเปรียบเทียบจะมีขนาดใหญ่เมื่อเทียบกับค่าเฉลี่ยของสามQuicksort สุ่มเนื่องจากการหมุนที่ไม่ดีทางเลือก นอกจากนี้ยังมีเวลาทำงานเป็น? ected โดยขึ้นอยู่กับปัจจัยที่สองจากความผิดปกติในการป้อนข้อมูลที่. รูปที่ 8 และรูปที่ 9 แสดงให้เห็นว่าเมื่อที่แตกต่างกันขนาดn การป้อนข้อมูลพฤติกรรมของแปลงยังคงเหมือนเดิมสำหรับQuicksort สุ่ม ดังนั้น ndings ของเราไม่ได้ดูเหมือนจะเชื่อมโยงกับทางเลือกเฉพาะของn = 2? 106 ในที่สุดรูปที่ 10 เราแสดงให้เห็นว่าจำนวนของสัญญาแลกเปลี่ยนองค์ประกอบที่มีความสัมพันธ์อย่างใกล้ชิดกับPn i = 1 เข้าสู่ระบบดิ cf เลย แสดงความคิดเห็นหลังจากที่หลักฐานการ Theo- REM 1.1 ดังนั้นเหตุผลที่รูปร่างไม่เชิงเส้นของแปลงก่อนหน้านี้สำหรับลำดับการป้อนข้อมูลที่มีขนาดใหญ่ของดิดูเหมือนว่าจะหย่อนแนะนำ(สำหรับประเภทของการป้อนข้อมูลนี้) หลังจากที่สมการ (2.2) ในการพิสูจน์ทฤษฎีบท 1.1 ขณะที่ในกรณีอื่น ๆ ที่เวลาทำงานและจำนวนของmispredictions สาขาตามแนวโน้มเช่นเดียวกับจำนวนของสัญญาแลกเปลี่ยน

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

4 การทดลอง
4.1 ด่วนประเภท เรา RST วิเคราะห์การพึ่งพาอาศัยกันของ
รุ่นเร็ว เรียงแสดงในรูปที่ 1 ใน
จำนวน inversions ในการป้อนข้อมูล
รูปที่ 4 แสดงข้อมูลสำหรับ AMD Athlon AR -
chitecture . จำนวนของการเปรียบเทียบเป็นอิสระ
จำนวน inversions ในการป้อนข้อมูลตามที่คาดหวัง .
สำหรับหมายเลขแลกเปลี่ยนองค์ประกอบพล็อตมาก
ใกล้เชิงเส้น เมื่อพิจารณาลำดับใส่กับ
ขนาดเล็ก di . ตั้งแต่แกน x แสดง log ( INV ) , มีมาก
จดหมายที่ดีกับผูกพัน O ( n log ( N = N )
ทฤษฎีบท 1.1 ( จำได้ว่าเป็น xed ในพล็อต ) สำหรับลำดับใส่
ขนาดใหญ่ di , แปลงเป็น ดิ erent .
นี้เป็นสัญญาณของเวลาในการวิเคราะห์ ( ชนิดนี้
ใส่ ) กล่าวหลังจากพิสูจน์ทฤษฎีบท 1.1 . เรา
จะแสดงด้านล่างที่โค้งนี้มีมาก
ติดต่อกับรุ่นไว้ให้โดย
สมการ ( 2.2 ) แปลงสำหรับจำนวนของสาขา MIS -
คาดคะเนและสำหรับเวลา แสดงให้เห็นชัดเจนว่าพวกเขามีความสัมพันธ์กับ
จำนวนองค์ประกอบ swaps .
สำหรับจำนวนของ mispredictions สาขา ซึ่งเป็นอดีต -
plained โดยความจริงที่ว่าองค์ประกอบสลับแสดง
หลังจากที่ทั้งสองในขณะที่ลูปหยุดและด้วยเหตุนี้สอดคล้อง
2 สาขา mispredictions . สำหรับวิ่งเวลา
ดูเหมือนว่าเหมาะสมที่จะอนุมานว่า mispredictions สาขา
เป็นส่วนเด่นของเวลาการทำงานของควิกซอร์ตบน
สถาปัตยกรรมประเภทนี้ ในที่สุดจำนวนของข้อมูล
แคชคิดถึงดูอิสระของ presortedness ของ
ใส่ลำดับในการติดต่อกับความเป็นจริงที่
สำหรับการแลกเปลี่ยนธาตุทั้งหมด ข้อมูลที่ถูก คือ อัล -
พร้อมในแคชและดังนั้นองค์ประกอบการทำ

ไม่สร้างแคชเพิ่มเติมพลาด รูปที่ 5 แสดงเดียวกันแปลงสำหรับ P4 engineer -
จริง ยกเว้นว่าเราไม่สามารถรับข้อมูลสำหรับแคช L2
ข้อมูลพลาด เราทราบว่าแปลงตามแนวโน้ม
เหมือนในรูปที่ 4 จำนวนของการเปรียบเทียบ
และจำนวนของการแลกเปลี่ยนธาตุประมาณ
เหมือนกัน แต่เวลาวิ่งเป็น ประมวลถึงปัจจัยของบน P4
8 ในขณะที่เพียงโดยขึ้นอยู่กับปัจจัย 1:42 บน
Athlon . หนึ่งในเหตุผลสำหรับพฤติกรรมนี้เป็นหมายเลข
ของ mispredictions สาขาซึ่งเล็กกว่าเล็กน้อยสำหรับ
Athlon . นอกจากนี้ ความยาวของท่อ , สั้นสำหรับ
โพรเซสเซอร์ ทำให้ราคาแพงมากขึ้น
mispredictions สาขาบน P4 กว่า Athlon .

สังเกตที่คล้ายกันในความเหมือนกันระหว่างข้อมูลสองสถาปัตยกรรมสมัครทดลองของเรา
. ด้วยเหตุนี้และเนื่องจาก
ข้อมูลพิเศษสำหรับ L2 ที่เรามีข้อมูล เราสำหรับ
แปลงที่เหลือ จำกัด ตัวเองเพื่อ Athlon

ตอนนี้เราเปิดสถาปัตยกรรม รูปของควิกซอร์ต . รูปที่ 6
แสดงจํานวนการเปรียบเทียบจำนวนขององค์ประกอบ
swaps จำนวน mispredictions สาขา , วิ่ง -
หนิงเวลาและข้อมูล L2 แคชขาดหายไปสำหรับรุ่นที่เลือกเป็นเดือยของควิกซอร์ต
3
ค่ามัธยฐานขององค์ประกอบในการสุ่มลำดับ เราทราบว่า
แปลงมีพฤติกรรมคล้ายกับสำหรับ Ver -
ไซออนของควิกซอร์ตแสดงในรูปที่ 4 อย่างไรก็ตาม อิม -
provements จะสังเกตเห็น 3 หมุนเร็ว -
มัธยฐานจัดเรียงแสดงประมาณ 25% เปรียบเทียบน้อย เนื่องจาก
ทางเลือกที่ดีของเดือย เดี๋ยวนี้เรียก
การปรับปรุงเล็กน้อยในหมายเลขของแคชข้อมูลคิดถึง .
แต่จำนวนสาขาเพิ่มขึ้น mispredictions
จากพิเศษสาขาต้องคำนวณฉัน -
เถียนสามองค์ประกอบ จำนวนของการแลกเปลี่ยนองค์ประกอบยังคงประมาณเดียวกัน

.รูปที่ 7 แสดงเดียวกันแปลง เพื่อทราบ -
istic รุ่นควิกซอร์ตที่จะเลือกกลาง เอล -
ซึ่งเป็นเดือย ในกรณีนี้เราทราบว่าหนุ่ม -
เบอร์มือถือจะขึ้นอยู่กับ presortedness
ของการป้อนข้อมูล นี้เป็นเพราะความผิดปกติขนาดเล็ก
กลางธาตุอยู่ใกล้กับค่ามัธยฐานและมี -
4 จำนวนเปรียบเทียบใกล้ n log n ,
นอกคอก 15 log n โดยสุ่ม
ควิกซอร์ต [ 10 ] ดีทางเลือกสำหรับโรคเดือยเล็ก
ในใส่ยังเรียกขนาดเล็กจำนวน compar -
isons และสาขา mispredictions . อย่างไรก็ตาม สำหรับขนาดใหญ่ dis -
สั่งซื้อ จำนวนของการเปรียบเทียบมีขนาดใหญ่เมื่อเทียบกับค่าเฉลี่ยของทั้งสามคน

เลือกควิกซอร์ตเนื่องจากหมุนไม่ดี นอกจากนี้ เวลาทํางานเป็น ประมวลถึง
โดยปัจจัยที่สอง โดยความผิดปกติในการป้อนข้อมูล .
รูปที่ 8 และรูปที่ 9 แสดงให้เห็นว่าเมื่อเปลี่ยนแปลง
ใส่ขนาด N , พฤติกรรมของแปลงยังคงเหมือนเดิม
สำหรับแบบควิกซอร์ต . ดังนั้น ndings ของเราไม่
ดูเหมือนจะเชื่อมโยงกับทางเลือกเฉพาะของ n = 2 106 .
ในที่สุด ในรูปที่ 10 เราแสดงให้เห็นว่า
เลขที่แลกเปลี่ยนองค์ประกอบที่เกี่ยวข้องอย่างใกล้ชิดกับ PN

= 1 DI CFแสดงความคิดเห็นหลังพิสูจน์ของ Theo -
เรม 1.1 . ดังนั้นเหตุผลที่รูปร่างแบบไม่เชิงเส้นของ
แปลงก่อนหน้านี้ สำหรับข้อมูลที่มีขนาดใหญ่เป็นลำดับ ดิ
ดูเหมือนจะหย่อนแนะนำ ( สำหรับข้อมูลประเภทนี้ )
หลังจากที่สมการ ( 2.2 ) ในการพิสูจน์ทฤษฎีบท 1.1 . โดย
ในบางกรณี เวลา และหมายเลขของ mispredictions
สาขาเป็นไปตามแนวโน้มเดียวกับ
จำนวน swaps

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.