Figure 7: Read and write performance with the default and tuned
MVAPICH2 eager threshold settings.
5.1 Point-to-Point Protocol Threshold
To demonstrate the performance improvement achieved by tuning the
point-to-point protocol threshold, we selected a benchmark that mimics
a modified version of the CFOUR quantum chemistry application
(http://www.cfour.de/) [7]. This version, provided by Simmons and
Schulz [20], augments CFOUR by using the open-source GRVY toolkit
library (https://red.ices.utexas.edu/projects/software/wiki/GRVY) to
convert disk transactions into distributed-memory transactions using
MPI. The benchmark reads and writes fixed records in random order as
part of an out-of-core solve procedure. For this case study, the
offload hosts are placed on different nodes to maximize the available
memory. By using MPI Advisor, we discovered that the messages were
mainly point-to-point with sizes around 256 KB or less. Following the
advice provided by the tool and shown in Listing 1, we changed the
eager threshold of MVAPICH2 from 17 KB to 256 KB by setting
MV2_IBA_EAGER_THRESHOLD to 262144. Running the micro-benchmark with
the 256 KB threshold yielded a significant improvement for write and
read operations. Figure 7 reports the aggregate write and read speeds
for the default and tuned MVAPICH2 settings.

Eager vs. rendezvous program details:
- Number of call sites that used MPI_Send: 1
- Maximum median size (bytes) of messages sent
  through MPI_Send: 131072
- Eager threshold of MPI library (bytes): 17408
- For more details on the messages sent,
  consult the mpiP report: ./cfour.88089.1.mpiP
Eager vs. rendezvous suggestions:
- POSSIBLE OPTIMIZATION: The maximum of the
  median messages sent is 131072 bytes, but
  the eager threshold of the MPI Library is
  17408. Consider increasing the eager
  threshold to a value higher than 131072 bytes.
- WARNING: Increasing the eager threshold will
  also increase MPI library memory footprint.
MVAPICH2 command that can be used to change the
eager threshold:
- MV2_IBA_EAGER_THRESHOLD=<nbytes>
- Related documentation can be found in:
  http://mvapich.cse.ohio-state.edu/support/
Listing 1: MPI Advisor recommendation for tuning the point-to-point
eager vs. rendezvous protocol threshold for a benchmark that mimics
CFOUR.
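
The check behind the recommendation in Listing 1 can be sketched roughly as follows. This is a minimal illustration, not MPI Advisor's actual implementation: the function names are hypothetical, and rounding the suggestion up to the next power of two is an assumption made here (the listing only asks for a value higher than the largest median message size).

```python
def next_power_of_two_above(n: int) -> int:
    """Return the smallest power of two strictly greater than n (n >= 0)."""
    return 1 << n.bit_length()


def suggest_eager_threshold(median_size_by_callsite: dict, eager_threshold: int):
    """Return a suggested eager threshold in bytes, or None if no change is needed.

    median_size_by_callsite maps a call-site ID to the median size (bytes)
    of the messages sent from that MPI_Send call site.
    """
    max_median = max(median_size_by_callsite.values())
    # The listing asks for a threshold *higher than* the largest median size,
    # so a threshold that already exceeds it needs no change.
    if eager_threshold > max_median:
        return None
    # Assumption: round up to a power of two; any larger value would also do.
    return next_power_of_two_above(max_median)


# Values reported in Listing 1: one call site, 131072-byte median, 17408-byte threshold.
suggestion = suggest_eager_threshold({1: 131072}, eager_threshold=17408)
print(suggestion)
```

With the values from Listing 1 this yields 262144 bytes (256 KB), the setting used in the case study. As the tool's warning notes, a larger threshold also increases the library's memory footprint, so the smallest value that covers the dominant message size is the natural choice.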
5.2 Algorithms for Collective Operations
To illustrate the performance benefit obtainable from tuning
collective operations, we use the ASP [9] application, a parallel
implementation of the Floyd-Warshall algorithm for solving the
all-pairs shortest-path problem. ASP mainly uses MPI_Bcast and changes
the root of the broadcast operation in each iteration. On Maverick,
Intel MPI outperforms MVAPICH2 for ASP when both use their default
settings. The default configuration of MVAPICH2 is tuned based on OMB,
which always uses the same root for collective operations. Following
the tool’s recommendation, shown in Listing 2, we improved the
performance of ASP by 8.3%. Table 5 reports the execution times
obtained with MVAPICH2 under default and tuned settings and with
Intel MPI on the Maverick cluster using 80 MPI tasks. MPI Advisor does
not provide any recommendation for Intel MPI because its default
setting is already tuned.
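
The changing-root broadcast pattern in ASP can be illustrated with a small sketch. It makes several assumptions: it runs sequentially and only simulates the ranks (no MPI library is called), it assumes rows are distributed in contiguous blocks, and the function names are hypothetical. In a real MPI code, the line that copies row k would be an MPI_Bcast whose root is the rank that owns that row.

```python
INF = float("inf")


def owner_of_row(k: int, n: int, num_ranks: int) -> int:
    """Rank owning row k under a contiguous block distribution (assumed layout)."""
    rows_per_rank = -(-n // num_ranks)  # ceiling division
    return k // rows_per_rank


def asp_simulated(dist, num_ranks: int):
    """Floyd-Warshall over an n x n distance matrix.

    Returns the shortest-path matrix and the broadcast root used in each
    iteration, to show that the root moves as k advances.
    """
    n = len(dist)
    d = [row[:] for row in dist]
    roots = []
    for k in range(n):
        root = owner_of_row(k, n, num_ranks)
        roots.append(root)
        row_k = d[k][:]  # stands in for MPI_Bcast(row k, root=root)
        for i in range(n):
            d_ik = d[i][k]
            for j in range(n):
                candidate = d_ik + row_k[j]
                if candidate < d[i][j]:
                    d[i][j] = candidate
    return d, roots


graph = [
    [0, 3, INF, 7],
    [8, 0, 2, INF],
    [5, INF, 0, 1],
    [2, INF, INF, 0],
]
shortest, roots = asp_simulated(graph, num_ranks=2)
print(roots)     # the broadcast root changes once row ownership changes
print(shortest)
```

Because the root follows the owner of row k, a broadcast algorithm selected from a benchmark that always uses the same root may not be the best choice for this access pattern.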
            MVAPICH2   MVAPICH2   Intel MPI
            Default    Tuned      Default
Time (s)    24.45      22.41      22.38
Table 5: ASP execution time (seconds) on 80 cores.
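
The 8.3% improvement follows directly from the execution times in Table 5. The short calculation below uses only the three reported times; the variable and function names are illustrative.

```python
def percent_improvement(baseline_s: float, tuned_s: float) -> float:
    """Relative reduction in execution time, in percent of the baseline."""
    return (baseline_s - tuned_s) / baseline_s * 100.0


mvapich2_default_s = 24.45
mvapich2_tuned_s = 22.41
intel_default_s = 22.38

gain = percent_improvement(mvapich2_default_s, mvapich2_tuned_s)
gap_to_intel_s = mvapich2_tuned_s - intel_default_s
print(f"MVAPICH2 tuned vs. default: {gain:.1f}% faster")
print(f"Remaining gap to Intel MPI: {gap_to_intel_s:.2f} s")
```

The tuned MVAPICH2 run is about 8.3% faster than the default and ends up within 0.03 s of Intel MPI.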
5.3 Mapping of MPI Tasks to Cores
To illustrate the benefits of using MPI Advisor to tune the MPI
tasks-to-cores mapping, we use HPCG [3]. HPCG is an application that
is used to produce an alternative ranking of the TOP500 list
(http://www.top500.org/) and can be used with MPI only or with
Collective program details:
- Number of call sites that used MPI_Bcast: 1
- Average MPI_Bcast message sizes:
  * Callsite ID: 2, size: 2097152
- MPI_Bcast algorithm employed: 5
- Root is changing
- For more details on the messages sent,
  consult the mpiP report: ./asp.8.22585.1.mpiP
Collective suggestions:
- POSSIBLE OPTIMIZATION: The algorithm being
  employed for MPI_BCAST may not provide the
  best performance for the messages being sent.
  * Consider changing to algorithm 2
MVAPICH2 command that can be used to change the
MPI_Bcast algorithm:
- MV2_INTER_BCAST_TUNING=<1-9>
Listing 2: MPI Advisor recommendation for selecting the appropriate
collective operation algorithm for ASP.
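
Both recommendations are applied through MVAPICH2 environment variables, so one way to use them is to build the environment for the MPI launch programmatically. The sketch below is only an illustration: the helper name is hypothetical, the 1 to 9 range check is an assumption about the accepted algorithm identifiers, and the launch command itself is left out because it depends on the site.

```python
import os


def tuned_environment(base_env, eager_threshold_bytes=None, bcast_algorithm=None):
    """Return a copy of base_env with the MVAPICH2 tuning variables set.

    The variable names are the ones printed by MPI Advisor in Listings 1 and 2.
    """
    env = dict(base_env)  # never modify the caller's environment
    if eager_threshold_bytes is not None:
        if eager_threshold_bytes <= 0:
            raise ValueError("eager threshold must be a positive byte count")
        env["MV2_IBA_EAGER_THRESHOLD"] = str(eager_threshold_bytes)
    if bcast_algorithm is not None:
        # Assumption: valid algorithm identifiers are 1 through 9.
        if not 1 <= bcast_algorithm <= 9:
            raise ValueError("broadcast algorithm must be between 1 and 9")
        env["MV2_INTER_BCAST_TUNING"] = str(bcast_algorithm)
    return env


# Settings used in the two case studies: 256 KB eager threshold, algorithm 2.
env = tuned_environment(os.environ, eager_threshold_bytes=262144, bcast_algorithm=2)
print(env["MV2_IBA_EAGER_THRESHOLD"], env["MV2_INTER_BCAST_TUNING"])
```

The resulting dictionary can be passed as the `env` argument of `subprocess.run` together with whatever MPI launch command the cluster uses.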

