The first benchmark task (the “Grep

The first benchmark task (the “Grep task”) requires each system
to scan through a data set of 100-byte records looking for a
three character pattern. This is the only task that requires processing
largely unstructured data, and was originally included in the
benchmark by the authors of [23] since the same task was included
in the original MapReduce paper [8].
To explore more complex uses of the benchmarked systems, the
benchmark includes four more analytical tasks related to log-file
analysis and HTML document processing. Three of these tasks operate
on structured data; the final task operates on both structured
and unstructured data.
The datasets used by these four tasks include a UserVisits table
meant to model log files of HTTP server traffic, a Documents table
containing 600,000 randomly generated HTML documents, and a
Rankings table that contains some metadata calculated over the data
in the Documents table. The schema of the tables in the benchmark
data set is described in detail in [23]. In summary, the UserVisits
table contains 9 attributes, the largest of which is destinationURL
which is of type VARCHAR(100). Each tuple is on the order of 150
bytes wide. The Documents table contains two attributes:

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

The first benchmark task (the “Grep task”) requires each systemto scan through a data set of 100-byte records looking for athree character pattern. This is the only task that requires processinglargely unstructured data, and was originally included in thebenchmark by the authors of [23] since the same task was includedin the original MapReduce paper [8].To explore more complex uses of the benchmarked systems, thebenchmark includes four more analytical tasks related to log-fileanalysis and HTML document processing. Three of these tasks operateon structured data; the final task operates on both structuredand unstructured data.The datasets used by these four tasks include a UserVisits tablemeant to model log files of HTTP server traffic, a Documents tablecontaining 600,000 randomly generated HTML documents, and aRankings table that contains some metadata calculated over the datain the Documents table. The schema of the tables in the benchmarkdata set is described in detail in [23]. In summary, the UserVisitstable contains 9 attributes, the largest of which is destinationURLwhich is of type VARCHAR(100). Each tuple is on the order of 150bytes wide. The Documents table contains two attributes:

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

งานมาตรฐานครั้งแรก (ที่ "งาน Grep")
ต้องใช้แต่ละระบบการสแกนผ่านชุดของการบันทึกข้อมูล100
ไบต์มองหารูปแบบตัวอักษรที่สาม นี้เป็นงานเดียวที่ต้องมีการประมวลผลส่วนใหญ่ข้อมูลที่ไม่มีโครงสร้างและถูกรวมอยู่เดิมในเกณฑ์มาตรฐานโดยผู้เขียนของ[23] ตั้งแต่งานเดียวกันถูกรวมอยู่ในกระดาษMapReduce เดิม [8]. เพื่อสำรวจการใช้งานที่ซับซ้อนมากขึ้นของระบบการวัดประสิทธิผล ที่มาตรฐานมากขึ้นรวมถึงสี่งานวิเคราะห์ที่เกี่ยวข้องกับการเข้าสู่ระบบไฟล์การวิเคราะห์และการประมวลผลเอกสารHTML สามของงานเหล่านี้ทำงานเกี่ยวกับข้อมูลที่มีโครงสร้าง; งานสุดท้ายที่ทำงานได้ทั้งที่มีโครงสร้างข้อมูลและไม่มีโครงสร้าง. ชุดข้อมูลที่ใช้งานเหล่านี้สี่รวมถึงตาราง UserVisits หมายถึงการจำลองล็อกไฟล์ของการจราจรเซิร์ฟเวอร์ HTTP, ตารางเอกสารที่มี600,000 ที่สร้างแบบสุ่มเอกสาร HTML และตารางการจัดอันดับที่มีเมตาดาต้าบางคำนวณมากกว่าข้อมูลในตารางเอกสาร เค้าร่างของตารางในมาตรฐานชุดข้อมูลที่มีการอธิบายในรายละเอียดใน [23] โดยสรุป UserVisits ตารางมี 9 คุณลักษณะที่ใหญ่ที่สุดของซึ่งเป็น destinationURL ซึ่งเป็นประเภท VARCHAR (100) tuple แต่ละคำสั่งของ 150 ไบต์กว้าง ตารางเอกสารมีสองลักษณะ:

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

งานมาตรฐานแรก ( " สามารถใช้งาน " ) แต่ละระบบ
สแกนผ่านชุดข้อมูล 100 ไบต์ประวัติมองหารูปแบบ
3 ตัวละคร นี้เป็นเพียงงานที่ต้องมีการประมวลผลข้อมูลที่ไม่มีโครงสร้าง
ส่วนใหญ่ และถูกรวมอยู่ในเกณฑ์มาตรฐาน โดยผู้เขียนของ
[ 23 ] เนื่องจากงานเดียวกันถูกรวมอยู่ในต้นฉบับ mapreduce กระดาษ

[ 8 ]การสํารวจที่ซับซ้อนมากขึ้นของการใช้ข้ามระบบ มาตรฐานรวมถึงสี่วิเคราะห์
งานเพิ่มเติมที่เกี่ยวข้องเข้าสู่ระบบการวิเคราะห์ไฟล์
และการประมวลผลเอกสาร HTML สามของงานเหล่านี้ทำงาน
เชิงข้อมูล งานสุดท้ายการทั้งสองและข้อมูลที่ไม่มีโครงสร้าง
.
ข้อมูลที่ใช้โดยทั้ง 4 งานรวม โต๊ะ uservisits เป็น
นางแบบล็อกไฟล์ของการจราจรของเซิร์ฟเวอร์ HTTP ,เป็นโต๊ะที่มีเอกสาร
600000 สุ่มสร้างเอกสาร HTML และอันดับตารางที่ประกอบด้วยข้อมูล

คำนวณผ่านข้อมูลในเอกสารที่โต๊ะ schema ของตารางในชุดข้อมูลมาตรฐาน
จะอธิบายในรายละเอียดใน [ 23 ] สรุปได้ว่า uservisits
ตารางมี 9 คุณลักษณะที่ใหญ่ที่สุดซึ่งเป็น destinationurl
ซึ่งเป็นประเภท VAIO HK ( 100 )แต่ละ tuple คือลำดับที่ 150
ขนาดกว้าง เอกสารตารางที่มีสองคุณสมบัติ

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.