2. What is big data?
Manyika et al. [10, page 1] define Big Data as “datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze”. Similarly, Davis and Patterson [1, page 4] state that “Big data is data too big to be handled and analyzed by traditional database protocols such as SQL”, a view shared by [11,3,4], among others. Both groups of authors, however, go beyond the size of data alone when defining Big Data. Edd Dumbill [12, page 3] explicitly conveys this multi-dimensionality when he adds that “the data is too big, moves too fast, or doesn’t fit the strictures of your database architectures”. This quotation shows that characteristics beyond sheer size must be present before a large dataset can be considered Big Data, or big size data as it is often called throughout the literature [2].
It is now accepted that size is not the only feature of Big Data. Many authors [1,12,11,9,13,4] explicitly use the Three V’s (Volume, Variety and Velocity) to characterize Big Data. While the three V’s dominate the literature, several authors [10,13] and institutes such as IEEE also stress Value, Veracity and Visualization. This last “V” underlines how important it is to provide good tools for making sense of the data and of the results of analysis.
Volume (Data at rest). The benefit gained from the ability to process large amounts of information is the main attraction of Big Data analytics: having more data beats having better models [12]. As a consequence, many companies now tend to store vast amounts of data of many sorts: social network data, health care data, financial data, biochemistry and genetic data, astronomical data, etc.
Variety (Data in many forms). These data do not have a fixed structure and rarely present themselves in a perfectly ordered form, ready for processing [12]. Indeed, such data can be highly structured (data from relational databases), semi-structured (web logs, social media feeds, raw feeds directly from a sensor source, email, etc.) or unstructured (video, still images, audio, clicks) [12]. Another “V”, for Variability, can be added to Variety to emphasize semantics, or the variability of meaning in language and communication protocols.
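To make the three degrees of structure concrete, the sketch below shows one hypothetical record of each kind; the field names and values are invented purely for illustration.

```python
import json

# Structured: a fixed-schema row, as it would come from a relational table.
structured_row = (42, "Alice", "2015-06-01", 199.90)   # (id, name, date, amount)

# Semi-structured: a self-describing JSON document, e.g. a social media feed
# item; fields may differ from one record to the next.
semi_structured = json.loads(
    '{"user": "alice", "text": "hello", "tags": ["big", "data"]}'
)

# Unstructured: raw bytes (here the first bytes of a JPEG image); any structure
# must be imposed later by the analysis itself.
unstructured = b"\xff\xd8\xff\xe0\x00\x10JFIF"

print(type(structured_row), type(semi_structured), type(unstructured))
```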
Velocity (Data in motion). Velocity involves streams of data, the creation of structured records, and availability for access and delivery. It is not just the velocity of the incoming data that is the issue: it is possible, for example, to stream fast-moving data into bulk storage for later batch processing. What matters is the speed of the feedback loop, taking data from input through to decision [12].
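The sketch below contrasts the two options just mentioned, using an in-memory queue as a stand-in for a real streaming source; the event fields, the archive list and the decision threshold are all hypothetical.

```python
from collections import deque

# Stand-in for a fast-moving stream of sensor events.
stream = deque({"sensor": "s1", "value": v} for v in (17, 93, 41))

archive = []      # stand-in for bulk storage, to be batch-processed later
THRESHOLD = 50    # assumed decision rule for the feedback loop

while stream:
    event = stream.popleft()
    archive.append(event)              # option 1: store now, analyze in batch later
    if event["value"] > THRESHOLD:     # option 2: act immediately on data in motion
        print("alert:", event)         # the feedback loop: input -> decision
```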
Value (Data in highlight). This feature is the very purpose of Big Data technology. The view is well expressed by the International Data Corporation, which states that Big Data architectures are “designed to economically extract value from very large volumes of a wide variety of data, by enabling high-velocity capture, discovery, and/or analysis”. This value falls into two categories: analytical use (replacing or supporting human decisions, discovering needs, segmenting populations to customize actions) and enabling new business models, products and services [12,10].
Veracity (Data in doubt). Veracity is conformity with truth or fact; in short, accuracy, certainty, precision. Uncertainty can be caused by inconsistencies, model approximations, ambiguities, deception, fraud, duplication, incompleteness, spam and latency. Because of this uncertainty, results derived from Big Data cannot be proven; they can only be assigned a probability.
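As a purely illustrative sketch of assigning a probability rather than a proof, the fragment below aggregates the same fact as reported by several hypothetical, partly incomplete sources and attaches a crude confidence to the majority value.

```python
from collections import Counter

# Hypothetical reports of the same fact from several sources; None marks an
# incomplete record.
reports = ["open", "open", "closed", "open", None]

observed = [r for r in reports if r is not None]   # drop incomplete entries
value, votes = Counter(observed).most_common(1)[0]

confidence = votes / len(observed)                 # fraction of sources that agree
print(f"most likely value: {value!r}, confidence: {confidence:.2f}")
```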
To conclude, dealing effectively with Big Data requires one to create value against the volume, variety and veracity of data while it is still in motion (velocity), not just after it is at rest [11]. Finally, as recommended by [13], scientists must jointly tackle Big Data with all of its features.