Regression analysis (such as linear

Regression analysis (such as linear regression models, negative binomial regression
models and Poisson regression models) has been the most popular technique in crash
analysis because the connection between accidents and factors affecting them can be
evidently identified. Using such information, the accident-prone locations can be located
by the traffic engineers, and facilities such as illumination and enforcement, can then
be effectively applied. However, they have limited capacity to discover new and unanticipated
patterns and relationships that are hidden in conventional databases, [12] demonstrates
that certain problem may occur while using traditional statistical analysis to
analyze datasets with large dimensions such as an exponential increase in the number
of parameters with an increase in number of variables and there could be some invalidity
of statistical tests as a due to sparse data. Also, Regression models usually have
their own model specific assumptions and predefined underlying relationships between
dependent and independent variables. Violation of these assumptions may lead the
model to provide erroneous results [13]. Hence, we need a different technique that can
be used to analyze road accidents properly and can extract better results. Data mining
[14] can be described as the set of techniques used for the extraction of implicit, previously
unknown and hidden information from the huge amount of data. Data mining is
an upcoming area that is being used by the researchers worldwide for the analysis of
various types of transportation data. Several data mining techniques such as clustering,
classification, association rule mining have been used to analyzed road safety data.
Chang and Chen [13] analyzed national freeway-1 data from Taiwan using CART and
negative binomial regression model. Abellan et al. [15] analyzed two lane rural highway
data of Granada, Spain using decision rules extracted from decision tree method. Depaire
et al. [2] applied latent class clustering on two road user traffic accident data from 1997
to 1999 of Belgium which divides the accident data into seven clusters. Rovsek et al. [16]
analyzed crash data from 2005 to 2009 of Slovenia with classification and regression tree
(CART) algorithm. Kashani et al. [17] uses CART to analyze crash records obtained from
information and technology department of the Iran traffic police from 2006 to 2008.
This paper proposes a framework that is based on the cluster analysis using K modes
algorithm and association rule mining using Apriori algorithm. Using cluster analysis as
a preliminary task can group the data into different homogeneous segments. Association
rule mining is further applied on these clusters as well as on entire data set (EDS) to
generate association rules. In the best of our knowledge, it is the first time that both the
approaches have been used together for analysis of road accident data. The result of the
analysis proves that using cluster analysis as a preliminary task can help in removing heterogeneity
to some extent in the road accident data. The paper is organized as follows: In
Sect. “Proposed framework”, a framework is proposed to analyze the road accident data.
Next, a description of the data set used is given. In Sect. “Results and discussion”, the

Regression analysis (such as linear regression models, negative binomial regression
models and Poisson regression models) has been the most popular technique in crash
analysis because the connection between accidents and factors affecting them can be
evidently identified. Using such information, the accident-prone locations can be located
by the traffic engineers, and facilities such as illumination and enforcement, can then
be effectively applied. However, they have limited capacity to discover new and unanticipated
patterns and relationships that are hidden in conventional databases, [12] demonstrates
that certain problem may occur while using traditional statistical analysis to
analyze datasets with large dimensions such as an exponential increase in the number
of parameters with an increase in number of variables and there could be some invalidity
of statistical tests as a due to sparse data. Also, Regression models usually have
their own model specific assumptions and predefined underlying relationships between
dependent and independent variables. Violation of these assumptions may lead the
model to provide erroneous results [13]. Hence, we need a different technique that can
be used to analyze road accidents properly and can extract better results. Data mining
[14] can be described as the set of techniques used for the extraction of implicit, previously
unknown and hidden information from the huge amount of data. Data mining is
an upcoming area that is being used by the researchers worldwide for the analysis of
various types of transportation data. Several data mining techniques such as clustering,
classification, association rule mining have been used to analyzed road safety data.
Chang and Chen [13] analyzed national freeway-1 data from Taiwan using CART and
negative binomial regression model. Abellan et al. [15] analyzed two lane rural highway
data of Granada, Spain using decision rules extracted from decision tree method. Depaire
et al. [2] applied latent class clustering on two road user traffic accident data from 1997
to 1999 of Belgium which divides the accident data into seven clusters. Rovsek et al. [16]
analyzed crash data from 2005 to 2009 of Slovenia with classification and regression tree
(CART) algorithm. Kashani et al. [17] uses CART to analyze crash records obtained from
information and technology department of the Iran traffic police from 2006 to 2008.
This paper proposes a framework that is based on the cluster analysis using K modes
algorithm and association rule mining using Apriori algorithm. Using cluster analysis as
a preliminary task can group the data into different homogeneous segments. Association
rule mining is further applied on these clusters as well as on entire data set (EDS) to
generate association rules. In the best of our knowledge, it is the first time that both the
approaches have been used together for analysis of road accident data. The result of the
analysis proves that using cluster analysis as a preliminary task can help in removing heterogeneity
to some extent in the road accident data. The paper is organized as follows: In
Sect. “Proposed framework”, a framework is proposed to analyze the road accident data.
Next, a description of the data set used is given. In Sect. “Results and discussion”, the

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

วิเคราะห์การถดถอย (เช่นแบบจำลองถดถอยเชิงเส้น การถดถอยทวินามลบรุ่นรุ่นและ Poisson ถดถอย) มีเทคนิคนิยมมากที่สุดในความล้มเหลววิเคราะห์เนื่องจากการเชื่อมต่อระหว่างอุบัติเหตุและปัจจัยที่ส่งผลกระทบต่อพวกเขาสามารถระบุอย่างเห็นได้ชัด ใช้ข้อมูลดังกล่าว สถาน accident-prone สามารถตั้งอยู่โดยวิศวกรจราจร และไฟส่องสว่างและการบังคับใช้ สามารถแล้วสามารถใช้ได้อย่างมีประสิทธิภาพ อย่างไรก็ตาม พวกเขามีจำกัดกำลังการผลิตใหม่ และไม่คาดคิดรูปแบบและความสัมพันธ์ที่ซ่อนอยู่ในฐานข้อมูลทั่วไป, [12] อธิบายปัญหาบางอย่างที่อาจเกิดขึ้นในขณะที่แบบการวิเคราะห์ทางสถิติเพื่อวิเคราะห์ชุดข้อมูล มีขนาดใหญ่เช่นการเพิ่มจำนวนเนนพารามิเตอร์การเพิ่มขึ้นของตัวแปร และอาจมีความไม่ถูกต้องบางการทดสอบทางสถิติเป็นข้อมูลจากการห่าง แบบจำลองถดถอยมักจะมีสมมติฐานเฉพาะรูปแบบและความสัมพันธ์พื้นฐานที่กำหนดไว้ล่วงหน้าระหว่างตนเองตัวแปรอิสระ และอิสระ การละเมิดสมมติฐานเหล่านี้อาจนำไปสู่การรูปแบบเพื่อให้ผลลัพธ์ผิดพลาด [13] ดังนั้น เราต้องการเทคนิคต่าง ๆ ที่สามารถใช้วิเคราะห์อุบัติเหตุทางถนนอย่างถูกต้อง และสามารถแยกดีกว่า การทำเหมืองข้อมูล[14] ได้อธิบายไว้เป็นชุดของเทคนิคที่ใช้การสกัดนัย ก่อนหน้านี้ไม่ทราบข้อมูลจากข้อมูลจำนวนมหาศาล มีการทำเหมืองข้อมูลพื้นที่ที่ถูกใช้ โดยนักวิจัยทั่วโลกในการวิเคราะห์ เกิดขึ้นข้อมูลการเดินทางชนิดต่าง ๆ เทคนิคการทำเหมืองข้อมูลหลายเช่นคลัสเตอร์การจัดประเภท สมาคมกฎการทำเหมืองแร่มีการใช้ถนนวิเคราะห์ความปลอดภัยของข้อมูลช้างและเฉิน [13] วิเคราะห์ข้อมูลฟรีเวย์ 1 แห่งชาติจากไต้หวันโดยใช้รถเข็น และแบบจำลองถดถอยทวินามลบ ทางหลวงชนบทเลนสองวิเคราะห์ Abellan et al. [15]ข้อมูลของกรานาดา สเปนโดยใช้กฎการตัดสินใจจากวิธีที่ต้นไม้ตัดสินใจ Depaireet al. [2] ใช้คลัสเตอร์บนถนนสองผู้ใช้ข้อมูลอุบัติเหตุจราจรจาก 1997 ชั้นแฝงการ 1999 ของเบลเยียมซึ่งแบ่งข้อมูลอุบัติเหตุกลุ่มเจ็ด Rovsek et al. [16]วิเคราะห์ความผิดพลาดข้อมูลจาก 2005 2552 ของสโลวีเนียกับต้นไม้การจำแนกและการถดถอยขั้นตอนวิธีการ (คัน) Kashani et al. [17] รถเข็นที่ใช้ในการวิเคราะห์ความล้มเหลวในการบันทึกได้จากฝ่ายข้อมูลและเทคโนโลยีของตำรวจจราจรที่อิหร่านจาก 2006 2008กระดาษนี้เสนอกรอบตามการวิเคราะห์คลัสเตอร์ที่ใช้โหมด Kอัลกอริทึมและสมาคมเหมืองกฎที่ใช้อัลกอริทึม Apriori โดยใช้การวิเคราะห์คลัสเตอร์เป็นงานเบื้องต้นสามารถจัดกลุ่มข้อมูลเป็นเซ็กเมนต์ที่เหมือนกันแตกต่างกัน ความสัมพันธ์ของกฎการทำเหมืองแร่เพิ่มเติมไว้ ในกลุ่มนี้เช่น เดียว กับชุดข้อมูลทั้งหมด (EDS)สร้างกฎความสัมพันธ์ ในที่สุดความรู้ของเรา มันเป็นเวลาที่ทั้งวิธีมีการใช้ร่วมกันสำหรับการวิเคราะห์ข้อมูลอุบัติเหตุถนน ผลของการวิเคราะห์พิสูจน์ว่าการใช้คลัสเตอร์วิเคราะห์งานเบื้องต้นสามารถช่วยในการลบ heterogeneityในบางกรณีข้อมูลอุบัติเหตุถนน กระดาษจัดเป็นดังนี้: ในอ "เสนอกรอบ" กรอบจะเสนอการวิเคราะห์ข้อมูลอุบัติเหตุถนนถัดไป กำหนดรายละเอียดของชุดข้อมูลที่ใช้ ในอ "ผลลัพธ์และสนทนา" ใน

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

การวิเคราะห์การถดถอย (เช่นแบบจำลองการถดถอยเชิงเส้นถดถอยทวินามเชิงลบ
และหุ่นจำลองถดถอยปัวซอง) ได้รับเทคนิคที่นิยมมากที่สุดในการแข่งขัน
วิเคราะห์เนื่องจากการเชื่อมต่อระหว่างการเกิดอุบัติเหตุและปัจจัยที่ส่งผลให้พวกเขาสามารถที่จะ
ระบุอย่างเห็นได้ชัด การใช้ข้อมูลดังกล่าวสถานที่เกิดอุบัติเหตุได้ง่ายสามารถอยู่
โดยวิศวกรจราจรและสิ่งอำนวยความสะดวกเช่นไฟส่องสว่างและการบังคับใช้แล้วสามารถ
นำมาประยุกต์ใช้อย่างมีประสิทธิภาพ แต่พวกเขามีความสามารถที่จะค้นพบใหม่และไม่คาดคิด จำกัด
รูปแบบและความสัมพันธ์ที่ถูกซ่อนอยู่ในฐานข้อมูลเดิม [12] แสดงให้เห็น
ว่าปัญหาบางอย่างที่อาจเกิดขึ้นในขณะที่ใช้การวิเคราะห์ทางสถิติแบบดั้งเดิมในการ
วิเคราะห์ชุดข้อมูลที่มีขนาดใหญ่เช่นเพิ่มขึ้นชี้แจงในจำนวน
ของ พารามิเตอร์กับการเพิ่มขึ้นในจำนวนของตัวแปรและอาจจะมีความอ่อนแอบางส่วน
ของการทดสอบทางสถิติเป็นเนื่องจากข้อมูลที่เบาบาง นอกจากนี้ยังมีรูปแบบการถดถอยมักจะมี
การตั้งสมมติฐานของตัวเองรูปแบบเฉพาะและความสัมพันธ์พื้นฐานที่กำหนดไว้ล่วงหน้าระหว่าง
ตัวแปรตามและเป็นอิสระ การละเมิดของสมมติฐานเหล่านี้อาจนำไปสู่
รูปแบบที่จะให้ผลลัพธ์ที่ผิดพลาด [13] ดังนั้นเราจำเป็นต้องมีเทคนิคที่แตกต่างกันที่สามารถ
นำมาใช้ในการวิเคราะห์อุบัติเหตุบนท้องถนนอย่างถูกต้องและสามารถดึงผลลัพธ์ที่ดีกว่า การทำเหมืองข้อมูล
[14] สามารถอธิบายเป็นชุดของเทคนิคที่ใช้ในการสกัดโดยนัยก่อนหน้านี้
ที่ไม่รู้จักและซ่อนข้อมูลจากข้อมูลจำนวนมาก การทำเหมืองข้อมูลเป็น
พื้นที่ที่จะเกิดขึ้นว่าจะถูกใช้โดยนักวิจัยทั่วโลกสำหรับการวิเคราะห์
ข้อมูลชนิดต่างๆการขนส่ง หลายเทคนิคการทำเหมืองข้อมูลเช่นการจัดกลุ่ม
จำแนกการทำเหมืองแร่สมาคมกฎมีการใช้ข้อมูลการวิเคราะห์ความปลอดภัยทางถนน.
ช้างและเฉิน [13] วิเคราะห์ทางหลวงแห่งชาติ-1 ข้อมูลจากไต้หวันโดยใช้รถเข็นและ
เชิงลบแบบการถดถอยทวินาม Abellan et al, [15] วิเคราะห์สองเลนทางหลวงชนบท
ข้อมูลที่กรานาดา, สเปนโดยใช้กฎการตัดสินใจที่สกัดจากวิธีต้นไม้ตัดสินใจ Depaire
et al, [2] นำไปใช้จัดกลุ่มกลุ่มแฝงบนถนนสองข้อมูลอุบัติเหตุจราจรของผู้ใช้ปี 1997 จาก
ที่จะปี 1999 เบลเยียมซึ่งแบ่งข้อมูลอุบัติเหตุที่เกิดขึ้นเป็นเจ็ดกลุ่ม Rovsek et al, [16]
วิเคราะห์ข้อมูลความผิดพลาด 2005-2009 ของสโลวีเนียที่มีการจัดหมวดหมู่และต้นไม้ถดถอย
อัลกอริทึม (ซื้อ) Kashani et al, [17] ใช้รถเข็นในการวิเคราะห์ความผิดพลาดของการบันทึกที่ได้รับจาก
เทคโนโลยีสารสนเทศและการกรมตำรวจจราจรอิหร่านตั้งแต่ปี 2006 ถึงปี 2008
กระดาษนี้นำเสนอกรอบที่อยู่บนพื้นฐานของการวิเคราะห์กลุ่มโดยใช้ K โหมด
ขั้นตอนวิธีการทำเหมืองแร่และการปกครองของสมาคมโดยใช้อัลกอริทึม Apriori โดยใช้การวิเคราะห์กลุ่มเป็น
งานเบื้องต้นสามารถจัดกลุ่มข้อมูลในส่วนที่เป็นเนื้อเดียวกันที่แตกต่างกัน สมาคม
เหมืองแร่กฎถูกนำไปใช้เพิ่มเติมเกี่ยวกับกลุ่มเหล่านี้เช่นเดียวกับชุดข้อมูลทั้งหมด (EDS) เพื่อ
สร้างกฎสมาคม ที่ดีที่สุดของความรู้ของเราก็เป็นครั้งแรกที่ทั้งสอง
วิธีได้ถูกนำมาใช้ร่วมกันเพื่อการวิเคราะห์ข้อมูลอุบัติเหตุทางถนน ผลของ
การวิเคราะห์พิสูจน์ให้เห็นว่าการใช้การวิเคราะห์กลุ่มเป็นงานเบื้องต้นสามารถช่วยในการลบความแตกต่าง
ไปบ้างในข้อมูลอุบัติเหตุทางถนน กระดาษที่ถูกจัดขึ้นดังนี้
นิกาย กรอบ "เสนอ" กรอบที่มีการเสนอในการวิเคราะห์ข้อมูลการเกิดอุบัติเหตุที่ถนน.
ถัดไป, คำอธิบายของชุดข้อมูลที่ใช้จะได้รับ ในนิกาย "ผลการทดลองและการอภิปราย" ที่

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.