In this paper we argue that, under a broad range of circumstances, all data reduction techniques will result in some decrease in tree size with little impact on accuracy. Section 2 offers detailed empirical evidence for the validity of this claim, but an intuitive feeling for why it might be true can be grasped by looking at Figure 1. The figure shows plots of tree size and accuracy as a function of training set size for the UC Irvine australian dataset. c4.5 was used to generate the trees (Quinlan 1993), and each plot corresponds to a different pruning mechanism: error-based (ebp, the c4.5 default) (Quinlan 1993), reduced error (rep) (Quinlan 1987), minimum description length (mdl) (Quinlan & Rivest 1989), cost-complexity with the 1se rule (ccp1se) (Breiman et al. 1984), and cost-complexity without the 1se rule (ccp0se). On the left-hand side of the graphs, no training instances are available and the best one can do with test instances is to assign them a class label at random. On the right-hand side of the graphs, the entire dataset (excluding test instances) is available to the tree building process. Movement from the left to the right corresponds to the addition of randomly selected instances to the training set. Alternatively, moving from the right to the left corresponds to removing randomly selected instances from the training set.
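To make the experimental setup concrete, the following is a minimal sketch of such a learning-curve experiment. It is illustrative only: the study above used c4.5 with the pruning mechanisms just listed, none of which is directly available in scikit-learn, so the sketch substitutes a CART-style tree with minimal cost-complexity pruning (roughly analogous to ccp0se), and a synthetic dataset stands in for the UC Irvine australian data.

```python
# Sketch of a learning-curve experiment: tree size and accuracy as a
# function of training set size. Assumptions: scikit-learn's
# DecisionTreeClassifier (CART with ccp_alpha pruning) replaces c4.5,
# and a synthetic dataset replaces the UCI australian data (which has
# 690 instances and 14 attributes).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=690, n_features=14, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

# A fixed random permutation of the training set: walking down this
# order corresponds to adding randomly selected instances, and walking
# back up corresponds to removing them.
rng = np.random.default_rng(0)
order = rng.permutation(len(X_train))

for frac in np.linspace(0.1, 1.0, 10):
    n = int(frac * len(order))
    idx = order[:n]  # the first n randomly ordered training instances
    clf = DecisionTreeClassifier(ccp_alpha=0.01, random_state=0)
    clf.fit(X_train[idx], y_train[idx])
    size = clf.tree_.node_count      # tree size (number of nodes)
    acc = clf.score(X_test, y_test)  # accuracy on the held-out test set
    print(f"n={n:4d}  tree size={size:3d}  accuracy={acc:.3f}")
```

Plotting tree size and accuracy against n produces curves of the kind shown in Figure 1; the pattern the argument turns on is that accuracy tends to level off well before the full training set is used, while tree size continues to grow.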
