I. INTRODUCTIONMost Thai people are

I. INTRODUCTION
Most Thai people are familiar with Thai herbs, which have
been used for curing many simple symptoms, such as
indigestion, stomachache, fever and so on. Many organizations
[1-4] have published useful Thai herb information on their
websites. Although these information are similar and somewhat
consistent, none have completed information regarding names
of Thai herbs, symptoms, and parts of used. For example, one
webpage only gives scientific name but does not give common
name; while another webpage only indicate which Thai herb to
be used for curing which symptom, but does not provide which
part of the Thai herb to be used.
This paper proposes Thai herb information extraction
process to recognize useful Thai herb information and extract
them from multiple websites. The process employed an open
source HTML parser called JSOUP [5] and several template
files. The overall process has two main phase; symptom name
collection phase and treatment information extraction phase.
Output of the first phase is a file containing symptom names,
which collecting from multiple websites using synonyms of a
word 'treat'. The second phase extracted Thai herb treatment
information, including Thai herb names and medicinal used.
Names include science name, common name, and family name.
Medicinal used includes part of used and symptom name.
The challenge of Thai herb information extraction is that
each websites have different HTML structures. Some are
simple and some are complicated. When one Thai herb can
treat many symptoms, the symptom names are listed
continuously, with or without delimiter. However, eachsymptom can be treated using different Thai herb parts. The
process needs to recognize which content is the Thai herb part
and which content is the symptom.
Not only that Thai herb information from multiple sources
has no fixed pattern content, but there is also inherently nested
content and listed content structure within medicinal-used topic
of each page. That is, medicinal-used topic can contain many
part-of-used topics, and each part-of-used topic can contain
many symptoms that it can cure.
This paper divides into 3 parts; the next section
presents related work on web information extraction. Section 3
describes the proposed process for extracting Thai herb
information from multiple websites. The last section describes
the experiment process and evaluates precision and recall of the
proposed process.

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

I. INTRODUCTIONMost Thai people are familiar with Thai herbs, which havebeen used for curing many simple symptoms, such asindigestion, stomachache, fever and so on. Many organizations[1-4] have published useful Thai herb information on theirwebsites. Although these information are similar and somewhatconsistent, none have completed information regarding namesof Thai herbs, symptoms, and parts of used. For example, onewebpage only gives scientific name but does not give commonname; while another webpage only indicate which Thai herb tobe used for curing which symptom, but does not provide whichpart of the Thai herb to be used.This paper proposes Thai herb information extractionprocess to recognize useful Thai herb information and extractthem from multiple websites. The process employed an opensource HTML parser called JSOUP [5] and several templatefiles. The overall process has two main phase; symptom namecollection phase and treatment information extraction phase.Output of the first phase is a file containing symptom names,which collecting from multiple websites using synonyms of aword 'treat'. The second phase extracted Thai herb treatmentinformation, including Thai herb names and medicinal used.Names include science name, common name, and family name.Medicinal used includes part of used and symptom name.The challenge of Thai herb information extraction is thateach websites have different HTML structures. Some aresimple and some are complicated. When one Thai herb cantreat many symptoms, the symptom names are listedcontinuously, with or without delimiter. However, eachsymptom can be treated using different Thai herb parts. Theprocess needs to recognize which content is the Thai herb partand which content is the symptom.Not only that Thai herb information from multiple sourceshas no fixed pattern content, but there is also inherently nestedcontent and listed content structure within medicinal-used topicof each page. That is, medicinal-used topic can contain manypart-of-used topics, and each part-of-used topic can containmany symptoms that it can cure.This paper divides into 3 parts; the next sectionpresents related work on web information extraction. Section 3describes the proposed process for extracting Thai herbinformation from multiple websites. The last section describesthe experiment process and evaluates precision and recall of theproposed process.

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

I. บทนำคนไทยส่วนใหญ่คุ้นเคยกับสมุนไพรไทยที่ได้ถูกนำมาใช้เพื่อการรักษาอาการที่เรียบง่ายเป็นจำนวนมากเช่นอาหารไม่ย่อยปวดท้องมีไข้และอื่นๆ หลายองค์กร[1-4] มีการเผยแพร่ข้อมูลที่เป็นประโยชน์สมุนไพรไทยของพวกเขาในเว็บไซต์ แม้ว่าข้อมูลเหล่านี้จะคล้ายกันและค่อนข้างสอดคล้องกันไม่มีใครได้เสร็จสิ้นการข้อมูลเกี่ยวกับชื่อของสมุนไพรไทยอาการและชิ้นส่วนที่ใช้ ตัวอย่างเช่นหนึ่งในหน้าเว็บเพียง แต่ช่วยให้ชื่อทางวิทยาศาสตร์ แต่ไม่ให้ร่วมกันชื่อ; ในขณะที่หน้าเว็บอีกเพียงระบุว่าสมุนไพรไทยเพื่อนำมาใช้สำหรับการบ่มซึ่งอาการแต่ไม่ได้ให้การที่เป็นส่วนหนึ่งของสมุนไพรไทยที่จะใช้. กระดาษนี้นำเสนอการสกัดข้อมูลสมุนไพรไทยขั้นตอนในการรับรู้ข้อมูลสมุนไพรไทยที่มีประโยชน์และสารสกัดจากพวกเขาจากหลายเว็บไซต์. กระบวนการที่ใช้เปิดแหล่งที่มาที่เรียกว่า HTML parser JSOUP [5] และอีกหลายแม่แบบของไฟล์ กระบวนการโดยรวมมีสองขั้นตอนหลัก ชื่ออาการขั้นตอนการจัดเก็บและการรักษาขั้นตอนการสกัดข้อมูล. การส่งออกของเฟสแรกเป็นไฟล์ที่มีชื่ออาการที่เก็บรวบรวมจากเว็บไซต์ต่างๆโดยใช้คำพ้องความหมายของคำว่า'รักษา' ขั้นที่สองสกัดการรักษาสมุนไพรไทยข้อมูลรวมทั้งชื่อสมุนไพรไทยและสมุนไพรที่ใช้. ชื่อรวมถึงชื่อวิทยาศาสตร์ชื่อสามัญและชื่อครอบครัว. ยาที่ใช้รวมถึงส่วนหนึ่งของการใช้และชื่ออาการ. ความท้าทายของสมุนไพรสกัดข้อมูลไทยคือการที่เว็บไซต์แต่ละมีโครงสร้างที่แตกต่างกัน HTML บางคนมีความเรียบง่ายและบางส่วนมีความซับซ้อน เมื่อหนึ่งในสมุนไพรไทยสามารถรักษาอาการหลายชื่ออาการที่มีอยู่อย่างต่อเนื่องโดยมีหรือไม่มีตัวคั่น อย่างไรก็ตาม eachsymptom สามารถรักษาได้โดยใช้ชิ้นส่วนที่แตกต่างกันสมุนไพรไทย กระบวนการความต้องการที่จะรับรู้เนื้อหาที่เป็นส่วนสมุนไพรไทยและเนื้อหาเป็นอาการ. ไม่เพียง แต่ที่ข้อมูลสมุนไพรไทยจากหลายแหล่งไม่มีเนื้อหารูปแบบคงที่แต่ยังมีที่ซ้อนกันโดยเนื้อแท้เนื้อหาและโครงสร้างเนื้อหาอยู่ภายในหัวข้อสมุนไพรที่ใช้ของแต่ละหน้า นั่นคือหัวข้อที่ใช้ยาสามารถมีจำนวนมากส่วนหนึ่งของการใช้หัวข้อและแต่ละส่วนของหัวข้อที่ใช้สามารถมีอาการหลายอย่างที่มันสามารถรักษา. กระดาษนี้จะแบ่งออกเป็น 3 ส่วนคือ ส่วนถัดไปนำเสนอผลงานที่เกี่ยวข้องในการสกัดข้อมูลเว็บ ส่วนที่ 3 อธิบายขั้นตอนที่นำเสนอสำหรับการสกัดสมุนไพรไทยข้อมูลจากหลายเว็บไซต์ ส่วนสุดท้ายจะอธิบายขั้นตอนการทดสอบและประเมินความแม่นยำและการเรียกคืนของกระบวนการเสนอ

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

ผมแนะนำ
ส่วนใหญ่คนไทยจะคุ้นเคยกับสมุนไพร ซึ่งได้ถูกใช้เพื่อรักษาอาการง่ายๆ

มากมาย เช่น ท้องอืด ปวดท้อง ไข้ และ หลายองค์กร
[ 1-4 ] เผยแพร่ประโยชน์สมุนไพรไทยข้อมูลบนเว็บไซต์ของตน

แม้ว่าข้อมูลเหล่านี้จะเหมือนกันและค่อนข้าง
สอดคล้องกัน ไม่มีข้อมูลเกี่ยวกับชื่อเสร็จ
สมุนไพรไทย อาการและชิ้นส่วนที่ใช้ ตัวอย่างเช่นหนึ่ง
เว็บเพจให้แค่ ชื่อวิทยาศาสตร์ แต่ไม่ให้ชื่อสามัญ
; ในขณะที่เว็บอื่นเท่านั้น ซึ่งบ่งชี้ถึงสมุนไพรที่ใช้รักษาอาการ

ซึ่ง แต่ไม่ได้ให้ที่
ส่วนของสมุนไพรที่จะใช้ กระดาษนี้นำเสนอ

ขั้นตอนการสกัดข้อมูล สมุนไพรไทย เพื่อรับรู้ข้อมูลสมุนไพร ไทยมีประโยชน์และสกัด
พวกเขาจากเว็บไซต์หลายกระบวนการที่ใช้เปิดแหล่งที่มา HTML parser เรียกว่า jsoup
[ 5 ] และไฟล์แม่แบบ
หลาย กระบวนการโดยรวมมีหลักสองเฟส ชื่อเฟส การเก็บและรักษาอาการ

ขั้นตอนการสกัดข้อมูล การแสดงผลของเฟสแรก คือ ไฟล์ที่มีชื่ออาการ
ซึ่งรวบรวมจากหลายเว็บไซต์ใช้คำพ้องความหมายของ
คำ ' รักษา ' ระยะที่สองสกัด
รักษาสมุนไพรรายละเอียด ได้แก่ ชื่อสมุนไพรและใช้สมุนไพร
ชื่อรวมวิทยาศาสตร์ชื่อ ชื่อ และ นามสกุล ใช้ยารวมถึงส่วนหนึ่งของ

และใช้ชื่อเกิดอาการ ความท้าทายของการสกัดข้อมูล สมุนไพรไทย คือ
แต่ละเว็บไซต์มีโครงสร้าง HTML แตกต่างกัน บาง
ง่ายและมีความซับซ้อน เมื่อสมุนไพรสามารถ
รักษาอาการหลายอาการอยู่
ชื่ออย่างต่อเนื่อง มี หรือ ไม่มีตัวคั่น . อย่างไรก็ตาม eachsymptom สามารถรักษาที่แตกต่างกันโดยใช้สมุนไพรส่วน
กระบวนการต้องจำเนื้อหาที่เป็นสมุนไพร ซึ่งเป็นเนื้อหา และส่วน

ไม่ใช่แค่อาการ ข้อมูลสมุนไพรจากหลายแหล่ง
มีรูปแบบคงที่ เนื้อหา แต่ยังมีเนื้อแท้ที่ซ้อนกัน
เนื้อหาที่แสดงเนื้อหาและโครงสร้างภายในหัวข้อ
สรรพคุณของแต่ละหน้า นั่นคือสรรพคุณ หัวข้อจะประกอบด้วยส่วนต่างๆ
ใช้หัวข้อและส่วนของแต่ละหัวข้อสามารถใช้มีหลายอาการที่สามารถรักษา
.
กระดาษนี้จะถูกแบ่งออกเป็น 3 ส่วน ส่วนที่เกี่ยวข้องกับงานนำเสนอ
ถัดไปในการสกัดข้อมูลเว็บ หมวด 3
อธิบายเสนอกระบวนการสกัด
สมุนไพรไทยข้อมูลจากเว็บไซต์หลาย ส่วนสุดท้ายอธิบายถึงกระบวนการทดลองและประเมินความถูกต้อง

เสนอและเรียกคืนของกระบวนการ

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.