it was verified whether the space w

it was verified whether the space was a sentence break or not.
The system was trained and tested with subsets of the ORCHID corpus
(Charoenporn et al., 1997), and 80% break detection and
9% false-break rates were achieved.
An extension of the algorithm was proposed by Charoenpornsawat and Sornlertlamvanich (2001).
Not only the POSs of surrounding
words but also collocations of surrounding words and
lengths of surrounding token texts were used as the features
for determining whether space characters were sentence
boundaries.
These features were confirmed to be useful.
In (Charoenpornsawat and Sornlertlamvanich, 2001),
these features were extracted automatically by machine learning using the system Winnow.
Winnow was also used for sentence break detection.
Compared to the POS n-gram model,
a 1.7% improvement of break-detection rate
and a 79% reduction of false-break rate were achieved.
Although these gains are substantial,
the algorithms depend strongly on word segmentation and POS tagging.
A larger POS tagged corpus is needed to improve all these components

it was verified whether the space was a sentence break or not. 
The system was trained and tested with subsets of the ORCHID corpus
(Charoenporn et al., 1997), and 80% break detection and
9% false-break rates were achieved. 
An extension of the algorithm was proposed by Charoenpornsawat and Sornlertlamvanich (2001). 
Not only the POSs of surrounding
words but also collocations of surrounding words and
lengths of surrounding token texts were used as the features
for determining whether space characters were sentence
boundaries. 
These features were confirmed to be useful.
In (Charoenpornsawat and Sornlertlamvanich, 2001), 
these features were extracted automatically by machine learning using the system Winnow. 
Winnow was also used for sentence break detection. 
Compared to the POS n-gram model, 
a 1.7% improvement of break-detection rate 
and a 79% reduction of false-break rate were achieved. 
Although these gains are substantial, 
the algorithms depend strongly on word segmentation and POS tagging. 
A larger POS tagged corpus is needed to improve all these components

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

it was verified whether the space was a sentence break or not. The system was trained and tested with subsets of the ORCHID corpus(Charoenporn et al., 1997), and 80% break detection and9% false-break rates were achieved. An extension of the algorithm was proposed by Charoenpornsawat and Sornlertlamvanich (2001). Not only the POSs of surroundingwords but also collocations of surrounding words andlengths of surrounding token texts were used as the featuresfor determining whether space characters were sentenceboundaries. These features were confirmed to be useful.In (Charoenpornsawat and Sornlertlamvanich, 2001), these features were extracted automatically by machine learning using the system Winnow. Winnow was also used for sentence break detection. Compared to the POS n-gram model, a 1.7% improvement of break-detection rate and a 79% reduction of false-break rate were achieved. Although these gains are substantial, the algorithms depend strongly on word segmentation and POS tagging. A larger POS tagged corpus is needed to improve all these components

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

มันคือการตรวจสอบว่าพื้นที่ที่เป็นตัวแบ่งประโยคหรือไม่.
ระบบที่ได้รับการฝึกอบรมและการทดสอบด้วยการย่อยของร่างกายกล้วยไม้
(เจริญพร et al., 1997) และ 80% การตรวจสอบการหยุดพักและ
9% อัตราเท็จแบ่งกำลังประสบความสำเร็จ.
ส่วนหนึ่งของขั้นตอนวิธีการที่เสนอโดย Charoenpornsawat และ Sornlertlamvanich (2001).
ไม่เพียง แต่ poss
รอบคำแต่ยัง collocations รอบคำและความยาวของรอบตำราโทเค็นถูกนำมาใช้เป็นคุณสมบัติในการพิจารณาว่าตัวอักษรพื้นที่เป็นประโยคขอบเขต. คุณสมบัติเหล่านี้ ได้รับการยืนยันที่จะเป็นประโยชน์. ใน (Charoenpornsawat และ Sornlertlamvanich, 2001), คุณสมบัติเหล่านี้ถูกสกัดโดยอัตโนมัติโดยการเรียนรู้เครื่องที่ใช้ระบบโปรย. โปรยก็ยังใช้สำหรับการตรวจสอบแบ่งประโยค. เมื่อเทียบกับ POS รุ่น N-กรัมการปรับปรุง% 1.7 ของ อัตราการทำลายการตรวจสอบและการลดลง79% ของราคาที่ผิดพลาดแบ่งกำลังประสบความสำเร็จ. แม้ว่ากำไรเหล่านี้มีความสำคัญขั้นตอนวิธีการขึ้นอยู่อย่างมากในการตัดคำและการติดแท็ก POS. คลังข้อมูลแท็ก POS ที่มีขนาดใหญ่เป็นสิ่งจำเป็นในการปรับปรุงทุกองค์ประกอบเหล่านี้

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

มันถูกยืนยันว่าพื้นที่เป็นประโยคที่แตก หรือ ไม่
ระบบการฝึกอบรมและทดสอบกับชุดย่อยของกล้วยไม้คลังข้อมูล
( charoenporn et al . , 1997 ) และ 80% แบ่งการตรวจและ
9 % แบ่งเป็นเท็จอัตรารับ
เป็นส่วนขยายของอัลกอริทึมที่เสนอโดย charoenpornsawat และ sornlertlamvanich ( 2001 ) ไม่เพียง แต่รูปแบบของรอบ

คำ แต่ยัง collocations รอบคำและความยาวของโทเค็นรอบ
ข้อความที่ใช้เป็นคุณสมบัติสำหรับการกำหนดว่าตัวอักษรพื้นที่
มีขอบเขตประโยค

คุณสมบัติเหล่านี้ได้รับการยืนยันที่จะเป็นประโยชน์ .
( charoenpornsawat และ sornlertlamvanich , 2001 ) ,
คุณสมบัติเหล่านี้ถูกสกัดโดยอัตโนมัติโดยเครื่องการเรียนรู้โดยใช้ระบบเป้าหมาย .
ฝัดยังใช้ประโยคทำลายการตรวจสอบ
เมื่อเทียบกับ n-gram POS แบบ
1.7 % การปรับปรุงแบ่งตรวจสอบอัตรา
และร้อยละ 79 อัตราการแบ่งเท็จถูกความ
ถึงแม้ว่าผลประโยชน์เหล่านี้เป็นรูปธรรม
อัลกอริทึมขึ้นอยู่อย่างมากในการตัดคำ และ ระบบการติดแท็ก
POS , คลังข้อมูลขนาดใหญ่ จะต้องปรับปรุงองค์ประกอบเหล่านี้ทั้งหมด

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.