1.Create a initial set of keywords

1.Create a initial set of keywords relevant to security defects
2. Mine the code review repositories and populate database.
3. Search the database using initial set of keywords and build a CSV ﬁle (Corpus). Each entry in the csv ﬁle is a code review comments containing at least one of the predetermined keywords.
4.Because,many of the comments and texts contains code snippet, apply identiﬁer splitting rules on the corpus. (i.e. isBufferFull becomes "is Buffer Full" or read_string becomes "readstring").
5. Clean the corpus.Remove whitespace,punctuation,and numbers. Convert all words to lowercase. Create list of tokens for each document(i.e. rowincsv) in the Corpus.
6. Apply porter stemming algorithm to ﬁnd the stem of each of the tokens. (i.e. buffer, buffered, buffering all becomes buffer).
7. Create a Document Term matrix from the corpus.
8. Determine the words those co-occurred frequently with each of our predetermined keywords.
9. Manually inspect all the frequently co-occurring words, to determine which keywords should be added to the predetermined keywords list. The last row of Table 5 lists the keywords,we added after the test-mining.

1.Create a initial set of keywords relevant to security defects
2. Mine the code review repositories and populate database. 
3. Search the database using initial set of keywords and build a CSV ﬁle (Corpus). Each entry in the csv ﬁle is a code review comments containing at least one of the predetermined keywords. 
4.Because,many of the comments and texts contains code snippet, apply identiﬁer splitting rules on the corpus. (i.e. isBufferFull becomes "is Buffer Full" or read_string becomes "readstring"). 
5. Clean the corpus.Remove whitespace,punctuation,and numbers. Convert all words to lowercase. Create list of tokens for each document(i.e. rowincsv) in the Corpus. 
6. Apply porter stemming algorithm to ﬁnd the stem of each of the tokens. (i.e. buffer, buffered, buffering all becomes buffer). 
7. Create a Document Term matrix from the corpus. 
8. Determine the words those co-occurred frequently with each of our predetermined keywords. 
9. Manually inspect all the frequently co-occurring words, to determine which keywords should be added to the predetermined keywords list. The last row of Table 5 lists the keywords,we added after the test-mining.

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

1.Create a initial set of keywords relevant to security defects2. Mine the code review repositories and populate database. 3. Search the database using initial set of keywords and build a CSV ﬁle (Corpus). Each entry in the csv ﬁle is a code review comments containing at least one of the predetermined keywords. 4.Because,many of the comments and texts contains code snippet, apply identiﬁer splitting rules on the corpus. (i.e. isBufferFull becomes "is Buffer Full" or read_string becomes "readstring"). 5. Clean the corpus.Remove whitespace,punctuation,and numbers. Convert all words to lowercase. Create list of tokens for each document(i.e. rowincsv) in the Corpus. 6. Apply porter stemming algorithm to ﬁnd the stem of each of the tokens. (i.e. buffer, buffered, buffering all becomes buffer). 7. Create a Document Term matrix from the corpus. 8. Determine the words those co-occurred frequently with each of our predetermined keywords. 9. Manually inspect all the frequently co-occurring words, to determine which keywords should be added to the predetermined keywords list. The last row of Table 5 lists the keywords,we added after the test-mining.

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

1.Create ชุดเริ่มต้นของคำหลักที่เกี่ยวข้องกับข้อบกพร่องการรักษาความปลอดภัย
2 เหมืองที่เก็บตรวจสอบรหัสและเติมฐานข้อมูล.
3 ค้นหาฐานข้อมูลโดยใช้การตั้งค่าเริ่มต้นของคำหลักและสร้าง CSV สาย le (คอร์ปัส) รายการในสาย CSV le แต่ละความคิดเห็นตรวจสอบรหัสที่มีอย่างน้อยหนึ่งของคำหลักที่กำหนดไว้.
4.Because หลายความคิดเห็นและข้อความที่มีข้อมูลโค้ดที่ใช้กฎการแยกสายเอ้อระบุในคลัง (เช่น isBufferFull กลายเป็น "เป็นบัฟเฟอร์เต็ม" หรือ read_string กลายเป็น "readstring").
5 ทำความสะอาดช่องว่าง corpus.Remove เครื่องหมายวรรคตอนและตัวเลข แปลงทุกคำที่เป็นตัวพิมพ์เล็ก สร้างรายการของสัญญาณสำหรับแต่ละเอกสาร (เช่น rowincsv) ใน Corpus ได้.
6 ใช้ประตูกั้นขั้นตอนวิธี fi ครั้งที่ต้นกำเนิดของแต่ละสัญญาณที่ (บัฟเฟอร์คือบัฟเฟอร์บัฟเฟอร์ทั้งหมดจะกลายเป็นบัฟเฟอร์).
7 สร้างเมทริกซ์ระยะเอกสารจากคลัง.
8 ตรวจสอบคำผู้ร่วมเกิดขึ้นบ่อยครั้งกับแต่ละคำหลักที่กำหนดไว้ของเรา.
9 ตรวจสอบด้วยตนเองทุกคำที่มักจะร่วมเกิดขึ้นเพื่อตรวจสอบว่าคำหลักที่ควรจะเพิ่มรายชื่อคำหลักที่กำหนดไว้ แถวสุดท้ายของตารางที่ 5 แสดงรายการคำหลักที่เราเพิ่มเข้ามาหลังจากการทดสอบการทำเหมืองแร่

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

1 . สร้างชุดเริ่มต้นของคำหลักที่เกี่ยวข้องกับการรักษาความปลอดภัยข้อบกพร่อง
2 ของผมรหัสตรวจสอบที่เก็บและอาศัยฐานข้อมูล
3 ค้นหาฐานข้อมูลการตั้งค่าเริ่มต้นของคำหลักและสร้าง CSV จึงเลอ ( Corpus ) แต่ละรายการใน CSV จึงเลอเป็นรหัสตรวจสอบความคิดเห็นที่มีอย่างน้อยหนึ่งของการกำหนดคำหลัก
4 . เพราะหลายของความคิดเห็นและข้อความที่ประกอบด้วยรหัสข้อมูลใช้ identi จึงเอ้อแยกกฎในคลังข้อมูล ( เช่น isbufferfull กลายเป็น " บัฟเฟอร์เต็ม " หรือ read_string กลายเป็น " readstring " )
5 ทําความสะอาดข้อมูล ลบช่องว่าง , ตัวเลขเครื่องหมายวรรคตอนและ . แปลงคำทั้งหมดเป็นตัวพิมพ์เล็ก สร้างรายชื่อของสัญญาณสำหรับเอกสารแต่ละ ( เช่น rowincsv ) ในคลังข้อมูล
6 ใช้พอร์เตอร์ ( ขั้นตอนวิธีการถ่ายทอดและก้านของแต่ละสัญญาณ ( เช่น กันชน , กันชน ,บัฟเฟอร์ทั้งหมดกลายเป็นบัฟเฟอร์ )
7 สร้างเมทริกซ์ระยะของเอกสารจากคลังข้อมูล
8 หาคำเหล่านี้เกิดขึ้นบ่อยกับแต่ละ Co ของเรากำหนดคำหลัก
9 ตรวจสอบด้วยตนเองทั้งหมดบ่อย Co คําที่เกิดขึ้น เพื่อหาคำหลักที่ควรเพิ่มการกำหนดคำสำคัญรายการ แถวสุดท้ายของตาราง 5 รายการคำหลักที่เราเพิ่มเข้ามาหลังจากการทดสอบเหมืองแร่

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.