1. Input Text file
Segmentation
Part-of-Speech tagging
2. Thai Grammar Parser
TGParser > Parsed Sentences
3. Extract CFG Rules from the parsed results
FrequencyRuleExtractor.java
4. Find Probability for each rule
(Perl program)
5. Parse the sentences using NLTK
6. Use Evalb to evaluate the results