Conference paper
Automatic content extraction and visualization of Thai websites for improved information representation
2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp.2229-2234
IEEE
IEEE International Conference on Systems, Man, and Cybernetics, SMC 2012 (Seoul, 14/10/2012–16/10/2012)
2012
Abstract
This paper presents an integrated approach to automatically provide an overview of content on Thai websites based on tag cloud. This approach is intended to address the information overload issue by presenting the overview to users in order that they could assess whether the information meets their needs. The approach has incorporated Web content extraction, Thai word segmentation, and information presentation to generate a tag cloud in Thai language as an overview of the key content in the webpage. From the experimental study, the generated Thai Tag clouds are able to provide an overview of the tags which frequently appear in the title and body of the content. Moreover, the first few lines in the tag cloud offer an improved readability.
Details
- Title
- Automatic content extraction and visualization of Thai websites for improved information representation
- Authors/Creators
- W. Thanadechteemapat (Author/Creator) - Murdoch UniversityC.C. Fung (Author/Creator) - Murdoch University
- Publication Details
- 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp.2229-2234
- Conference
- IEEE International Conference on Systems, Man, and Cybernetics, SMC 2012 (Seoul, 14/10/2012–16/10/2012)
- Publisher
- IEEE
- Identifiers
- 991005542304507891
- Copyright
- © 2012 IEEE.
- Murdoch Affiliation
- School of Information Technology
- Language
- English
- Resource Type
- Conference paper
Metrics
303 File views/ downloads
55 Record Views