Unsupervised Learning of mDTD Extraction Patterns for Web Text Mining
ARTICLE
Dongseok Kim, Hanmin Jung, Gary Geunbae Lee
Information Processing & Management Volume 39, Number 4, ISSN 0306-4573
Abstract
Presents a new extraction pattern, modified Document Type Definition (mDTD), which relies on analytical interpretation to identify extraction target from the contents of Web documents. Experiments with 330 Korean and 220 English Web documents on audio and video shopping sites yielded an average extraction precision of 91.3% for Korean and 81.9% for English. (AEF)
Citation
Kim, D., Jung, H. & Lee, G.G. (2003). Unsupervised Learning of mDTD Extraction Patterns for Web Text Mining. Information Processing & Management, 39(4), 623. Retrieved August 16, 2024 from https://www.learntechlib.org/p/63938/.
ERIC is sponsored by the Institute of Education Sciences (IES) of the U.S. Department of Education.
Copyright for this record is held by the content creator. For more details see ERIC's copyright policy.