CEW-DTW: A new time series model for text mining

Guandong Zhang, Hao Yu, Lu Xiao

Research output: Chapter in Book/Entry/PoemConference contribution

1 Scopus citations

Abstract

The keyword information is usually applied to describe answers. In most of the previous studies, researchers usually rank answers according to keyword retrieval, which fails to consider the importance of the time sequence of keywords in answers. In this paper, we propose CEW-DTW, a new time series model for answer ranking. This model considers the importance of the time sequence of keywords as well as the amount of keywords. CEW-DTW is developed from a carefully designed model, Dynamic Time Warping-Delta (DTW-D). We choose Amazon question/answer data as our evaluation dataset. We apply Entropy to remove noise in answer vectors. In experiments, we apply normalized discounted cumulative gain (nDCG) as the assess rule to test models. CEW-DTW is proven to have a better performance than Dynamic Time Warping (DTW) and Dynamic Time Warping-Delta (DTW-D) in answer ranking. An extensive set of evaluation results demonstrates the effectiveness of the CEW-DTW model for answer ranking.

Original languageEnglish (US)
Title of host publication2018 International Conference on Information and Communications Technology, ICOIACT 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages158-162
Number of pages5
ISBN (Electronic)9781538609545
DOIs
StatePublished - Apr 26 2018
Event1st International Conference on Information and Communications Technology, ICOIACT 2018 - Yogyakarta, Indonesia
Duration: Mar 6 2018Mar 7 2018

Publication series

Name2018 International Conference on Information and Communications Technology, ICOIACT 2018
Volume2018-January

Other

Other1st International Conference on Information and Communications Technology, ICOIACT 2018
Country/TerritoryIndonesia
CityYogyakarta
Period3/6/183/7/18

Keywords

  • Cosine Similarity
  • Dynamic Time Warping
  • Entropy
  • Euclidean Distance
  • Time Series

ASJC Scopus subject areas

  • Information Systems
  • Computer Networks and Communications
  • Hardware and Architecture
  • Control and Optimization
  • Instrumentation

Fingerprint

Dive into the research topics of 'CEW-DTW: A new time series model for text mining'. Together they form a unique fingerprint.

Cite this