TDM modeling and evaluation of different domain transforms for LSI

T Jaber, A Amira, P Milligan

    Research output: Contribution to journalArticle

    2 Citations (Scopus)

    Abstract

    Latent semantic indexing (LSI) is a popular technique used in information retrieval (IR) applications. This paper presents a novel evaluation strategy based on the use of image processing tools. The authors evaluate the use of the discrete cosine transform (DCT) and Cohen Daubechies Feauveau 9/7 (CDF9/7) wavelet transform as a preprocessing step for the singular value decomposition (SVD) step of the LSI system. In addition, the effect of different threshold types on the search results is examined. The results show that accuracy can be increased by applying both transforms as a preprocessing step, with better performance for the hard-threshold function. The choice of the best threshold value is a key factor in the transform process. This paper also describes the most effective structure for the database to facilitate efficient searching in the LSI system. (c) 2009 Elsevier B.V. All rights reserved.
    LanguageEnglish
    Pages2406-2417
    JournalNeurocomputing
    Volume72
    Issue number10-12,
    DOIs
    Publication statusPublished - Jun 2009

    Fingerprint

    Time division multiplexing
    Semantics
    Discrete cosine transforms
    Singular value decomposition
    Information retrieval
    Wavelet transforms
    Image processing

    Keywords

    • Latent semantic indexing
    • Information retrieval
    • Discrete cosine transform
    • Singular value decomposition
    • Cohen Daubechies Feauveau 9/7
    • Hard thresholding
    • Soft thresholding

    Cite this

    Jaber, T ; Amira, A ; Milligan, P. / TDM modeling and evaluation of different domain transforms for LSI. 2009 ; Vol. 72, No. 10-12,. pp. 2406-2417.
    @article{bc0ce063dadb4ab496fe3a3bf956365e,
    title = "TDM modeling and evaluation of different domain transforms for LSI",
    abstract = "Latent semantic indexing (LSI) is a popular technique used in information retrieval (IR) applications. This paper presents a novel evaluation strategy based on the use of image processing tools. The authors evaluate the use of the discrete cosine transform (DCT) and Cohen Daubechies Feauveau 9/7 (CDF9/7) wavelet transform as a preprocessing step for the singular value decomposition (SVD) step of the LSI system. In addition, the effect of different threshold types on the search results is examined. The results show that accuracy can be increased by applying both transforms as a preprocessing step, with better performance for the hard-threshold function. The choice of the best threshold value is a key factor in the transform process. This paper also describes the most effective structure for the database to facilitate efficient searching in the LSI system. (c) 2009 Elsevier B.V. All rights reserved.",
    keywords = "Latent semantic indexing, Information retrieval, Discrete cosine transform, Singular value decomposition, Cohen Daubechies Feauveau 9/7, Hard thresholding, Soft thresholding",
    author = "T Jaber and A Amira and P Milligan",
    year = "2009",
    month = "6",
    doi = "10.1016/j.neucom.2008.12.010",
    language = "English",
    volume = "72",
    pages = "2406--2417",
    number = "10-12,",

    }

    Jaber, T, Amira, A & Milligan, P 2009, 'TDM modeling and evaluation of different domain transforms for LSI', vol. 72, no. 10-12, pp. 2406-2417. https://doi.org/10.1016/j.neucom.2008.12.010

    TDM modeling and evaluation of different domain transforms for LSI. / Jaber, T; Amira, A; Milligan, P.

    Vol. 72, No. 10-12, 06.2009, p. 2406-2417.

    Research output: Contribution to journalArticle

    TY - JOUR

    T1 - TDM modeling and evaluation of different domain transforms for LSI

    AU - Jaber, T

    AU - Amira, A

    AU - Milligan, P

    PY - 2009/6

    Y1 - 2009/6

    N2 - Latent semantic indexing (LSI) is a popular technique used in information retrieval (IR) applications. This paper presents a novel evaluation strategy based on the use of image processing tools. The authors evaluate the use of the discrete cosine transform (DCT) and Cohen Daubechies Feauveau 9/7 (CDF9/7) wavelet transform as a preprocessing step for the singular value decomposition (SVD) step of the LSI system. In addition, the effect of different threshold types on the search results is examined. The results show that accuracy can be increased by applying both transforms as a preprocessing step, with better performance for the hard-threshold function. The choice of the best threshold value is a key factor in the transform process. This paper also describes the most effective structure for the database to facilitate efficient searching in the LSI system. (c) 2009 Elsevier B.V. All rights reserved.

    AB - Latent semantic indexing (LSI) is a popular technique used in information retrieval (IR) applications. This paper presents a novel evaluation strategy based on the use of image processing tools. The authors evaluate the use of the discrete cosine transform (DCT) and Cohen Daubechies Feauveau 9/7 (CDF9/7) wavelet transform as a preprocessing step for the singular value decomposition (SVD) step of the LSI system. In addition, the effect of different threshold types on the search results is examined. The results show that accuracy can be increased by applying both transforms as a preprocessing step, with better performance for the hard-threshold function. The choice of the best threshold value is a key factor in the transform process. This paper also describes the most effective structure for the database to facilitate efficient searching in the LSI system. (c) 2009 Elsevier B.V. All rights reserved.

    KW - Latent semantic indexing

    KW - Information retrieval

    KW - Discrete cosine transform

    KW - Singular value decomposition

    KW - Cohen Daubechies Feauveau 9/7

    KW - Hard thresholding

    KW - Soft thresholding

    U2 - 10.1016/j.neucom.2008.12.010

    DO - 10.1016/j.neucom.2008.12.010

    M3 - Article

    VL - 72

    SP - 2406

    EP - 2417

    IS - 10-12,

    ER -