Poisson approach to clustering analysis of regulatory sequences

HY Wang, H Zheng, Hu Jinglu

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

The presence of similar patterns in regulatory sequences may aid users in identifying co-regulated genes or inferring regulatory modules. By modelling pattern occurrences in regulatory regions with Poisson statistics, this paper presents a log likelihood ratio statistics-based distance measure to calculate pair-wise similarities between regulatory sequences. We employed it within three clustering algorithms: hierarchical clustering, Self-Organising Map, and a self-adaptive neural network. The results indicate that, in comparison to traditional clustering algorithms, the incorporation of the log likelihood ratio statistics-based distance into the learning process may offer considerable improvements in the process of regulatory sequence-based classification of genes
LanguageEnglish
Pages141-157
JournalInternational Journal of Computational Biology and Drug Design
Volume1
Issue number2
DOIs
Publication statusPublished - Sep 2008

Fingerprint

Statistics
Clustering algorithms
Genes
Self organizing maps
Neural networks

Cite this

@article{c98100de434a416cbba314a0ec95d694,
title = "Poisson approach to clustering analysis of regulatory sequences",
abstract = "The presence of similar patterns in regulatory sequences may aid users in identifying co-regulated genes or inferring regulatory modules. By modelling pattern occurrences in regulatory regions with Poisson statistics, this paper presents a log likelihood ratio statistics-based distance measure to calculate pair-wise similarities between regulatory sequences. We employed it within three clustering algorithms: hierarchical clustering, Self-Organising Map, and a self-adaptive neural network. The results indicate that, in comparison to traditional clustering algorithms, the incorporation of the log likelihood ratio statistics-based distance into the learning process may offer considerable improvements in the process of regulatory sequence-based classification of genes",
author = "HY Wang and H Zheng and Hu Jinglu",
year = "2008",
month = "9",
doi = "10.1504/IJCBDD.2008.020206",
language = "English",
volume = "1",
pages = "141--157",
journal = "International Journal of Computational Biology and Drug Design",
issn = "1756-0756",
number = "2",

}

Poisson approach to clustering analysis of regulatory sequences. / Wang, HY; Zheng, H; Jinglu, Hu.

In: International Journal of Computational Biology and Drug Design, Vol. 1, No. 2, 09.2008, p. 141-157.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Poisson approach to clustering analysis of regulatory sequences

AU - Wang, HY

AU - Zheng, H

AU - Jinglu, Hu

PY - 2008/9

Y1 - 2008/9

N2 - The presence of similar patterns in regulatory sequences may aid users in identifying co-regulated genes or inferring regulatory modules. By modelling pattern occurrences in regulatory regions with Poisson statistics, this paper presents a log likelihood ratio statistics-based distance measure to calculate pair-wise similarities between regulatory sequences. We employed it within three clustering algorithms: hierarchical clustering, Self-Organising Map, and a self-adaptive neural network. The results indicate that, in comparison to traditional clustering algorithms, the incorporation of the log likelihood ratio statistics-based distance into the learning process may offer considerable improvements in the process of regulatory sequence-based classification of genes

AB - The presence of similar patterns in regulatory sequences may aid users in identifying co-regulated genes or inferring regulatory modules. By modelling pattern occurrences in regulatory regions with Poisson statistics, this paper presents a log likelihood ratio statistics-based distance measure to calculate pair-wise similarities between regulatory sequences. We employed it within three clustering algorithms: hierarchical clustering, Self-Organising Map, and a self-adaptive neural network. The results indicate that, in comparison to traditional clustering algorithms, the incorporation of the log likelihood ratio statistics-based distance into the learning process may offer considerable improvements in the process of regulatory sequence-based classification of genes

UR - http://www.inderscience.com/browse/index.php

U2 - 10.1504/IJCBDD.2008.020206

DO - 10.1504/IJCBDD.2008.020206

M3 - Article

VL - 1

SP - 141

EP - 157

JO - International Journal of Computational Biology and Drug Design

T2 - International Journal of Computational Biology and Drug Design

JF - International Journal of Computational Biology and Drug Design

SN - 1756-0756

IS - 2

ER -