Streaming Audio Using MPEG–7 Audio Spectrum Envelope to Enable Self-similarity within Polyphonic Audio

J Doherty, K Curran, P McKevitt

Research output: Contribution to journal, Article

2 Citations (Scopus)

Abstract

Traditional packet-level Forward Error Correction approaches can limit errors for small, sporadic network losses, but when large portions of a stream drop out, listening quality becomes an issue. Services such as audio-on-demand drastically increase the load on networks; therefore new, robust and highly efficient coding algorithms are necessary. One method overlooked to date, which can work alongside existing audio compression schemes, takes account of the semantics and natural repetition of music through metadata tagging. Similarity detection within polyphonic audio has proved challenging within the field of Music Information Retrieval. We present a system which works at the content level, rendering it applicable to existing streaming services. The MPEG–7 Audio Spectrum Envelope (ASE) provides features for extraction and, combined with k-means clustering, enables self-similarity detection within polyphonic audio.
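
The clustering idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: short hand-made vectors stand in for ASE frames, the function names `cluster_frames` and `matching_frames` are hypothetical, and the deterministic farthest-point initialisation is an assumption chosen only to keep the example reproducible. Frames that fall into the same cluster are treated as similar, so an earlier frame could in principle stand in for a later one that was lost in transit.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cluster_frames(frames, k, iterations=20):
    """Plain k-means (Lloyd's algorithm); returns one cluster label per frame.

    Assumption: centroids start from a deterministic farthest-point
    selection instead of random sampling, so results are reproducible.
    """
    centroids = [list(frames[0])]
    while len(centroids) < k:
        farthest = max(
            frames,
            key=lambda f: min(squared_distance(f, c) for c in centroids),
        )
        centroids.append(list(farthest))

    labels = [0] * len(frames)
    for _ in range(iterations):
        # Assignment step: each frame joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(f, centroids[c]))
            for f in frames
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [f for f, lab in zip(frames, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def matching_frames(labels, index):
    """Indices of the other frames that share a cluster with frame `index`."""
    return [i for i, lab in enumerate(labels) if lab == labels[index] and i != index]
```

Given four frames that alternate between two spectral shapes, `cluster_frames(frames, 2)` assigns the first and third frames to one cluster and the second and fourth to the other, and `matching_frames` then lists the candidates that could replace a missing frame.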
Language: English
Pages: 190-202
Journal: Telkomnika
Volume: 15
Issue number: 1
Publication status: Published - 1 Mar 2017

Fingerprint

Audio streaming
Forward error correction
Metadata
Information retrieval
Semantics

Keywords

  • streaming audio

Cite this

@article{57d9881b109a4be98d1d8ad92ca140d1,
title = "Streaming Audio Using MPEG–7 Audio Spectrum Envelope to Enable Self-similarity within Polyphonic Audio",
abstract = "Traditional packet-level Forward Error Correction approaches can limit errors for small, sporadic network losses, but when large portions of a stream drop out, listening quality becomes an issue. Services such as audio-on-demand drastically increase the load on networks; therefore new, robust and highly efficient coding algorithms are necessary. One method overlooked to date, which can work alongside existing audio compression schemes, takes account of the semantics and natural repetition of music through metadata tagging. Similarity detection within polyphonic audio has proved challenging within the field of Music Information Retrieval. We present a system which works at the content level, rendering it applicable to existing streaming services. The MPEG–7 Audio Spectrum Envelope (ASE) provides features for extraction and, combined with k-means clustering, enables self-similarity detection within polyphonic audio.",
keywords = "streaming audio",
author = "J Doherty and K Curran and P McKevitt",
year = "2017",
month = "3",
day = "1",
language = "English",
volume = "15",
pages = "190--202",
journal = "TELKOMNIKA (Telecommunication, Computing, Electronics and Control)",
issn = "1693-6930",
number = "1",
}

Streaming Audio Using MPEG–7 Audio Spectrum Envelope to Enable Self-similarity within Polyphonic Audio. / Doherty, J; Curran, K; McKevitt, P.

In: Telkomnika, Vol. 15, No. 1, 01.03.2017, p. 190-202.

TY - JOUR

T1 - Streaming Audio Using MPEG–7 Audio Spectrum Envelope to Enable Self-similarity within Polyphonic Audio

AU - Doherty, J

AU - Curran, K

AU - McKevitt, P

PY - 2017/3/1

Y1 - 2017/3/1

AB - Traditional packet-level Forward Error Correction approaches can limit errors for small, sporadic network losses, but when large portions of a stream drop out, listening quality becomes an issue. Services such as audio-on-demand drastically increase the load on networks; therefore new, robust and highly efficient coding algorithms are necessary. One method overlooked to date, which can work alongside existing audio compression schemes, takes account of the semantics and natural repetition of music through metadata tagging. Similarity detection within polyphonic audio has proved challenging within the field of Music Information Retrieval. We present a system which works at the content level, rendering it applicable to existing streaming services. The MPEG–7 Audio Spectrum Envelope (ASE) provides features for extraction and, combined with k-means clustering, enables self-similarity detection within polyphonic audio.

KW - streaming audio

M3 - Article

VL - 15

SP - 190

EP - 202

JO - TELKOMNIKA (Telecommunication, Computing, Electronics and Control)

T2 - TELKOMNIKA (Telecommunication, Computing, Electronics and Control)

JF - TELKOMNIKA (Telecommunication, Computing, Electronics and Control)

SN - 1693-6930

IS - 1

ER -