TY - JOUR
T1 - On-Line Melody Extraction From Polyphonic Audio Using Harmonic Cluster Tracking
AU - Arora, Vipul
AU - Behera, Laxmidhar
PY - 2013/3
Y1 - 2013/3
N2 - Extraction of predominant melody from the musicalperformances containing various instruments is one of the mostchallenging task in the field of music information retrieval andcomputational musicology. This paper presents a novel frameworkwhich estimates predominant vocal melody in real-timeby tracking various sources with the help of harmonic clusters(combs) and then determining the predominant vocal source byusing the harmonic strength of the source. The novel on-lineharmonic comb tracking approach complies with both structuralas well as temporal constraints simultaneously. It relies uponthe strong higher harmonics for robustness against distortionof the first harmonic due to low frequency accompaniments, incontrast to the existing methods which track the pitch values. Thepredominant vocal source identification depends upon the novelidea of source dependant filtering of recognition score, whichallows the algorithm to be implemented on-line. The proposedmethod, although on-line, is shown to significantly outperformour implementation of a state-of-the-art offline method for vocalmelody extraction. Evaluations also show the reduction in octaveerror and the effectiveness of novel score filtering technique inenhancing the performance.
AB - Extraction of predominant melody from the musicalperformances containing various instruments is one of the mostchallenging task in the field of music information retrieval andcomputational musicology. This paper presents a novel frameworkwhich estimates predominant vocal melody in real-timeby tracking various sources with the help of harmonic clusters(combs) and then determining the predominant vocal source byusing the harmonic strength of the source. The novel on-lineharmonic comb tracking approach complies with both structuralas well as temporal constraints simultaneously. It relies uponthe strong higher harmonics for robustness against distortionof the first harmonic due to low frequency accompaniments, incontrast to the existing methods which track the pitch values. Thepredominant vocal source identification depends upon the novelidea of source dependant filtering of recognition score, whichallows the algorithm to be implemented on-line. The proposedmethod, although on-line, is shown to significantly outperformour implementation of a state-of-the-art offline method for vocalmelody extraction. Evaluations also show the reduction in octaveerror and the effectiveness of novel score filtering technique inenhancing the performance.
U2 - 10.1109/TASL.2012.2227731
DO - 10.1109/TASL.2012.2227731
M3 - Article
VL - 21
SP - 520
EP - 530
JO - IEEE Transactions on Audio, Speech and Language Processing
JF - IEEE Transactions on Audio, Speech and Language Processing
SN - 1558-7916
IS - 3
ER -