2. EXPERIMENTAL RESULTS
Copyright is held by the author/owner(s).
MediaEval 2012 Workshop, October 4-5, 2012, Pisa, Italy

We carry out a preliminary validation on the provided
[Figure 1: bar chart. y-axis: avg. F-score (%), 0 to 80. x-axis: descriptor sets V1 V2 V3 V4 V5 A1 A2 T1 T2 T3 T4 AV1 AV2 AV3 AT1 AT2 VT1 VT2 AVT1 AVT2 AVT3 AVT4 AVT5. Legend: linear SVM, 5-NN.]
Figure 1: Genre average F score = 2 · P · R/(P + R) (P = precision, R = recall; train-test percentage split of 50%).
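The F-score in the caption can be illustrated with a short sketch. It assumes that "genre average" means the unweighted mean of the per-genre F-scores; the function names and the toy labels are hypothetical and are not taken from the paper.

```python
from collections import Counter


def f_score(precision, recall):
    """F = 2*P*R / (P + R); defined as 0.0 when P + R is zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def genre_average_f_score(true_labels, predicted_labels):
    """Unweighted mean of per-genre F-scores (assumed averaging scheme)."""
    true_counts = Counter(true_labels)
    predicted_counts = Counter(predicted_labels)
    correct = Counter(t for t, p in zip(true_labels, predicted_labels) if t == p)
    scores = []
    for genre in sorted(true_counts):
        # Precision: correct predictions of this genre / all predictions of it.
        if predicted_counts[genre]:
            precision = correct[genre] / predicted_counts[genre]
        else:
            precision = 0.0
        # Recall: correct predictions of this genre / all true items of it.
        recall = correct[genre] / true_counts[genre]
        scores.append(f_score(precision, recall))
    return sum(scores) / len(scores)


# Hypothetical labels, for illustration only (not data from the paper).
truth = ["news", "news", "sports", "sports"]
guess = ["news", "sports", "sports", "sports"]
average_f = genre_average_f_score(truth, guess)
```

Weighting genres equally keeps frequent genres from dominating the score; a different averaging scheme would give different numbers.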
Descriptor sets shown in Figure 1:
V1    SURF with B-o-V-W
V2    Harris with B-o-V-W
V3    hist+HoG
V4    MPEG-7
V5    LBP+CCV+hist
A1    standard audio
A2    block-based audio [2]
T1    TF-IDF ASR LIUM [3]
T2    TF-IDF ASR LIMSI [4]
T3    TF-IDF metadata
T4    TF-IDF metadata+ASR LIMSI [4]
AV1   hist+HoG+block-based audio [2]
AV2   LBP+CCV+hist+block-based audio [2]
AV3   MPEG-7+block-based audio [2]
AT1   block-based audio [2]+TF-IDF ASR LIMSI [4]
AT2   block-based audio [2]+TF-IDF metadata
VT1   LBP+CCV+hist+TF-IDF metadata
VT2   LBP+CCV+hist+TF-IDF ASR LIMSI [4]
AVT1  standard audio+LBP+CCV+hist+TF-IDF ASR LIMSI [4]
AVT2  block-based audio [2]+LBP+CCV+hist+TF-IDF metadata
AVT3  block-based audio [2]+LBP+CCV+hist+TF-IDF ASR LIMSI [4]
AVT4  block-based [2] & standard audio+LBP+CCV+hist+TF-IDF ASR LIMSI [4]
AVT5  block-based [2] & standard audio+LBP+CCV+hist+TF-IDF metadata & ASR LIMSI [4]

... metadata (see T4). The ASR provided by LIMSI [4] tends to be more effective than the one from LIUM [3] (see T1 vs. T2). Metadata is more reliable than the ASR transcripts (improvement of 14% over ASR; see T3 vs. T2), but it cannot be computed automatically, since it is generated by users (e.g. tags, title). The discriminative power of metadata diminishes when it is mixed with audio-visual information (see AVT5).

2.2 Official runs
We use the linear SVM classifier with the following descriptors:
Run1 - LBP, CCV, hist (visual) and block-based audio [2];
Run2 - TF-IDF on ASR LIMSI [4] (evaluated using only the ASR test set videos);
Run3 - block-based audio [2] + LBP, CCV, hist + TF-IDF on ASR LIMSI [4];
Run4 - block-based audio [2];
Run5 - TF-IDF on metadata + ASR LIMSI [4].

Table 1: MAP for official runs.
team   Run1     Run2     Run3     Run4     Run5
ARF    19.41%   21.74%   22.04%   18.92%   37.93%
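For readers unfamiliar with the MAP measure reported for the official runs, the following is a minimal sketch of mean average precision. It assumes the standard non-interpolated definition and that every relevant video appears somewhere in each ranked list; the exact evaluation protocol of the benchmark is not given here, and the function names and toy rankings are hypothetical.

```python
def average_precision(ranked_relevance):
    """Non-interpolated AP for one ranked list of 0/1 relevance flags.

    Assumes all relevant items are present in the list, so the number of
    hits equals the total number of relevant items.
    """
    hits = 0
    precision_sum = 0.0
    for rank, relevant in enumerate(ranked_relevance, start=1):
        if relevant:
            hits += 1
            # Precision at the rank of each relevant item.
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(rankings):
    """Mean of the per-list AP values (e.g. one ranked list per genre)."""
    return sum(average_precision(r) for r in rankings) / len(rankings)


# Hypothetical rankings, for illustration only (not data from the paper).
rankings = [[1, 0, 1, 0], [0, 1, 1]]
map_value = mean_average_precision(rankings)
```

Because AP rewards relevant items that are ranked early, two runs with the same classification accuracy can still obtain different MAP values.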