[113] T. Mei, B. Yang, X.-S. Hua, and S. Li. Contextual video recommendation by multimodal
relevance and user feedback. ACM Transactions on Information Systems, 29(2):10, 2011.
DOI: 10.1145/1961209.1961213 125, 126
[114] A. Mesaros, T. Heittola, A. Eronen, and T. Virtanen. Acoustic event detection in real
life recordings. In IEEE EUSIPCO, pages 1267–1271, 2010. 111
[115] T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean. Distributed representations
of words and phrases and their compositionality. In Proc. of the Annual Conference on
Neural Information Processing Systems, pages 3111–3119, NIPS Foundation, 2013. 23,
115
[116] M.-F. Moens, K. Pastra, K. Saenko, and T. Tuytelaars. Vision and language integration
meets multimedia fusion. In Proc. of the ACM International Conference on Multimedia,
page 1493, 2016. DOI: 10.1145/2964284.2980537 59
[117] G. Monaci, P. Jost, P. Vandergheynst, B. Mailhe, S. Lesage, and R. Gribonval.
Learning multimodal dictionaries. IEEE TIP, 16(9):2272–2283, 2007. DOI:
10.1109/tip.2007.901813 118
[118] E. Morvant, A. Habrard, and S. Ayache. Majority Vote of Diverse Classifiers for
Late Fusion. Springer, 2014. DOI: 10.1007/978-3-662-44415-3_16 25
[119] J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Y. Ng. Multimodal deep learning.
In ICML, pages 689–696, 2011. 98
[120] P. X. Nguyen, G. Rogez, C. C. Fowlkes, and D. Ramanan. The open world of micro-
videos. CoRR, abs/1603.09439, 2016. 3
[121] L. Nie, M. Wang, L. Zhang, S. Yan, B. Zhang, and T.-S. Chua. Disease inference from
health-related questions via sparse deep learning. IEEE Transactions on Knowledge and
Data Engineering, 27(8):2107–2119, 2015. DOI: 10.1109/tkde.2015.2399298 42
[122] L. Nie, X. Wang, J. Zhang, X. He, H. Zhang, R. Hong, and Q. Tian. Enhancing micro-
video understanding by harnessing external sounds. In ACM MM, pages 1192–1200,
2017. DOI: 10.1145/3123266.3123313 90, 99, 101
[123] L. Nie, L. Zhang, Y. Yang, M. Wang, R. Hong, and T.-S. Chua. Beyond doctors: Future
health prediction from multimedia and multimodal observations. In Proc. of the ACM
Multimedia Conference, pages 591–600, 2015. DOI: 10.1145/2733373.2806217 30, 34,
83
[124] L. Nie, Y.-L. Zhao, M. Akbari, J. Shen, and T.-S. Chua. Bridging the vocabulary gap
between health seekers and healthcare knowledge. IEEE Transactions on Knowledge and
Data Engineering, 27(2):396–409, 2015. DOI: 10.1109/tkde.2014.2330813 51