BIBLIOGRAPHY 159
[113] T. Mei, B. Yang, X.-S. Hua, and S. Li. Contextual video recommendation by multimodal
relevance and user feedback. ACM Transactions on Information Systems, 29(2):10, 2011.
DOI: 10.1145/1961209.1961213 125, 126
[114] A. Mesaros, T. Heittola, A. Eronen, and T. Virtanen. Acoustic event detection in real
life recordings. In IEEE EUSIPCO, pages 1267–1271, 2010. 111
[115] T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean. Distributed representations
of words and phrases and their compositionality. In Proc. of the Annual Conference on
Neural Information Processing Systems, pages 3111–3119, NIPS Foundation, 2013. 23,
115
[116] M.-F. Moens, K. Pastra, K. Saenko, and T. Tuytelaars. Vision and language integration
meets multimedia fusion. In Proc. of the ACM International Conference on Multimedia,
page 1493, 2016. DOI: 10.1145/2964284.2980537 59
[117] G. Monaci, P. Jost, P. Vandergheynst, B. Mailhe, S. Lesage, and R. Gribonval.
Learning multimodal dictionaries. IEEE TIP, 16(9):2272–2283, 2007. DOI:
10.1109/tip.2007.901813 118
[118] E. Morvant, A. Habrard, and S. Ayache. Majority Vote of Diverse Classifiers for
Late Fusion. Springer, 2014. DOI: 10.1007/978-3-662-44415-3_16 25
[119] J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Y. Ng. Multimodal deep learning.
In ICML, pages 689–696, 2011. 98
[120] P. X. Nguyen, G. Rogez, C. C. Fowlkes, and D. Ramanan. The open world of
micro-videos. CoRR, abs/1603.09439, 2016. 3
[121] L. Nie, M. Wang, L. Zhang, S. Yan, B. Zhang, and T.-S. Chua. Disease inference from
health-related questions via sparse deep learning. IEEE Transactions on Knowledge and
Data Engineering, 27(8):2107–2119, 2015. DOI: 10.1109/tkde.2015.2399298 42
[122] L. Nie, X. Wang, J. Zhang, X. He, H. Zhang, R. Hong, and Q. Tian. Enhancing
micro-video understanding by harnessing external sounds. In ACM MM, pages 1192–1200,
2017. DOI: 10.1145/3123266.3123313 90, 99, 101
[123] L. Nie, L. Zhang, Y. Yang, M. Wang, R. Hong, and T.-S. Chua. Beyond doctors: Future
health prediction from multimedia and multimodal observations. In Proc. of the ACM
Multimedia Conference, pages 591–600, 2015. DOI: 10.1145/2733373.2806217 30, 34,
83
[124] L. Nie, Y.-L. Zhao, M. Akbari, J. Shen, and T.-S. Chua. Bridging the vocabulary gap
between health seekers and healthcare knowledge. IEEE Transactions on Knowledge and
Data Engineering, 27(2):396–409, 2015. DOI: 10.1109/tkde.2014.2330813 51
[125] K. Nigam and R. Ghani. Analyzing the effectiveness and applicability of co-training. In
CIKM, pages 86–93, 2000. DOI: 10.1145/354756.354805 25
[126] Q. Pan, D. Kong, C. H. Ding, and B. Luo. Robust non-negative dictionary learning. In
National Conference of the American Association for Artificial Intelligence, pages 2027–2033,
2014. 62
[127] R. Pan, Y. Zhou, B. Cao, N. N. Liu, R. Lukose, M. Scholz, and Q. Yang. One-class
collaborative filtering. In Proc. of the IEEE International Conference on Data Mining,
pages 502–511, 2008. DOI: 10.1109/icdm.2008.16 126
[128] S. L. Pancoast, M. Akbacak, and M. H. Sanchez. Supervised acoustic concept
extraction for multimedia event detection. In Proc. of the ACM International Workshop on
Audio and Multimedia Methods for Large-Scale Video Analysis, pages 9–14, 2012. DOI:
10.1145/2390214.2390219 110, 111
[129] J. Park, S.-J. Lee, S.-J. Lee, K. Kim, B.-S. Chung, and Y.-K. Lee. Online video
recommendation through tag-cloud aggregation. IEEE MultiMedia, 18(1):78–87, 2011.
DOI: 10.1109/mmul.2010.6 125, 126
[130] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel,
P. Prettenhofer, R. Weiss, V. Dubourg, et al. Scikit-learn: Machine learning in Python.
Journal of Machine Learning Research (JMLR), 12:2825–2830, 2011. 34, 68
[131] H. Peng, K. Li, B. Li, H. Ling, W. Xiong, and W. Hu. Predicting image memorability
by multi-view adaptive regression. In Proc. ACM International Conference on Multimedia,
pages 1147–1150, 2015. DOI: 10.1145/2733373.2806303 42
[132] M. Quadrana, A. Karatzoglou, B. Hidasi, and P. Cremonesi. Personalizing
session-based recommendations with hierarchical recurrent neural networks. In Proc. of
the 11th ACM Conference on Recommender Systems, pages 130–137, 2017. DOI:
10.1145/3109859.3109896 126, 127
[133] N. Quadrianto and C. H. Lampert. Learning multi-view neighborhood preserving
projections. In ICML, pages 425–432, 2011. 93
[134] G. A. Ramirez, T. Baltrušaitis, and L.-P. Morency. Modeling latent discriminative
dynamic of multi-dimensional affective signals. In International Conference on Affective
Computing and Intelligent Interaction, pages 396–406, Springer, 2011. DOI:
10.1007/978-3-642-24571-8_51 25
[135] M. Ravanelli, B. Elizalde, K. Ni, and G. Friedland. Audio concept classification with
hierarchical deep neural networks. In IEEE EUSIPCO, pages 606–610, 2014. 110
[136] M. Redi, N. O’Hare, R. Schifanella, M. Trevisiol, and A. Jaimes. 6 seconds of sound and
vision: Creativity in micro-videos. In Proc. of the IEEE Conference on Computer Vision and
Pattern Recognition, pages 4272–4279, 2014. DOI: 10.1109/cvpr.2014.544 3
[137] S. Rendle, C. Freudenthaler, Z. Gantner, and L. Schmidt-Thieme. BPR: Bayesian
personalized ranking from implicit feedback. In Proc. of the AUAI Conference on Uncertainty
in Artificial Intelligence, pages 452–461, 2009. DOI: 10.1142/s0218001416590011 126, 134
[138] S. D. Roy, T. Mei, W. Zeng, and S. Li. Towards cross-domain learning for social video
popularity prediction. IEEE Transactions on Multimedia, 15(6):1255–1267, 2013. DOI:
10.1109/tmm.2013.2265079 24
[139] J. Rupnik and J. Shawe-Taylor. Multi-view canonical correlation analysis. In Proc. of
Conference on Data Mining and Data Warehouses, pages 1–4, 2010. 39
[140] M. A. Saad, A. C. Bovik, and C. Charrier. Blind prediction of natural video
quality. IEEE Transactions on Image Processing, 23(3):1352–1365, 2014. DOI:
10.1109/tip.2014.2299154 22
[141] S. Sadanand and J. J. Corso. Action bank: A high-level representation of activity in video.
In CVPR, pages 1234–1241, 2012. DOI: 10.1109/cvpr.2012.6247806 109
[142] S. Sano, T. Yamasaki, and K. Aizawa. Degree of loop assessment in microvideo.
In IEEE International Conference on Image Processing, pages 5182–5186, 2014. DOI:
10.1109/icip.2014.7026049 3
[143] G. Schindler, M. Brown, and R. Szeliski. City-scale location recognition. In IEEE
CVPR, pages 1–7, 2007. DOI: 10.1109/cvpr.2007.383150 61
[144] E. Shutova, D. Kiela, and J. Maillard. Black holes and white rabbits: Metaphor
identification with visual features. In NAACL, pages 160–170, 2016. DOI:
10.18653/v1/n16-1020 25
[145] V. Sindhwani, P. Niyogi, and M. Belkin. A co-regularization approach to semi-supervised
learning with multiple views. In Proc. of the International Conference on Machine Learning,
pages 74–79, ACM, 2005. 26
[146] A. W. M. Smeulders. Early vs. late fusion in semantic video analysis. In ACM MM,
pages 399–402, 2005. DOI: 10.1145/1101149.1101236 25
[147] A. J. Smola and B. Schölkopf. A tutorial on support vector regression. Statistics and
Computing, 14(3):199–222, 2004. DOI: 10.1023/b:stco.0000035301.49549.88 54
[148] C. G. Snoek, M. Worring, and A. W. Smeulders. Early vs. late fusion in semantic video
analysis. In Proc. of the ACM International Conference on Multimedia, pages 399–402,
2005. DOI: 10.1145/1101149.1101236 59
[149] X. Song, L. Nie, L. Zhang, M. Akbari, and T.-S. Chua. Multiple social network learning
and its application in volunteerism tendency prediction. In Proc. of ACM SIGIR Conference
on Research and Development in Information Retrieval, pages 213–222, 2015. DOI:
10.1145/2766462.2767726 20, 34, 54, 55, 111
[150] X. Song, L. Nie, L. Zhang, M. Liu, and T.-S. Chua. Interest inference via
structure-constrained multi-source multi-task learning. In Proc. of the International Joint
Conference on Artificial Intelligence, pages 2371–2377, AAAI Press, 2015. 26, 67
[151] G. Szabo and B. A. Huberman. Predicting the popularity of online content.
Communications of the ACM, 53(8):80–88, 2010. DOI: 10.2139/ssrn.1295610 42
[152] J. Tang and K. Wang. Personalized top-n sequential recommendation via convolutional
sequence embedding. In Proc. of the ACM International Conference on Web Search and Data
Mining, pages 565–573, 2018. DOI: 10.1145/3159652.3159656 126, 127
[153] K. Tang, M. Paluri, L. Fei-Fei, R. Fergus, and L. Bourdev. Improving image
classification with location context. In IEEE ICCV, pages 1008–1016, 2015. DOI:
10.1109/iccv.2015.121 61
[154] G.-B. Huang, Q.-Y. Zhu, and C.-K. Siew. Extreme learning machine: Theory and
applications. Neurocomputing, 70(1):489–501, 2006. DOI: 10.1016/j.neucom.2005.12.126
55
[155] L. C. Totti, F. A. Costa, S. Avila, E. Valle, W. Meira, and V. Almeida. The impact
of visual attributes on online image diffusion. In Proc. of ACM Web Science Conference,
pages 42–51, 2014. DOI: 10.1145/2615569.2615700 24
[156] T. Trzcinski and P. Rokita. Predicting popularity of online videos using support vector
regression. ArXiv Preprint ArXiv:1510.06223, 2015. 24, 42
[157] T. X. Tuan and T. M. Phuong. 3D convolutional networks for session-based
recommendation with content features. In Proc. of ACM International Conference on
Recommender Systems, pages 138–146, 2017. DOI: 10.1145/3109859.3109900 126, 127
[158] M. Vasconcelos, J. M. Almeida, and M. A. Gonçalves. Predicting the popularity of
micro-reviews: A foursquare case study. Information Sciences, 325:355–374, 2015. DOI:
10.1016/j.ins.2015.07.001 24
[159] D. Wang, S. C. H. Hoi, Y. He, J. Zhu, T. Mei, and J. Luo. Retrieval-based face
annotation by weak label regularized local coordinate coding. TPAMI, 36(3):1–14, 2013.
DOI: 10.1145/2072298.2072345 97
[160] D. Wang, X. Zhang, M. Fan, and X. Ye. Semi-supervised dictionary learning via
structural sparse preserving. In National Conference of the American Association for Artificial
Intelligence, pages 2137–2144, 2016. 62
[161] K. Wang, R. He, L. Wang, W. Wang, and T. Tan. Joint feature selection and
subspace learning for cross-modal retrieval. TPAMI, 38(10):2010–2024, 2016. DOI:
10.1109/tpami.2015.2505311 26
[162] M. Wang, X. Hua, R. Hong, J. Tang, G. Qi, and Y. Song. Unified video annotation
via multigraph learning. IEEE Transactions on Circuits and Systems for Video Technology,
19(5):733–746, 2009. DOI: 10.1109/tcsvt.2009.2017400 19
[163] M. Wang, H. Li, D. Tao, K. Lu, and X. Wu. Multimodal graph-based reranking for web
image search. IEEE Transactions on Image Processing, 21(11):4649–4661, 2012. DOI:
10.1109/tip.2012.2207397 19
[164] M. Wang, X. Liu, and X. Wu. Visual classification by ℓ1-hypergraph modeling.
IEEE Transactions on Knowledge and Data Engineering, 27(9):2564–2574, 2015. DOI:
10.1109/tkde.2015.2415497 30
[165] S. Wang, L. Zhang, Y. Liang, and Q. Pan. Semi-coupled dictionary learning with
applications to image super-resolution and photo-sketch synthesis. In IEEE Conference
on Computer Vision and Pattern Recognition, pages 2216–2223, 2012. DOI:
10.1109/cvpr.2012.6247930 62
[166] X. Wang, L. Nie, X. Song, D. Zhang, and T. S. Chua. Unifying virtual and physical
worlds: Learning toward local and global consistency. TOIS, 36(1):1–26, 2017. DOI:
10.1145/3052774 92
[167] Y. Wang, S. Rawat, and F. Metze. Exploring audio semantic concepts for
event-based video retrieval. In IEEE ICASSP, pages 1360–1364, 2014. DOI:
10.1109/icassp.2014.6853819 110
[168] M. White, Y. Yu, X. Zhang, and D. Schuurmans. Convex multi-view subspace learning.
In NIPS, pages 1673–1681, 2012. 63
[169] S. Wold, K. Esbensen, and P. Geladi. Principal component analysis. Chemometrics and
Intelligent Laboratory Systems, 2(1–3):37–52, 1987. DOI:
10.1016/0169-7439(87)80084-9 16