[4] Oriol Vinyals and Alexander Toshev and Samy Bengio and Dumitru Erhan (2014). "Show and Tell: {A} Neural Image Caption Generator". CoRR. URL: https://arxiv.org/pdf/1411.4555v2.pdf
[5] Kyunghyun Cho et al. (2014). "Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation". CoRR. URL: https://arxiv.org/pdf/1406.1078v3.pdf
[38] Graves A, Schmidhuber J. (2005) "Framewise phoneme classification with bidirectional LSTM and other neural network architectures.". URL: https://www.cs.toronto.edu/~graves/nn_2005.pdf
[39] Alex Graves et al. (2006). "Connectionist Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural Networks". URL: http://www.cs.toronto.edu/~graves/icml_2006.pdf