5.5. DEEP MULTI-MODAL TRANSFER LEARNING 115
Formally, we denote $(\mathbf{x}_i, \mathbf{x}_j)$ as the pair of the $i$-th and $j$-th samples, and define a pairwise class indicator as
$$
\ell_{ij} =
\begin{cases}
+1, & \text{if } \mathbf{x}_i \text{ and } \mathbf{x}_j \text{ are with the same label;} \\
-1, & \text{otherwise.}
\end{cases}
\tag{5.3}
$$
To encode the similarity preservation, we minimize the cross-entropy loss of classifying all the pairs into a label $\ell_{ij}$,
$$
-\sum_{i,j=1}^{N} \Big[ \mathbb{I}(\ell_{ij} = 1)\,\log \sigma\big(\mathbf{a}_i^{\top}\mathbf{a}_j\big) + \mathbb{I}(\ell_{ij} = -1)\,\log \sigma\big(-\mathbf{a}_i^{\top}\mathbf{a}_j\big) \Big],
\tag{5.4}
$$
where $\mathbb{I}(\cdot)$ is a binary indicator function that outputs 1 when the argument is true, otherwise 0; and $\sigma(\cdot)$ is the sigmoid function. We can equivalently rewrite the above equation as
$$
\mathcal{J}_2 = -\sum_{i=1}^{N} \sum_{j=1}^{N} \log \sigma\big(\ell_{ij}\,\mathbf{a}_i^{\top}\mathbf{a}_j\big).
\tag{5.5}
$$
Directly optimizing Eq. (5.5) is very time-consuming due to the huge number of instance pairs, i.e., $O(N^2)$ w.r.t. $N$ samples.
To reduce the computing load, we turn to negative sampling [115]. In particular, for a given micro-video sample $\mathbf{x}$, we sample $S$ positive and $S$ negative micro-videos from $\mathbf{x}$'s own category and the other categories, respectively, following a distribution $p(\mathbf{x}_i, \mathbf{x}_j, \ell_{ij})$. Formally, we uniformly sample the first instance $\mathbf{x}_i$. We then sample the second instance $\mathbf{x}_j$ with a probability $s_{ij}$ that represents the geometric closeness between $\mathbf{x}_i$ and $\mathbf{x}_j$. We calculate $s_{ij}$ with the radial basis function kernel as
$$
s_{ij} = \frac{1}{|\mathcal{M}|} \sum_{m \in \mathcal{M}} \exp\left( -\frac{\big\| \mathbf{x}_i^m - \mathbf{x}_j^m \big\|^2}{\delta_m^2} \right),
\tag{5.6}
$$
where $\delta_m^2$ is a radius parameter that is set as the median of the Euclidean distances of all samples on the modality $m$.
5.5.3 DEEP NETWORK FOR VENUE ESTIMATION
After obtaining the multi-modal representations, we add a stack of fully connected layers, which
enables us to capture the nonlinear and complex interactions between the visual, acoustic, and