consider the squared loss over the N unlabeled samples to guarantee the learning performance. We ultimately reach our objective function as
$$
\min_{f,\, L(S^0)} \; \sum_{i=1}^{N} \left( y_i - f_i \right)^2 + \lambda\, f^{T} L(S^0)\, f + \mu \sum_{k=1}^{K} \left\| \frac{1}{\mathrm{tr}\left(L(S^0)\right)} L(S^0) - \frac{1}{\mathrm{tr}\left(L(S^k)\right)} L(S^k) \right\|_{F}^{2}, \tag{3.8}
$$
where $\lambda$ and $\mu$ are both nonnegative regularization parameters. To be more specific, $\mu$ penalizes the disagreement between the latent space and the modalities, and $\lambda$ encourages similar popularity to be assigned to similar micro-videos.
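As a concrete illustration, the following is a minimal NumPy sketch that evaluates the objective in Eq. (3.8) for given predictions. The function name `objective` and the argument names `lam` and `mu` (standing for $\lambda$ and $\mu$) are our own illustrative choices, not part of the original formulation.

```python
import numpy as np

def objective(f, y, L0, Lks, lam, mu):
    """Evaluate Eq. (3.8): squared loss + smoothness + modality agreement.

    f   : (N,) predicted popularity scores
    y   : (N,) observed popularity scores
    L0  : (N, N) Laplacian of the latent common space, L(S^0)
    Lks : list of K (N, N) modality Laplacians, L(S^k)
    lam, mu : nonnegative regularization parameters
    """
    loss = np.sum((y - f) ** 2)            # squared loss over the N samples
    smooth = lam * f @ L0 @ f              # f^T L(S^0) f: similar videos get similar popularity
    L0_norm = L0 / np.trace(L0)            # trace-normalized latent Laplacian
    agree = mu * sum(
        np.linalg.norm(L0_norm - Lk / np.trace(Lk), "fro") ** 2
        for Lk in Lks                      # disagreement with each modality Laplacian
    )
    return loss + smooth + agree
```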
3.6.2 OPTIMIZATION
To simplify the representation, we first define

$$
\begin{cases}
\tilde{L} = \dfrac{1}{\mathrm{tr}\left(L(S^0)\right)}\, L(S^0), \\[2mm]
\tilde{L}_k = \dfrac{1}{\mathrm{tr}\left(L(S^k)\right)}\, L(S^k).
\end{cases} \tag{3.9}
$$
Therefore, the objective function can be transformed to
$$
\min_{f} \; \sum_{i=1}^{N} \left( y_i - f_i \right)^2 + \mu \sum_{k=1}^{K} \left\| \tilde{L} - \tilde{L}_k \right\|_{F}^{2} + \lambda\, f^{T} \tilde{L} f, \quad \text{s.t.} \;\; \mathrm{tr}\left(L(S^0)\right) = 1. \tag{3.10}
$$
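Note that for a fixed $\tilde{L}$, Eq. (3.10) is quadratic in $f$: the Frobenius term does not involve $f$, so setting the gradient of the remaining terms to zero gives $(I + \lambda \tilde{L}) f = y$ when $\tilde{L}$ is symmetric. Below is a minimal sketch of this update step, under the symbol conventions of Eq. (3.8); the helper name `solve_f` is ours.

```python
import numpy as np

def solve_f(y, L_tilde, lam):
    """Closed-form f update for Eq. (3.10) with L-tilde held fixed.

    Setting the gradient of sum_i (y_i - f_i)^2 + lam * f^T L f to zero
    gives (I + lam * L) f = y, assuming L is symmetric.
    """
    N = y.shape[0]
    return np.linalg.solve(np.eye(N) + lam * L_tilde, y)
```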
Furthermore, to optimize $\tilde{L}$ more efficiently, inspired by the property that $\mathrm{tr}(\tilde{L}_k) = 1$, we let

$$
L(S^0) = \sum_{k=1}^{K} \beta_k \tilde{L}_k, \quad \text{s.t.} \;\; \sum_{k=1}^{K} \beta_k = 1. \tag{3.11}
$$
Consequently, we have

$$
\tilde{L} = \frac{1}{\mathrm{tr}\left(L(S^0)\right)}\, L(S^0) = \sum_{k=1}^{K} \beta_k \tilde{L}_k, \quad \text{s.t.} \;\; \sum_{k=1}^{K} \beta_k = 1. \tag{3.12}
$$
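Since each $\tilde{L}_k$ has unit trace and the $\beta_k$ sum to one, any combination of the form in Eq. (3.12) automatically satisfies $\mathrm{tr}(\tilde{L}) = 1$. A small sketch of this construction follows (all names are our own):

```python
import numpy as np

def combine_laplacians(betas, Lks_norm):
    """Build L-tilde = sum_k beta_k * L-tilde_k as in Eq. (3.12).

    betas    : (K,) mixing coefficients summing to one (may be negative)
    Lks_norm : list of K trace-normalized modality Laplacians
    """
    assert np.isclose(np.sum(betas), 1.0), "betas must sum to one"
    L_tilde = sum(b * Lk for b, Lk in zip(betas, Lks_norm))
    # tr(L_tilde) = sum_k beta_k * tr(L_tilde_k) = sum_k beta_k = 1
    return L_tilde
```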
Interestingly, we find that $\beta_k$ can be treated as the co-related degree between the latent common space and each modality. It is worth noting that we do not impose the constraint $\beta_k \geq 0$, since we want to keep both positive and negative co-relations. A positive coefficient indicates a positive correlation between the modality space and the latent common space, while a negative coefficient reflects a negative correlation, which may be due to noisy data in the modality. The larger $\beta_k$ is, the higher the correlation between the latent space and the $k$-th modality will be.