where $y_n$ denotes the category label of the sample $\mathbf{x}_n$. We ultimately reach the following objective function:

$$
\min_{\mathbf{A},\,\mathbf{Q}} \;\; \frac{1}{2}\sum_{n=1}^{N}\big\|\mathbf{x}_n-\mathbf{D}\mathbf{a}_n\big\|_F^2
+\frac{\lambda}{2}\sum_{n=1}^{N}\big\|\mathbf{a}_n\big\|_F^2
+\frac{\gamma}{2}\sum_{n=1}^{N}(\mathbf{a}_n)^{\top}\mathbf{Q}_n\mathbf{a}_n. \qquad (4.27)
$$
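For concreteness, the objective in Eq. (4.27) can be evaluated with a few lines of NumPy. This is only an illustrative sketch: the function name is ours, $\lambda$ and $\gamma$ denote the two trade-off parameters, and the matrices $\mathbf{Q}_n$ are assumed to be supplied (their construction follows Eqs. (4.24) and (4.26)).

```python
import numpy as np

def objective(X, D, A, Q, lam, gamma):
    """Evaluate Eq. (4.27).

    X: (d, N) samples, D: (d, K) dictionary, A: (K, N) codes,
    Q: length-N list of (K, K) tree-guided matrices Q_n.
    """
    recon = 0.5 * np.sum((X - D @ A) ** 2)              # 1/2 sum ||x_n - D a_n||^2
    ridge = 0.5 * lam * np.sum(A ** 2)                  # lambda/2 sum ||a_n||^2
    tree = 0.5 * gamma * sum(A[:, n] @ Q[n] @ A[:, n]   # gamma/2 sum a_n^T Q_n a_n
                             for n in range(A.shape[1]))
    return recon + ridge + tree
```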
The alternating optimization strategy is applicable here. By fixing $\mathbf{Q}_n$, taking the derivative of the above formulation with respect to $\mathbf{a}_n$, and setting it to zero, we reach

$$
\begin{cases}
-\mathbf{D}^{\top}(\mathbf{x}_n-\mathbf{D}\mathbf{a}_n)+\lambda\mathbf{a}_n+\gamma\mathbf{Q}_n\mathbf{a}_n=\mathbf{0},\\[2pt]
\big(\mathbf{D}^{\top}\mathbf{D}+\lambda\mathbf{I}+\gamma\mathbf{Q}_n\big)\,\mathbf{a}_n=\mathbf{D}^{\top}\mathbf{x}_n,\\[2pt]
\mathbf{a}_n=\big(\mathbf{D}^{\top}\mathbf{D}+\lambda\mathbf{I}+\gamma\mathbf{Q}_n\big)^{-1}\mathbf{D}^{\top}\mathbf{x}_n.
\end{cases} \qquad (4.28)
$$

Once we obtain all the $\mathbf{a}_n$, we can easily compute $\mathbf{Q}_n$ based on Eqs. (4.24) and (4.26).
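The closed-form solution in Eq. (4.28) amounts to solving a small linear system per sample. Below is a minimal sketch, assuming $\mathbf{D}$, $\mathbf{Q}_n$, and the trade-off parameters are given (the function name is ours).

```python
import numpy as np

def sparse_code(x_n, D, Q_n, lam, gamma):
    """Eq. (4.28): a_n = (D^T D + lam*I + gamma*Q_n)^{-1} D^T x_n."""
    K = D.shape[1]                                   # number of dictionary atoms
    lhs = D.T @ D + lam * np.eye(K) + gamma * Q_n
    # Solving the system is cheaper and more stable than forming the inverse.
    return np.linalg.solve(lhs, D.T @ x_n)
```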
Computing D with A fixed: Fixing $\mathbf{A}$ and denoting the objective in Eq. (4.27) by $\Gamma$, we take the derivative of $\Gamma$ with respect to $\mathbf{D}$ and have

$$
\frac{\partial\Gamma}{\partial\mathbf{D}}=(\mathbf{D}\mathbf{A}-\mathbf{X})\mathbf{A}^{\top}. \qquad (4.29)
$$

By setting Eq. (4.29) to zero, it can be derived that

$$
\begin{cases}
\mathbf{D}\mathbf{A}\mathbf{A}^{\top}-\mathbf{X}\mathbf{A}^{\top}=\mathbf{0},\\[2pt]
\mathbf{D}=\mathbf{X}\mathbf{A}^{\top}\big(\mathbf{A}\mathbf{A}^{\top}\big)^{-1}.
\end{cases} \qquad (4.30)
$$
It is straightforward to see that the above algorithm converges: the objective $\Gamma$ is bounded below and decreases in each iteration, as shown in Figure 4.6. By using Algorithm 4.2, we can learn a set of dictionaries $\mathbf{D}_m$ for each modality of samples $\mathbf{X}_m$, together with their corresponding representations $\mathbf{A}_m$.
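The whole alternating procedure can be summarized as follows. This is only a sketch in the spirit of Algorithm 4.2, not the algorithm verbatim: compute_Q stands in for the construction of $\mathbf{Q}_n$ in Eqs. (4.24) and (4.26), which is not reproduced in this excerpt, the random initialization is our choice, and the small ridge added before the inversion in Eq. (4.30) is a numerical safeguard rather than part of the formulation.

```python
import numpy as np

def learn_dictionary(X, K, lam, gamma, compute_Q, n_iter=10, seed=0):
    """Alternate the code update of Eq. (4.28) and the dictionary update of Eq. (4.30)."""
    d, N = X.shape
    rng = np.random.default_rng(seed)
    D = rng.standard_normal((d, K))
    A = np.zeros((K, N))
    for _ in range(n_iter):
        # Fix D and Q_n, update every code a_n via Eq. (4.28).
        for n in range(N):
            Q_n = compute_Q(A[:, n])
            lhs = D.T @ D + lam * np.eye(K) + gamma * Q_n
            A[:, n] = np.linalg.solve(lhs, D.T @ X[:, n])
        # Fix A, update the dictionary via Eq. (4.30); the ridge keeps A A^T invertible.
        D = X @ A.T @ np.linalg.inv(A @ A.T + 1e-8 * np.eye(K))
    return D, A
```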
4.5.4 ONLINE LEARNING
As analyzed in the introduction, the efficient operation and incremental learning of micro-videos deserve our attention. To accomplish this, we present an online learning algorithm (referred to as Algorithm 4.3). Generally speaking, if an incoming sample is labeled, we leverage it to strengthen the dictionary learning. We treat the $\mathbf{D}$ learned over the initial training data as $\mathbf{D}^{(0)}$ and update it to $\mathbf{D}^{(t)}$ at the current time $t$. Otherwise, we compute the sample's sparse representation based on the current dictionaries and classify it into the right venue category.
An Incoming Labeled Sample: At the $t$-th online update, a new sample $\mathbf{x}_t$ with a label $y_t$ is given. We can know which leaf node this micro-video belongs to and then use it to update the dictionaries $\mathbf{D}^{(t-1)}$. From Eq. (4.30), we find that the solution of $\mathbf{D}^{(t)}$ relies on the sparse representation $\mathbf{A}^{(t)}=[\mathbf{A}^{(t-1)},\mathbf{a}_t]$. We thus need to first compute $\mathbf{a}_t$, the representation vector of $\mathbf{x}_t$. However, Eq. (4.28) tells us that $\mathbf{a}_t$ is related to $\mathbf{Q}_t$, which is computed from $\mathbf{A}^{(t)}=[\mathbf{A}^{(t-1)},\mathbf{a}_t]$.
Figure 4.6: Example of the convergence of Algorithm 4.2 (objective value, on a logarithmic scale, over ten iterations).
To address this problem, we first initialize $\mathbf{a}_t$ to obtain a temporary $\mathbf{A}^{(t)}=[\mathbf{A}^{(t-1)},\mathbf{a}_t]$, and then use Eq. (4.26) to compute $\mathbf{Q}_t$. Afterward, we use Eq. (4.28) to compute $\mathbf{a}_t$ with $\mathbf{D}^{(t-1)}$ as the dictionary. We repeat this procedure until we obtain a stable $\mathbf{A}^{(t)}$ for the sample $\mathbf{x}_t$.
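This alternation between $\mathbf{Q}_t$ and $\mathbf{a}_t$ is a short fixed-point loop. The sketch below is illustrative only: compute_Q again stands in for Eq. (4.26), and the tolerance and iteration cap are our assumptions.

```python
import numpy as np

def online_sparse_code(x_t, D_prev, lam, gamma, compute_Q, tol=1e-6, max_iter=50):
    """Alternate Eq. (4.26) and Eq. (4.28) until a_t stabilizes (D^{(t-1)} fixed)."""
    K = D_prev.shape[1]
    a_t = np.zeros(K)                                  # initial guess for a_t
    for _ in range(max_iter):
        Q_t = compute_Q(a_t)                           # Eq. (4.26)
        lhs = D_prev.T @ D_prev + lam * np.eye(K) + gamma * Q_t
        a_new = np.linalg.solve(lhs, D_prev.T @ x_t)   # Eq. (4.28)
        if np.linalg.norm(a_new - a_t) < tol:
            return a_new
        a_t = a_new
    return a_t
```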
To estimate $\mathbf{D}^{(t)}$ with $\mathbf{A}^{(t)}$ fixed, we adopt a procedure similar to the one introduced in [109]. In particular, we sequentially update each column of $\mathbf{D}^{(t)}$. We take the $j$-th column as an example to illustrate the procedure.
We define $\mathbf{d}_j(t)$ as the $j$-th column of $\mathbf{D}^{(t)}$ and set

$$
g\big(\mathbf{D}^{(t)}\big)=\frac{1}{2}\sum_{i=1}^{t}\big\|\mathbf{x}_i-\mathbf{D}^{(t)}\mathbf{a}_i\big\|_2^2. \qquad (4.31)
$$
We then set $\nabla_{\mathbf{d}_j(t)}\, g\big(\mathbf{D}^{(t)}\big)$ to zero, and obtain

$$
\mathbf{d}_j(t)=\frac{\sum_{i=1}^{t} a_{ij}^{\top}\big(\mathbf{x}_i-\widetilde{\mathbf{D}}\widetilde{\mathbf{a}}_i\big)}{\sum_{i=1}^{t} a_{ij}^{\top}a_{ij}}, \qquad (4.32)
$$
where $a_{ij}$ is the $j$-th entry of $\mathbf{a}_i$, $\widetilde{\mathbf{D}}=\mathbf{D}^{(t-1)}\setminus\{\mathbf{d}_j(t-1)\}$ is the dictionary excluding the $j$-th atom, and $\widetilde{\mathbf{a}}_i=\mathbf{a}_i\setminus\{a_{ij}\}$ collects the coefficients corresponding to the atoms of $\widetilde{\mathbf{D}}$.
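A direct implementation of the column update in Eq. (4.32) looks as follows. The names are ours, the samples and codes observed so far are passed in explicitly (the accumulators of Eq. (4.34) later remove this need), and the tiny denominator guard is an added safeguard for atoms that are never used.

```python
import numpy as np

def update_column(j, D_prev, X_seen, A_seen):
    """Eq. (4.32): refresh the j-th atom from the samples seen so far.

    D_prev: (d, K) dictionary D^{(t-1)}, X_seen: (d, t) samples x_1..x_t,
    A_seen: (K, t) codes a_1..a_t.
    """
    a_j = A_seen[j, :]                               # the entries a_{ij}, i = 1..t
    # x_i - D~ a~_i, i.e., residuals with the contribution of atom j removed
    residual = X_seen - D_prev @ A_seen + np.outer(D_prev[:, j], a_j)
    numerator = residual @ a_j                       # sum_i a_{ij} (x_i - D~ a~_i)
    denominator = max(a_j @ a_j, 1e-12)              # sum_i a_{ij}^2, guarded
    return numerator / denominator
```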
Since $\widetilde{\mathbf{D}}\widetilde{\mathbf{a}}_i=\mathbf{D}^{(t-1)}\mathbf{a}_i-\mathbf{d}_j(t-1)\,a_{ij}$, substituting this into Eq. (4.32) gives the additive form of the solution:

$$
\mathbf{d}_j(t)=\frac{\sum_{i=1}^{t} a_{ij}^{\top}\big(\mathbf{x}_i-\mathbf{D}^{(t-1)}\mathbf{a}_i\big)}{\sum_{i=1}^{t} a_{ij}^{\top}a_{ij}}+\mathbf{d}_j(t-1). \qquad (4.33)
$$
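As a quick sanity check on the equivalence of Eqs. (4.32) and (4.33), the snippet below evaluates both forms on random synthetic data, reusing the illustrative update_column sketched above; the two results coincide.

```python
import numpy as np

rng = np.random.default_rng(1)
d, K, t, j = 6, 4, 8, 2
X_seen = rng.standard_normal((d, t))
A_seen = rng.standard_normal((K, t))
D_prev = rng.standard_normal((d, K))

d_j_direct = update_column(j, D_prev, X_seen, A_seen)         # Eq. (4.32)

a_j = A_seen[j, :]                                            # Eq. (4.33)
d_j_additive = (X_seen - D_prev @ A_seen) @ a_j / (a_j @ a_j) + D_prev[:, j]

print(np.allclose(d_j_direct, d_j_additive))                  # True
```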
Algorithm 4.3 Our INTIMATE Algorithm
Input:
    Initial input matrices $\{\mathbf{X}_m\}_{m=1}^{M}$;
    Streaming data $\{\ldots,\mathbf{x}_t,\ldots\}$;
    Node assignments $\{\mathcal{G}_v\}_{v=1}^{V}$ with weights $\{e_v\}_{v=1}^{V}$;
    Parameters $\{\lambda,\gamma\}$;
Ensure:
    Discriminant dictionaries $\{\mathbf{D}_m\}_{m=1}^{M}$;
    Sparse codes $\{\mathbf{a}_m^{(t)}\}_{m=1}^{M}$ of $\mathbf{x}_t$ and its label;
1: Initialize $\{\mathbf{D}_m^{(0)}\}_{m=1}^{M}$ and $\{\mathbf{A}_m^{(0)}\}_{m=1}^{M}$ using Algorithm 4.2;
2: for each modality $m$ do
3:     Train the classifier $f_m$ using $\mathbf{A}_m^{(0)}$;
4: end for
5: Initialize $t \leftarrow 1$;
6: for each newly arrived sample $\mathbf{x}^{(t)}$ in the stream do
7:     if $\mathbf{x}^{(t)}$ has a label $y^{(t)}$ then
8:         for each modality $m$ do
9:             Fixing $\mathbf{D}_m^{(t-1)}$, learn $\mathbf{a}_m^{(t)}$ using Eq. (4.28);
10:            Fixing $\mathbf{A}_m^{(t-1)}$ and $\mathbf{a}_m^{(t)}$, update $\mathbf{D}_m^{(t)}$ using Eqs. (4.35) and (4.36);
11:        end for
12:    else if $\mathbf{x}^{(t)}$ is unlabeled then
13:        for each modality $m$ do
14:            Learn the representation $\mathbf{a}_m^{(t)}$ with $\mathbf{D}_m^{(t-1)}$;
15:            Leveraging $\mathbf{a}_m^{(t)}$ and $f_m$, predict its label $y_t^m$;
16:        end for
17:        Based on $\{y_t^m\}$, obtain the final label $y_t$ using Eq. (4.37);
18:    end if
19:    Update $t \leftarrow t+1$;
20: end for
21: return $\mathbf{D}_m \leftarrow \mathbf{D}_m^{(t)}$
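The per-sample dispatch of Algorithm 4.3 is easy to express in code. The sketch below is schematic for a single time step: encode wraps the representation step (e.g., online_sparse_code above), update_dictionary stands in for Eqs. (4.35) and (4.36), and fuse_labels stands in for Eq. (4.37); none of these three are spelled out verbatim in this excerpt.

```python
def process_sample(x_t, y_t, models, encode, update_dictionary, fuse_labels):
    """One step of Algorithm 4.3 over all modalities.

    x_t: dict mapping modality m to its feature vector;
    models: dict mapping m to {'D': dictionary, 'clf': classifier f_m};
    y_t: the label, or None for an unlabeled sample.
    """
    codes = {m: encode(x_t[m], mdl['D']) for m, mdl in models.items()}
    if y_t is not None:                       # labeled: strengthen the dictionaries
        for m, mdl in models.items():
            mdl['D'] = update_dictionary(mdl['D'], x_t[m], codes[m])
        return y_t
    # unlabeled: predict per modality, then fuse into the final venue category
    votes = {m: mdl['clf'].predict(codes[m]) for m, mdl in models.items()}
    return fuse_labels(votes)
```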
We set

$$
\begin{cases}
\mathbf{U}(t)=\big[\mathbf{u}_1(t),\ldots,\mathbf{u}_K(t)\big]=\mathbf{U}(t-1)+\mathbf{a}_t(\mathbf{a}_t)^{\top},\\[2pt]
\mathbf{F}(t)=\big[\mathbf{f}_1(t),\ldots,\mathbf{f}_K(t)\big]=\mathbf{F}(t-1)+\mathbf{x}_t(\mathbf{a}_t)^{\top},\\[2pt]
\mathbf{U}(0)=\sum_{n=1}^{N}\mathbf{a}_n^{(0)}\big(\mathbf{a}_n^{(0)}\big)^{\top},\\[2pt]
\mathbf{F}(0)=\sum_{n=1}^{N}\mathbf{x}_n^{(0)}\big(\mathbf{a}_n^{(0)}\big)^{\top}. \qquad (4.34)
\end{cases}
$$
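The recursions in Eq. (4.34) keep only the sufficient statistics $\mathbf{U}(t)$ and $\mathbf{F}(t)$, so no past sample has to be revisited when the dictionary is refreshed. A minimal bookkeeping sketch follows; note that the per-atom refresh in refresh_atom is our assumption in the spirit of the online scheme of [109], since Eqs. (4.35) and (4.36) lie outside this excerpt.

```python
import numpy as np

class Accumulators:
    """Running statistics of Eq. (4.34): U(t) = sum_i a_i a_i^T, F(t) = sum_i x_i a_i^T."""

    def __init__(self, X0, A0):
        self.U = A0 @ A0.T                  # U(0), a K x K matrix
        self.F = X0 @ A0.T                  # F(0), a d x K matrix

    def update(self, x_t, a_t):
        self.U += np.outer(a_t, a_t)        # U(t) = U(t-1) + a_t a_t^T
        self.F += np.outer(x_t, a_t)        # F(t) = F(t-1) + x_t a_t^T

    def refresh_atom(self, D, j):
        # Assumed column update built on U and F (not given verbatim in the text):
        # d_j <- d_j + (f_j - D u_j) / U_jj, following [109].
        u_j, f_j = self.U[:, j], self.F[:, j]
        return D[:, j] + (f_j - D @ u_j) / max(self.U[j, j], 1e-12)
```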