Overview of Zero-Shot learning


title: SSL Review -2
date: 2019-03-10 10:10:25
tags: SSL
mathjax: true

Overview of Zero-Shot learning

  • the seen classes

    • the classes covered by labeled training instances in the feauture space
  • the unseen classes

    • unlabeled testing instances in the feature space which belong to another set of classes
  • the feature space

    • a real number space
    • each instance is represented as a vector within it
    • each instance is usually assumed to belong to one class
  • definition

    • the Set of Seen Classes:
      S = { c i s i = 1 , , N s } S=\{c_{i}^{s}\|i=1,……,N_{s}\}
    • a seen classes: c i j c_{i}^{j}
    • the Set of Unseen Classes:
      U = { c i u i = 1 , N u } U=\{c_{i}^{u}\|i=1,N_{u}\}
    • an unseen class: C i u C_{i}^{u}
    • Denote that : S U = {S}\bigcap{U}=\varnothing
    • The Feature Space: X X , which is D d i m e n s i o n a l D-dimensional : R D R^{D}
    • The set of labeled training instances belonging to seen classes:

    D t r = { ( x i t r , y i t r ) X × S } D^{tr}=\{(x_{i}^{tr},y_{i}^{tr})\in X\times S\}

    • for each labeled instance ( x i t r , y i t r ) (x_i^{tr},y_i^{tr}) ,
      • x i t r x_i^{tr} :the instance in the feature space ,
      • y i t r y_i^{tr} : the corresponding class label .
    • The set of testing instances:
      X t e = { x i t e X } i = 1 N t e X^{te}=\{x_i^{te}\in X\}_{i=1}^{N_{te}}
      where each x i t e x_i^{te} is a testing instance in the feature space.
    • The corresponding class labels for X t e X^{te} :
      Y t e = { y i t e U } i = 1 N t e Y^{te}=\{y_i^{te}\in U\}_{i=1}^{N_{te}}
      which are able to be predicted.

Definition 1.1 (Zero-Shot Learning)

  • Given labeled training instances D t r D^{tr} belonging to the seen classes S S ,zero-shot learning aims to learn a classifier f u ( ) : X U f^u(\cdot):X\rightarrow U that can classify testing instances X t e X^{te} (i.e. to predict Y t e Y^{te} ) belonging to the unseen classes U U .

  • General idea:
    - transfer the knowledge contained in the training instances D t r D^{tr} to the task of testing instance classfication.
    - The label spaces covered by the training and the testing instances are disjoint.
    - Zero-Shot learning is a subfield of transfer learning.

  • Transfer learning
    - knowledge contained in the source domain and source task is transferred to the target domain for learning the model in the target task.
    - Transfer learning can be classified into :
    1、homogeneous transfer learning
    >>the feature spaces and the label spaces are the same.
    2、heterogeneous transfer learning
    >>the feature spaces and/or the label spaces are different.
    - Zero-shot learning:
    >> the source feature space is the feature space of training instances.
    >> the target feature space is the feature space of testing instances.
    >> they are the same , both are X X .
    >>the source label space is the seen class set S S
    >> the target label space is the unseen class set U U
    >>Belong to heterogeneous transfer learning with different label spaces(briefly: HTL-DLS)

  • Many existing methods for HTL-DLS are proposed for problems under the setting in which there are some labeled instances for the target label space classes .

  • However in Zero-Shot learning,no labeled instances belonging to the target space classes(the unseen classes ) are available .

  • Auxiliary information for the no labeled instances problem in zero-shot learning problem.

    • Should contain information about all of the unseen classes.
    • This is to guarantee that each of the unseen classes is provided with corresponding auxiliary information.
    • Should be related to the instances in the feature space.
    • This is to guarantee that the auxiliary information is usable.
  • Existing works.

    • The approach to involve auxiliary information is inspired by the way human beings recognize the world.

    • Humans can perform zero-shot learning with the help of some semantic background knowledge.

      -for example: with the knowldge that" a zebra looks like a horse, and with stripes."
      we can recognize a zebra even without having seen one before, as long as we know what a horse looks like and what the pattern “stripe” looks like.

    • The Auxiliary information involved by existing zero-shot learning methods is usually some Semantic information.

    • It forms a space that contains both the seen and the unseen classes(Refer to Semantic Space)

    • Semantic Space:

      • similar to the feature space, is usually a real number space
      • each class has a corresponding vector representation , (refered to as the Class prototype or prototype for short of this class)
      • We denote Γ \Gamma as the Semantic Space :
        • Γ \mathcal{\Gamma} is M-dimensional,usually R M \mathbb{R}^{M} .
        • t i s Γ t_i^s\in\Gamma as the class prototypes for seen class c i s c_i^s .
        • t i u Γ t_i^u\in\Gamma as the class prptotype for unseen class c i u c_i^u
        • Denote T s = { t i s } i = 1 N s T^s=\{t_i^s\}_{i=1}^{N_s} as the set of protptypes for seen classes
        • Denote T u = { t i u } i = 1 N u T^u=\{t_i^u\}_{i=1}^{N_u} as the set of prototypes for unseen class
        • Denote Π ( ) : S U Γ \Pi(\cdot):S \cup U\rightarrow \Gamma as a class prototyping function that takes a class label as input, and outputs the corresponding class prototype.
      • In zero-shot learning , D t r D^{tr} T s T^s T u T^u involved in obtaining the zero-shot classifier f u ( ) f^u(\cdot) .
        Key notations

猜你喜欢

转载自blog.csdn.net/yinxian9019/article/details/88388830