Example: marketing

Semi-Supervised Learning Tutorial

Semi-Supervised Learning Tutorial Xiaojin Zhu Department of Computer Sciences University of Wisconsin, Madison, USA. ICML 2007. Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 1 / 135. Outline 1 Introduction to Semi-Supervised Learning 2 Semi-Supervised Learning Algorithms Self Training Generative Models S3 VMs Graph-Based Algorithms Multiview Algorithms 3 Semi-Supervised Learning in Nature 4 Some Challenges for Future Research Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 2 / 135. Introduction to Semi-Supervised Learning Outline 1 Introduction to Semi-Supervised Learning 2 Semi-Supervised Learning Algorithms Self Training Generative Models S3 VMs Graph-Based Algorithms Multiview Algorithms 3 Semi-Supervised Learning in Nature 4 Some Challenges for Future Research Xiaojin Zhu (Univ.)

Semi-Supervised Learning Tutorial Xiaojin Zhu Department of Computer Sciences University of Wisconsin, Madison, USA ICML 2007 Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 1 / 135

Tags:

  Learning, Semi, Supervised, Semi supervised learning

Information

Domain:

Source:

Link to this page:

Please notify us if you found a problem with this document:

Other abuse

Transcription of Semi-Supervised Learning Tutorial

1 Semi-Supervised Learning Tutorial Xiaojin Zhu Department of Computer Sciences University of Wisconsin, Madison, USA. ICML 2007. Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 1 / 135. Outline 1 Introduction to Semi-Supervised Learning 2 Semi-Supervised Learning Algorithms Self Training Generative Models S3 VMs Graph-Based Algorithms Multiview Algorithms 3 Semi-Supervised Learning in Nature 4 Some Challenges for Future Research Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 2 / 135. Introduction to Semi-Supervised Learning Outline 1 Introduction to Semi-Supervised Learning 2 Semi-Supervised Learning Algorithms Self Training Generative Models S3 VMs Graph-Based Algorithms Multiview Algorithms 3 Semi-Supervised Learning in Nature 4 Some Challenges for Future Research Xiaojin Zhu (Univ.)

2 Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 3 / 135. Introduction to Semi-Supervised Learning Disclaimer This Tutorial reflects my subjective opinions. Many work cannot be included. Thank Olivier Chapelle for some of the S3VM figures. Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 4 / 135. Introduction to Semi-Supervised Learning Why bother? Because people want better performance for free. the traditional view unlabeled data is cheap labeled data can be hard to get Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 5 / 135.

3 Introduction to Semi-Supervised Learning Why bother? Because people want better performance for free. the traditional view unlabeled data is cheap labeled data can be hard to get I human annotation is boring Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 5 / 135. Introduction to Semi-Supervised Learning Why bother? Because people want better performance for free. the traditional view unlabeled data is cheap labeled data can be hard to get I human annotation is boring I labels may require experts Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 5 / 135.

4 Introduction to Semi-Supervised Learning Why bother? Because people want better performance for free. the traditional view unlabeled data is cheap labeled data can be hard to get I human annotation is boring I labels may require experts I labels may require special devices Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 5 / 135. Introduction to Semi-Supervised Learning Why bother? Because people want better performance for free. the traditional view unlabeled data is cheap labeled data can be hard to get I human annotation is boring I labels may require experts I labels may require special devices I your graduate student is on vacation Xiaojin Zhu (Univ.)

5 Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 5 / 135. Introduction to Semi-Supervised Learning Example of hard-to-get labels Task: speech analysis Switchboard dataset telephone conversation transcription 400 hours annotation time for each hour of speech film f ih n uh gl n m be all bcl b iy iy tr ao tr ao l dl Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 6 / 135. Introduction to Semi-Supervised Learning Another example of hard-to-get labels Task: natural language parsing Penn Chinese Treebank 2 years for 4000 sentences The National Track and Field Championship has finished.

6 Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 7 / 135. Introduction to Semi-Supervised Learning Example of not-so-hard-to-get labels a little secret For some tasks, it may not be too difficult to label 1000+ instances. Task: image categorization of eclipse . Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 8 / 135. Introduction to Semi-Supervised Learning Example of not-so-hard-to-get labels There are ways like the ESP game ( ) to encourage human computation for more labels. Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 9 / 135.

7 Introduction to Semi-Supervised Learning Example of not-so-hard-to-get labels In this Tutorial we will learn how to use unlabeled data to improve classification. Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 10 / 135. Introduction to Semi-Supervised Learning The Learning Problem Goal Using both labeled and unlabeled data to build better learners, than using each one alone. Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 11 / 135. Introduction to Semi-Supervised Learning Notations input instance x, label y learner f : X 7 Y. labeled data (Xl , Yl ) = {(x1:l , y1:l )}.

8 Unlabeled data Xu = {xl+1:n }, available during training usually l n test data Xtest = {xn+1: }, not available during training Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 12 / 135. Introduction to Semi-Supervised Learning Semi-Supervised vs. transductive Learning labeled data (Xl , Yl ) = {(x1:l , y1:l )}. unlabeled data Xu = {xl+1:n }, available during training test data Xtest = {xn+1: }, not available during training Semi-Supervised Learning Transductive Learning is ultimately applied to the test is only concerned with the data (inductive). unlabeled data. Xiaojin Zhu (Univ.)

9 Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 13 / 135. Introduction to Semi-Supervised Learning Why the name supervised Learning (classification, regression) {(x1:n , y1:n )}. l Semi-Supervised classification/regression {(x1:l , y1:l ), xl+1:n , xtest }. transductive classification/regression {(x1:l , y1:l ), xl+1:n }. l Semi-Supervised clustering {x1:n , must-, cannot-links}. l unsupervised Learning (clustering) {x1:n }. Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 14 / 135. Introduction to Semi-Supervised Learning Why the name supervised Learning (classification, regression) {(x1:n , y1:n )}.

10 L Semi-Supervised classification/regression {(x1:l , y1:l ), xl+1:n , xtest }. transductive classification/regression {(x1:l , y1:l ), xl+1:n }. l Semi-Supervised clustering {x1:n , must-, cannot-links}. l unsupervised Learning (clustering) {x1:n }. We will mainly discuss Semi-Supervised classification. Xiaojin Zhu (Univ. Wisconsin, Madison) Semi-Supervised Learning Tutorial ICML 2007 14 / 135. Introduction to Semi-Supervised Learning How can unlabeled data ever help? 1 2 labeled data decision boundary (labeled). unlabeled data decision boundary (labeled and unlabeled).. 1 2. 1 0 1 x assuming each class is a coherent group ( Gaussian).


Related search queries