PDF4PRO ^⚡AMP

Modern search engine that looking for books and documents around the web

Dark knowledge - TTIC

dark knowledge Geoffrey Hinton, Oriol Vinyals & Jeff Dean Google Inc. The conflicting constraints of learning and using The easiest way to extract a lot of knowledge from the training data is to learn many different models in parallel. We want to make the models as different as possible to minimize the correlations between their errors. We can use different initializations or different architectures or different subsets of the training data. It is helpful to over-fit the individual models. A test time we average the predictions of all the models or of a selected subset of good models that make different errors. That s how almost all ML competitions are won ( Netflix) Why ensembles are bad at test time A big ensemble is highly redundant. It has very very little knowledge per parameter. At test time we want to minimize the amount of computation and the memory footprint.

The conflicting constraints of learning and using • The easiest way to extract a lot of knowledge from the training data is to learn many different models in parallel.

Fullscreen Download

Tags:

Learning, Knowledge, Dark, Dark knowledge

Information

Domain:

Source:

Link to this page:

Please notify us if you found a problem with this document:

Spam in document Broken preview Other abuse

Transcription of Dark knowledge - TTIC

Related search queries

Chapter 3 Applying Learning Theories to Margaret, Learning, Transfer, Application for Credit Transfer through, Application for Credit Transfer through Credit for Recognised Learning, ECTS Users’ Guide, European Commission, And transfer, Transfer Learning, Multimedia learning, Student-Centered Learning, Learning in the Digital Age, FACULTY CREDENTIALS

Dark knowledge - TTIC

Tags:

Information

Related documents

Related search queries