Probabilistic Model-Agnostic Meta-Learning (cbfinn/_files/icml2018_pgm_platipus.pdf)
TRANSCRIPT
[Page 1]
Probabilistic Model-Agnostic Meta-Learning
Chelsea Finn*, Kelvin Xu*, Sergey Levine
*equal contribution
[Page 2]
Deep learning requires a lot of data.
Few-shot learning / meta-learning: incorporate prior knowledge from previous tasks for fast learning of new tasks
Few-shot image classification: given one example of each of 5 classes, classify new examples.
[Page 3]
Ambiguity in Few-Shot Learning
Important for:
- learning to actively learn
- safety-critical few-shot learning (e.g. medical imaging)
- learning to explore in meta-RL
Can we learn to generate hypotheses about the underlying function?
[Page 4]
Background: Model-Agnostic Meta-Learning — optimize for pretrained parameters.
Key idea: train over many tasks to learn a parameter vector θ that transfers.
Meta-test time: adapt with a gradient step, φ = θ − α ∇_θ L(θ, D^tr).
MAML objective: min_θ Σ_i L(θ − α ∇_θ L(θ, D_i^tr), D_i^test)
Finn, Abbeel, Levine ICML ’17
How to reason about ambiguity?
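The MAML setup just described can be sketched in a few lines. Below is a minimal first-order NumPy illustration on toy 1-D linear-regression tasks — a sketch of the idea, not the paper's implementation (all names and hyperparameters here are illustrative):

```python
import numpy as np

# Toy MAML: tasks are 1-D linear regressions y = w * x with different slopes w.
# The model is y_hat = theta * x, so theta is a single scalar parameter.

def loss(theta, x, y):
    # Squared error of the linear model.
    return np.mean((theta * x - y) ** 2)

def grad(theta, x, y):
    # d/d_theta of the squared-error loss above.
    return np.mean(2 * (theta * x - y) * x)

def maml_step(theta, tasks, inner_lr=0.1, outer_lr=0.01):
    # One meta-update: adapt theta per task with an inner gradient step,
    # then move theta to reduce the post-adaptation (query-set) loss.
    meta_grad = 0.0
    for x_tr, y_tr, x_te, y_te in tasks:
        theta_adapted = theta - inner_lr * grad(theta, x_tr, y_tr)
        # First-order approximation: ignore second derivatives through
        # the inner step (as in first-order MAML).
        meta_grad += grad(theta_adapted, x_te, y_te)
    return theta - outer_lr * meta_grad / len(tasks)

rng = np.random.default_rng(0)
theta = 0.0
for _ in range(500):
    tasks = []
    for _ in range(4):
        w = rng.uniform(0.5, 1.5)  # each task = a slope drawn from a range
        x = rng.normal(size=10)
        tasks.append((x[:5], w * x[:5], x[5:], w * x[5:]))
    theta = maml_step(theta, tasks)
# theta ends up close to the mean task slope (~1.0): a good initialization
# from which one gradient step reaches any task in the distribution.
```

The point of the sketch: the meta-objective is evaluated *after* the inner adaptation step, so the learned θ is not the best single model but the best starting point for fast adaptation.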
[Page 5]
Modeling ambiguity
Can we sample classifiers?
Intuition of our approach: learn a prior where a random kick can put us in different modes
[Figure: sampled classifiers pick out different attribute combinations, e.g. "smiling, hat" vs. "smiling, young"]
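The "random kick" intuition above can be illustrated with a toy experiment (a hypothetical setup, not the paper's exact model): sample parameter vectors from a learned Gaussian prior, adapt each on the support set, and check how spread out the adapted hypotheses remain. Ambiguous data leaves them in different modes; informative data collapses them.

```python
import numpy as np

# Illustrative sketch: model y_hat = theta * x, prior over theta assumed
# to be a standard Gaussian that meta-training has produced.

def adapt(theta, x, y, lr=0.2, steps=20):
    # MAML-style inner-loop gradient steps on the support set.
    for _ in range(steps):
        theta = theta - lr * np.mean(2 * (theta * x - y) * x)
    return theta

rng = np.random.default_rng(0)
prior_samples = rng.normal(0.0, 1.0, size=10)  # hypothetical learned prior

# Ambiguous support set: the single point (0, 0) is fit by ANY slope.
x_ambig, y_ambig = np.array([0.0]), np.array([0.0])
# Informative support set: these points pin the slope to exactly 1.
x_clear, y_clear = np.array([1.0, 2.0]), np.array([1.0, 2.0])

spread_ambig = np.std([adapt(t, x_ambig, y_ambig) for t in prior_samples])
spread_clear = np.std([adapt(t, x_clear, y_clear) for t in prior_samples])
# Ambiguity keeps the sampled hypotheses diverse; clear data collapses them.
```

Each prior sample lands in a different hypothesis when the data cannot disambiguate — which is exactly the behavior one wants for active learning and safety-critical few-shot settings.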
[Page 6]
Meta-learning with ambiguity
[Page 7]
Sampling parameter vectors
Approximate with MAP: this is extremely crude, but extremely convenient!
Training is harder. We use amortized variational inference.
(Santos ’92, Grant et al. ICLR ’18)
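The amortized variational training objective can be sketched as a standard per-task evidence lower bound for a hierarchical model with per-task parameters $\phi_i$ (notation reconstructed here, not copied from the slides):

$$
\log p\big(\mathcal{D}_i^{\text{test}} \mid \mathcal{D}_i^{\text{tr}}\big)
\;\geq\;
\mathbb{E}_{\phi_i \sim q}\!\left[\log p\big(\mathcal{D}_i^{\text{test}} \mid \phi_i\big)\right]
\;-\;
D_{\mathrm{KL}}\!\left(q\big(\phi_i \mid \mathcal{D}_i^{\text{tr}}, \mathcal{D}_i^{\text{test}}\big)\,\big\|\,p\big(\phi_i \mid \mathcal{D}_i^{\text{tr}}\big)\right)
$$

The inference network $q$ is amortized across tasks, and the crude-but-convenient MAP step above plays the role of sampling from the prior term at meta-test time, when only $\mathcal{D}_i^{\text{tr}}$ is available.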
[Page 8]
PLATIPUS: Probabilistic LATent model for Incorporating Priors and Uncertainty in few-Shot learning
Ambiguous 5-shot regression:
Ambiguous 1-shot classification:
[Page 9]
PLATIPUS: Probabilistic LATent model for Incorporating Priors and Uncertainty in few-Shot learning
[Page 10]
Collaborators
Questions?
Sergey Levine, Kelvin Xu
Takeaways
Hybrid inference in a hierarchical Bayesian model enables scalable, uncertainty-aware meta-learning.
Handling ambiguity is particularly relevant in few-shot learning.
[Page 11]
Related concurrent work
Kim et al. “Bayesian Model-Agnostic Meta-Learning”: uses Stein variational gradient descent for sampling parameters
Gordon et al. “Decision-Theoretic Meta-Learning”: unifies a number of meta-learning algorithms under a variational inference framework