webinar - analyzing video
TRANSCRIPT
![Page 1: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/1.jpg)
Dato Confidential
Analyzing Video with GraphLab Create
June 16, 2016
Guy Rapaport, Data Scientist, Dato [email protected]
![Page 2: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/2.jpg)
Dato: We Intelligent Applications
![Page 3: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/3.jpg)
Some of our Customers
![Page 4: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/4.jpg)
Business must be intelligent
Machine learning applications
• Recommenders
• Fraud detection
• Ad targeting
• Financial models
• Personalized medicine
• Churn prediction
• Smart UX (video & text)
• Personal assistants
• IoT
• Social networks
• Log analysis
Last decade: Data management
Last 5 years: Traditional analytics
Now: Intelligent apps
?
![Page 5: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/5.jpg)
![Page 6: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/6.jpg)
Creating a model pipeline
exploration
data
modeling
- Images
- Text
- Graphs
- Tabular Data
![Page 7: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/7.jpg)
Creating a model pipeline
Ingest → Transform → Model → Deploy
Unstructured Data
![Page 8: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/8.jpg)
Creating a model pipeline using Dato products
Ingest → Transform → Model → Deploy
Unstructured Data
SFrame Engine (FREE, open source)
GraphLab Create (Scalable Machine Learning Python Library, 4K/machine/year)
Predictive Services (Serving + Load Balancing + AB Testing, 10K/machine/year)
![Page 9: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/9.jpg)
$ pip install -U graphlab-create
![Page 10: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/10.jpg)
What will we cover today?
1. Match a movie’s screenplay with its subtitles.
- Now we know who says what and when.
2. Extract frames, then actors’ faces, from the movie.
- We’ll use opencv for video manipulation and face detection.
3. Train a face recognition model over the faces.
- What’s the smallest portion of the movie we can get good results from?
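Step 1 can be illustrated with a minimal sketch. This is not the webinar's code: it assumes the subtitles are in SRT format and that the screenplay has already been parsed into (speaker, line) pairs, and it swaps in Python's standard `difflib` for the fuzzy matching. The names `parse_srt` and `match_speakers` and the 0.6 similarity threshold are hypothetical choices for illustration.

```python
import re
from difflib import SequenceMatcher


def parse_srt(text):
    """Parse SRT-formatted text into a list of (start, end, text) cues."""
    cues = []
    # SRT cues are separated by blank lines.
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        # The timing line is the one containing "-->".
        timing = next((i for i, ln in enumerate(lines) if "-->" in ln), None)
        if timing is None:
            continue
        start, end = [part.strip() for part in lines[timing].split("-->")]
        # Everything after the timing line is the spoken text.
        cues.append((start, end, " ".join(lines[timing + 1:])))
    return cues


def normalize(line):
    """Lowercase and drop punctuation so small differences do not matter."""
    return re.sub(r"[^a-z0-9 ]", "", line.lower()).strip()


def match_speakers(cues, screenplay, threshold=0.6):
    """Attach a speaker to each cue by fuzzy-matching screenplay lines.

    screenplay is a list of (speaker, line) pairs. Cues whose best match
    scores below the (hypothetical) threshold get speaker None.
    """
    matched = []
    for start, end, text in cues:
        best_speaker, best_score = None, 0.0
        for speaker, line in screenplay:
            score = SequenceMatcher(None, normalize(text), normalize(line)).ratio()
            if score > best_score:
                best_speaker, best_score = speaker, score
        if best_score < threshold:
            best_speaker = None
        matched.append((start, end, best_speaker, text))
    return matched
```

Each result row carries the cue's start and end timestamps together with the matched speaker, which is the "who says what and when" table the later steps build on. Comparing every cue against every screenplay line is quadratic, so a full-length movie would likely need a windowed search around the previous match.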
![Page 11: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/11.jpg)
Python vs. Anaconda
• You can download Python for free from python.org.
- Python with its standard library.
• Or, you could download the Anaconda distribution.
- Python + tons of installed packages + package managers.
• It’s the same Python, but Anaconda includes both pip and its own package manager, conda.
![Page 12: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/12.jpg)
pip vs. conda vs. virtualenv
pip – install Python packages.
conda – install Python packages + any OS packages required for your package to work (libraries etc).
$ conda install -c menpo opencv3=3.1.0
virtualenv – separate environment (by manipulating the $PYTHONPATH etc.) so packages won’t break.
You can have multiple Python versions on the same machine, and use a Python version in different environments.
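Once OpenCV is installed (for example with the conda command above), frame and face extraction can be sketched roughly as follows. This is an illustrative outline rather than the code shown in the webinar: `frame_step` and `extract_faces` are hypothetical names, sampling one frame per second is an arbitrary choice, and the sketch assumes the build exposes `cv2.data.haarcascades` (the `opencv-python` wheels do; other builds may need an explicit path to the cascade XML file).

```python
def frame_step(fps, every_seconds=1.0):
    """Number of frames to advance between two sampled frames."""
    return max(1, int(round(fps * every_seconds)))


def extract_faces(video_path, every_seconds=1.0):
    """Yield (frame_index, face_crop) pairs for faces found in sampled frames."""
    import cv2  # imported lazily so frame_step works without OpenCV installed

    # Assumption: the stock frontal-face Haar cascade is available here.
    detector = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
    )
    capture = cv2.VideoCapture(video_path)
    fps = capture.get(cv2.CAP_PROP_FPS) or 25.0  # fall back if fps is unknown
    step = frame_step(fps, every_seconds)
    index = 0
    try:
        while True:
            ok, frame = capture.read()
            if not ok:  # end of the video or a read error
                break
            if index % step == 0:
                # Haar cascades operate on grayscale images.
                gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
                faces = detector.detectMultiScale(
                    gray, scaleFactor=1.1, minNeighbors=5
                )
                for (x, y, w, h) in faces:
                    yield index, frame[y:y + h, x:x + w]
            index += 1
    finally:
        capture.release()
```

Dividing the frame index by the frame rate gives a timestamp in seconds, which is what allows each face crop to be lined up with subtitle timings.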
![Page 13: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/13.jpg)
Look Deeper!
1) Building a Face Recognition System with OpenCV in the blink of an Eye
• https://github.com/rragundez/PyData
• Live video from webcam, online analytics
2) Using mxnet for deep feature extraction
• https://github.com/dmlc/mxnet/blob/master/example/notebooks/predict-with-pretrained-model.ipynb
• mxnet is now integrated into GraphLab!
3) mxnet-face
• https://github.com/tornadomeet/mxnet-face
![Page 14: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/14.jpg)
Confidential – Dato internal use only. ©2015 Dato, Inc.
Questions?
“For the purpose of learning the Answer to the Ultimate Question of Life, The Universe, and Everything, the supercomputer Deep Thought was specially built. It takes Deep Thought 7½ million years to compute and check the answer, which turns out to be 42. Deep Thought points out that the answer seems meaningless because the beings who instructed it never actually knew what the Question was.”
- Douglas Adams, “The Hitchhiker’s Guide to the Galaxy”
![Page 15: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/15.jpg)
Our Machine Learning Specialization on Coursera
https://www.coursera.org/learn/ml-foundations
![Page 16: Webinar - Analyzing Video](https://reader031.vdocuments.site/reader031/viewer/2022012914/58708f7e1a28ab412b8b529d/html5/thumbnails/16.jpg)
Thanks!
Install using pip: $ pip install -U graphlab-create
Dato Launcher Download: https://dato.com/download/
The benchmarks on GitHub: https://github.com/guy4261/glc_pagerank_benchmark
Coursera Course: https://www.coursera.org/learn/ml-foundations
Reach out: [email protected]