oas schwartz 16
TRANSCRIPT
Open Analytics NYC 2013
Measuring how users consume content on the web
Josh Schwartz, Chartbeat
About Chartbeat • Founded in 2009 • Focused on realtime analytics • Used by 8/10 top US publishers • Analyzes over 1B browsing sessions per day • Handles over 200K requests per second at daytime
peak
A quick introduction
About me • Background in machine learning and computer
vision • Data Scientist at Chartbeat • Currently focused on researching user engagement
on the web
A quick introduction
What on my site are visitors actually reading?
Content consumption
Creating new metrics for user engagement
• Research process • Backend infrastructure • As many insights as I can fit in
This talk
Event based metrics are insufficient. Clicks measure engagement with links, not with content.
Content consumption
Measuring between clicks
{window_top: 100, window_height: 700, doc_height: 3400, time_on_page: 90, ... }
Counting engaged time
{window_top: 100, window_height: 700, doc_height: 3400, time_on_page: 90, engaged_time: 62, ... }
Data architecture
In-memory DB (realtime traffic)
MongoDB (page-by-page historical)
AWS Redshift (user-by-user historical)
Where do they scroll?
How do they scroll?
Where do they engage?
Why does it matter?
Why does it matter?
Questions?
Josh Schwartz [email protected] @joshuadschwartz