big data and wee data by avery rosen

Post on 10-May-2015

312 Views

Category:

Technology

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

Big Data and Wee Data - We all know MongoDB is great for Big Data, but it's also great for work on the other end of the scale -- call it "Wee Data". In this talk, MongoDB expert an Principal at Bringing Fire Consulting, Avery Rosen, talks on how this type of data is far more common than Big Data scenarios. Avery discusses how just about every project starts with it. In this domain, we don't care about disk access and indices; instead, we care about skipping past the wheel inventing and getting right down to playing with the data. MongoDB lets you persist your prototype or small-working-set data without making you deal with freeze-drying and reconstitution, provides structure well beyond csv, gets out of your way as you evolve your schemas, and provides simple tools for introspecting data and crunching numbers.

TRANSCRIPT

MongoDB and Wee Data

hacking a workflow

what it’s all about

• MongoDB lets you persist and investigate your application data while getting out of your way

• p.s. BDD

me

• http://bringingfire.com/

• @4BringingFire

• expertise at the intersection of technology, product formulation, and business strategy

stuff I’m into

• products > tech

• UX testing

• iterative, analytics-driven dev

• agility in production

http://www.shopwiki.com

http://penrosejudaica.com

total projectsbig data projects= ?< 1%NOT AN

ACTUAL FACT

what is wee data?

• query performance is not an issue

• in fact it might fit in RAM

• in fact it might fit in 2MB

where mongo fits

• scales down nicely

• access programmatically without SQL

• persists for free

• query later, too

ORMpuh-lease.

http://penrosejudaica.com

Behavior Driven Development

• “Outside-in” approach

• Everything is behavior

• Acceptance-test-driven planning

• Ubiquitous language

Cucumber

Cucumber

Cucumber

Cucumber

rspec

• specification

• object doubles (mocks, fakes, stubs, spies)

• method stubbing

• expectations

Behavior Driven Development

• http://cukes.info/

• http://rspec.info/

Name Materials Price Description

fancy menorah gold, silver 600

plain menorah pewter 150

the Penrose ETL pipeline

Product Spreadsheets

.csv.csv.csv

CSV files Product Descriptions

Blah blah blah exquisite

craftsmanship

db.products.aggregate( { $match : { product_type : “menorah” } }, { $group : { _id: “$material”, count:

{ $sum : 1 } } )

Buy AdWords?

Q & A

top related