mlw 2014 - the evolution of our revolution, a session from alm

20
The Evolution of our Revolution Gene Bishop VP of Technology ALM’s Development in MarkLogic

Upload: marklogic

Post on 01-Jul-2015

149 views

Category:

Technology


4 download

DESCRIPTION

The story of our development and use of MarkLogic. We began with our first use as a content repository. We then developed and delivered customer products, as well as licensed content by product. Now we are off to the races with our version of Master Data Management in building am Audience Database solution.

TRANSCRIPT

The Evolution of our Revolution

Gene Bishop VP of Technology

ALM’s Development in MarkLogic

Introduction to ALM

• Leading Integrated Content provider serving the Legal and Commercial Real Estate Industries

• Our Channels • 60+ Websites and Blogs • Print 22 Newspapers, Magazines, across the

country • Over 140 Books published, print and online • 275 Events Globally reaching 50,000 decision

makers

• Digital product lines • SmartLitigator, Verdict Search • Directories • Books • Law Jobs • CLE • Reprints

• Offices and Publications in 9 US Regions, Canada, UK, Hong Kong

You’re Jimmy right? This is your house?...I’m Winston Wolf, I solve problems.”

By The Numbers…

• Our Award Winning Content reaches over 800,000 legal professionals monthly

• 60+ News Websites and Blogs – 500k Documents archive • 22 Print News Papers and Magazines – 1.8 Million Documents

archive dating back to 1989 • Case Law Document Archive- 300,000 • 11 Regional Judicial Profiles Print and Web- 8,000 Document Archive • 40+ E-newsletters • 140+ Books/ Treatises • 6+ Citators • 9 State Specific Verdicts Print Editions- 170,000 Document Archive

on the Web, 6,000 added every year • 9 Regional Expert Witness Directories- 15,000 Records

Our Content Types

Volume, Variety, Variability

• 10,000 to 12,000 content pieces added every month • Archive growing by 120k to 150k every year • Digital News Archive dates back to 1989 • 13 Content Creation Ecosystems exist for Editorial

Content Types like News, Cases, Verdicts, Treatises, Directories, Public Notices, Real Estate Content, Newsletters, Surveys, Blogs and Judges Profiles with regional variation

2 Sets of Challenges

The Problem Cycle

Problem ID

Analysis

Develop Test

Deliver

Problem ID

Analysis

Develop Test

Deliver

Problem ID

Analysis

Develop Test

Deliver

Knowledge & Experience Level

Archive

Delivery

???

The Beginning

2006

2008 - 2010Building Research

Applications

2007 2008 2009

2006 - 2008Golden Bucket

2010

• Using MarkLogic since 2005-06 • Initially used as a Content Repository • Evolves in a Repurposing Engine • Provides the foundation for content

serving for Research Applications

SmartLitigator

The Springboard…a 2nd Problem

2011 20122010

2010 - 2011Delivering Licensed Content Externally

2011 - 2012 Re - Aligning the Architecture

• We added a few sources to our Prod MLogic instances to strengthen the research products.

• Deliver Licensed Content • Began adding Many content sources • The ‘Big Show’ got bigger…

2013

Winston Wolf II - A Licensing Problem

• Over 1.8M documents • On going data delta delivery

• Provide a 6 month fence separating out our current content into archive access

“Jimmy…did I smell some coffee…”

There were: 15 Newspapers 7 Magazines 60 Websites/Blogs 44 Newsletters 140 Treatises 11 Judicial Profiles sources 4 Case Law Sources 9 Verdicts editions

All content loaded into MarkLogic using a Loader layer into an Incoming Database

We moved to Content Specific Buckets

Assigned to a Licensee Database Content pre-processing and

conversion to NITF done We delivered to a Licensee FTP

drop per a pre-agreed schedule (Daily, Weekly, Monthly per agreement with Licensee)

Deliveries are monitored by robust reporting and issue notification

Hard to believe…but I have a new problem

• Our “rich” content is loaded • Our data organization is founded

in product • We are now loading customer

information.

The Holy Grail?

2013 - 2015 MDM & Semantics

2014 2013

What that means to ALM

• We have our products Some Related Articles New, local, or relevant events New content additions to subscription

products Books, Magazines, Services

• We have Content Aggregated from created and

acquired sources Organized by product Tagged according to Taxonomy

• We have Reader/Visitor/Subscriber information He has a digital account He told us he’s a LinkedIn User He tells us his practice area

How can I get my message across and include a shameless plug for my boss?

At ALM-We’re taking the challenge to “re-imagine”

Predictive analysis: Firm, Individual, trends, schools

practice areas Product Offerings Manual and Automatic according

to behavior Triggers for Sales and Marketing

motions What do we know now?

What did we know then? How can I use this to make more accurate offerings or different or new products or analysis?

Small…but Mighty!

Our Evolution

2007 2008 2009 2010 2011 2012 2013 20142006

2006 - 2008Golden Bucket

2008 - 2010Building Research

Applications

2010 - 2011Delivering Licensed Content Externally

2013 - 2015 MDM & Semantics

2011 - 2012 Re - Aligning the Architecture

Q & A?

Gene E. Bishop VP of Technology ALM 4 Metrotech 21st Floor Brooklyn, NY 11201 [email protected] 347-227-3300