Transcript
Page 1: Stack_Overflow-Network_Graph

Stack Overflow

Network Graph

Team: Heineken

Page 2: Stack_Overflow-Network_Graph
Page 3: Stack_Overflow-Network_Graph

Motivation

What do we want?

- We want to see the overall quality of a question

- We want to see how closely similar questions relate to a given question

Page 4: Stack_Overflow-Network_Graph

Solution

Visualize the Question Network as a Graph

- Inspired by http://www.visuwords.com/

Page 5: Stack_Overflow-Network_Graph
Page 6: Stack_Overflow-Network_Graph

Dataset

Page 7: Stack_Overflow-Network_Graph

Dataset

Page 8: Stack_Overflow-Network_Graph

How Can We Translate the Questions Into a Network Graph?

Page 9: Stack_Overflow-Network_Graph

Three Properties of Graph

1. Size of Node: Quality of a Question

2. Distance Between Nodes: Similarity Between Questions (Content Based)

3. Directed Edges: Link Between Questions (Item Based)
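
The mapping on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the Question class, the use of a vote score as the stand-in for "quality", and the "1 - similarity" distance are hypothetical choices, not taken from the slides.

```python
from dataclasses import dataclass, field


@dataclass
class Question:
    qid: int
    score: int                                      # assumed stand-in for "quality"
    linked_to: list = field(default_factory=list)   # ids of questions this one links to


def build_graph(questions, similarity):
    """Translate questions into node sizes, node distances, and directed edges."""
    # 1. Size of node <- quality of the question (negative scores clamp to 0).
    nodes = {q.qid: {"size": 1 + max(q.score, 0)} for q in questions}
    # 2. Distance between nodes <- content similarity (more similar = closer).
    distances = {pair: 1.0 - sim for pair, sim in similarity.items()}
    # 3. Directed edges <- explicit links from one question to another.
    edges = [(q.qid, target) for q in questions for target in q.linked_to]
    return nodes, distances, edges


questions = [Question(1, 10, [2]), Question(2, 3), Question(3, 0, [1])]
similarity = {(1, 2): 0.8, (1, 3): 0.1, (2, 3): 0.25}
nodes, distances, edges = build_graph(questions, similarity)
```

In a layout tool the distance would normally emerge from edge weights rather than be set directly, so the distances dictionary here is just a way to make the idea concrete.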

Page 10: Stack_Overflow-Network_Graph

Quality

Similarity

Association

Question

Concept

Page 11: Stack_Overflow-Network_Graph

Data Exploration

Page 12: Stack_Overflow-Network_Graph
Page 13: Stack_Overflow-Network_Graph
Page 14: Stack_Overflow-Network_Graph

Data Exploration

Page 15: Stack_Overflow-Network_Graph

Data Exploration

Page 16: Stack_Overflow-Network_Graph

Data Cleansing

Sed

Pig Latin

Page 17: Stack_Overflow-Network_Graph

Bash - Sed
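
The sed commands shown on this slide are not part of the transcript. As a rough illustration of the kind of substitution-based cleanup such a step performs, here is a Python sketch that assumes the raw question bodies contain HTML markup and entities; the function name clean_body and the sample input are hypothetical.

```python
import html
import re

TAG_RE = re.compile(r"<[^>]+>")   # any HTML tag
WS_RE = re.compile(r"\s+")        # runs of whitespace, including newlines


def clean_body(raw):
    """Strip HTML tags, decode entities, and collapse whitespace."""
    text = TAG_RE.sub(" ", raw)   # remove tags first so escaped "<" stays as text
    text = html.unescape(text)    # &amp; -> &, &lt; -> <, ...
    return WS_RE.sub(" ", text).strip()


cleaned = clean_body("<p>How do I use <code>sed</code> &amp; regex?</p>\n")
```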

Page 18: Stack_Overflow-Network_Graph

Pig

Page 19: Stack_Overflow-Network_Graph

Data Analysis

Cosine Similarity

Aggregation

Integration

Page 20: Stack_Overflow-Network_Graph

Hive

Page 21: Stack_Overflow-Network_Graph

Finding Similarities Among the Questions in RStudio With R Language

● Library: "tm"

● Data Format: Corpus

● Standardization:

○ Cleansing

○ SMART IR system

○ Porter Stemmer Approach

● Term Frequency matrix

○ Normalization: Cornell SMART system

● Generate Vector Space Model Matrix

● Dot Product of the Matrix
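
The steps on this slide (standardize the text, weight the terms, normalize, take dot products) can be sketched without any libraries. The team used R and the tm package; the Python below is a stand-in for illustration only. Its assumptions: a tiny hand-written stop-word list instead of the SMART list, no stemming, a plain tf * log(N / df) weighting, and made-up sample questions.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (assumption; the slides name the SMART system).
STOP_WORDS = {"a", "an", "the", "in", "to", "how", "do", "i", "is", "of"}


def tokenize(text):
    """Lowercase, keep alphabetic tokens, drop stop words. No stemming here."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def tfidf_vectors(docs):
    """Return one unit-length TF-IDF vector (dict: term -> weight) per document."""
    counts = [Counter(tokenize(d)) for d in docs]
    n = len(docs)
    df = Counter(term for c in counts for term in c)   # document frequency
    vectors = []
    for c in counts:
        vec = {t: tf * math.log(n / df[t]) for t, tf in c.items()}
        norm = math.sqrt(sum(w * w for w in vec.values()))
        # Cosine normalization: scale each vector to length 1 (empty if all-zero).
        vectors.append({t: w / norm for t, w in vec.items()} if norm else {})
    return vectors


def cosine(u, v):
    """Dot product of two unit-length sparse vectors equals their cosine similarity."""
    return sum(w * v.get(t, 0.0) for t, w in u.items())


docs = [
    "How do I sort a list in Python",
    "Sort a Python list of tuples",
    "Center a div in CSS",
]
vectors = tfidf_vectors(docs)
similarity = [[cosine(u, v) for v in vectors] for u in vectors]
```

Because every vector is scaled to unit length first, the dot product of the matrix with its transpose gives the full cosine similarity matrix in one step, which is what the last bullet describes.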

Page 22: Stack_Overflow-Network_Graph
Page 23: Stack_Overflow-Network_Graph
Page 24: Stack_Overflow-Network_Graph

factorial number of similarities
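
If this slide refers to comparing every question with every other question (an assumption; the slide image is not in the transcript), the number of comparisons is the binomial coefficient "n choose 2". It is defined with factorials but simplifies to n * (n - 1) / 2. A short Python check, using 300 questions as the example size:

```python
import math
from itertools import combinations


def pair_count(n):
    """Number of unordered question pairs: n! / (2! * (n - 2)!) = n * (n - 1) / 2."""
    return math.comb(n, 2)


question_ids = list(range(300))
pairs = combinations(question_ids, 2)   # lazily yields each unordered pair once
total = sum(1 for _ in pairs)           # 300 * 299 / 2 pairs
```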

Page 25: Stack_Overflow-Network_Graph

Visualization

Page 26: Stack_Overflow-Network_Graph

Discovery - First 300 Questions TfIdf

Page 27: Stack_Overflow-Network_Graph

Discovery - First 300 Questions Bool

Page 28: Stack_Overflow-Network_Graph

Discovery - Links Rank Validation

Initial

OpenOrd

Page 29: Stack_Overflow-Network_Graph

Discovery - Links Rank Validation

Filter

Page 30: Stack_Overflow-Network_Graph

Discovery - Links Rank Validation

Edge View

OpenOrd

Page 31: Stack_Overflow-Network_Graph

Discovery - Links Rank Validation

Page 32: Stack_Overflow-Network_Graph

GEXF

Deployment

Page 33: Stack_Overflow-Network_Graph

Sigma.js

Graphs from Gephi can be exported but they are static

But what we want is for the user to interact with the graph

Sigma.js is a JavaScript library that renders graphs on web pages

Export the graph from Gephi and parse it with Sigma.js
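
To make the exchange format concrete, here is a sketch of what a minimal GEXF 1.2 document contains, built with Python's standard XML library. In the project the file comes from Gephi's export; the to_gexf helper, the node labels, and the weights below are hypothetical.

```python
import xml.etree.ElementTree as ET

GEXF_NS = "http://www.gexf.net/1.2draft"
VIZ_NS = "http://www.gexf.net/1.2draft/viz"
ET.register_namespace("", GEXF_NS)      # default namespace for GEXF elements
ET.register_namespace("viz", VIZ_NS)    # visual attributes such as node size


def to_gexf(nodes, edges):
    """Serialize {id: (label, size)} and [(source, target, weight)] as GEXF text."""
    root = ET.Element(f"{{{GEXF_NS}}}gexf", {"version": "1.2"})
    graph = ET.SubElement(
        root, f"{{{GEXF_NS}}}graph",
        {"mode": "static", "defaultedgetype": "directed"},
    )
    nodes_el = ET.SubElement(graph, f"{{{GEXF_NS}}}nodes")
    for node_id, (label, size) in nodes.items():
        node = ET.SubElement(
            nodes_el, f"{{{GEXF_NS}}}node", {"id": str(node_id), "label": label}
        )
        ET.SubElement(node, f"{{{VIZ_NS}}}size", {"value": str(size)})
    edges_el = ET.SubElement(graph, f"{{{GEXF_NS}}}edges")
    for i, (source, target, weight) in enumerate(edges):
        ET.SubElement(
            edges_el, f"{{{GEXF_NS}}}edge",
            {"id": str(i), "source": str(source), "target": str(target),
             "weight": str(weight)},
        )
    return ET.tostring(root, encoding="unicode")


gexf_text = to_gexf(
    nodes={1: ("Question 1", 11.0), 2: ("Question 2", 4.0)},
    edges=[(1, 2, 0.8)],
)
```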

Page 34: Stack_Overflow-Network_Graph

Sigma.js

Init function

Page 35: Stack_Overflow-Network_Graph

Sigma.js

parse.Gexf function

Creating Nodes

Page 36: Stack_Overflow-Network_Graph

Sigma.js

Creating Edges
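
The JavaScript shown on these slides is not included in the transcript. The logic of a GEXF parser (read every node, then every edge, and hand both to the renderer) can still be illustrated. The Python sketch below is not Sigma.js code; the parse_gexf function and the sample document are hypothetical.

```python
import xml.etree.ElementTree as ET

NS = {
    "g": "http://www.gexf.net/1.2draft",
    "viz": "http://www.gexf.net/1.2draft/viz",
}

SAMPLE_GEXF = """<gexf xmlns="http://www.gexf.net/1.2draft"
      xmlns:viz="http://www.gexf.net/1.2draft/viz" version="1.2">
  <graph mode="static" defaultedgetype="directed">
    <nodes>
      <node id="1" label="Question 1"><viz:size value="11.0"/></node>
      <node id="2" label="Question 2"><viz:size value="4.0"/></node>
    </nodes>
    <edges>
      <edge id="0" source="1" target="2" weight="0.8"/>
    </edges>
  </graph>
</gexf>"""


def parse_gexf(text):
    """Return (nodes, edges) as lists of plain dictionaries."""
    root = ET.fromstring(text)
    nodes = []
    for node in root.findall("./g:graph/g:nodes/g:node", NS):
        size_el = node.find("viz:size", NS)
        nodes.append({
            "id": node.get("id"),
            "label": node.get("label"),
            # Fall back to size 1.0 when the file carries no viz:size element.
            "size": float(size_el.get("value")) if size_el is not None else 1.0,
        })
    edges = []
    for edge in root.findall("./g:graph/g:edges/g:edge", NS):
        edges.append({
            "id": edge.get("id"),
            "source": edge.get("source"),
            "target": edge.get("target"),
            "weight": float(edge.get("weight", "1.0")),
        })
    return nodes, edges


nodes, edges = parse_gexf(SAMPLE_GEXF)
```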

Page 37: Stack_Overflow-Network_Graph

Final Product

Page 38: Stack_Overflow-Network_Graph

Code Available on Git Repo

Page 39: Stack_Overflow-Network_Graph

Cheers!

