data mining tag clouds
Post on 24-May-2015
429 Views
Preview:
DESCRIPTION
TRANSCRIPT
DATA MINING AND WAREHOUSING
VISUALIZATION METHODS
VISUALIZING COMPLEX DATA AND RELATIONS
PRESENTED BY: REEMA QAISER KHAN
VISUALIZING COMPLEX DATA AND RELATIONS• In early days, visualization techniques
were mainly for numeric data.
• Recently, more and more non-numeric data, such as text and social networks, have become available.
• Visualizing and analyzing such data attracts a lot of interest.
TAG CLOUD
• For example, many people on the web tag various objects such as pictures, blog entries, and product reviews.
• A tag cloud is a stylized way of visually representing occurrences of words used to describe tags.
• The most popular topics are normally highlighted in a larger, bolder font.
• The first use of tag clouds on a website was on Flickr, created by Stewart Butterfield.
TYPES OF TAG CLOUD
There are many ways of implementation tag clouds. Some of them are more popular than the other ones. Basically we can divided tag clouds into 4 categories:
1-Colourful
2-Font-size
3-Sorted
4-Unsorted
COLOURFUL TAG CLOUDS
• The weight of the tags determined by the colour it has.
• It is strictly recommended to use only 2-3 colours in tag cloud, because if the more colours are used in cloud, cloud become more irritating and useless.
• The idea is that the more contrast exists between the colour of the tag and the background, the more powerful tag is.
• Weak- not often used tags have colours more similar to the background colour.
COLOURFUL TAG CLOUDS
Some examples of colourful tag clouds.
FONT-SIZE TAG CLOUDS
• In these type of clouds the most important or frequent words are highlighted by an appropriate font-size.
• It means that more powerful tag is bigger.
An example of Font-size tag cloud.
This image shows the "All time most popular tags" from Flickr Photo Sharing
COMPUTATION OF FONT-SIZE IN TAG CLOUD
Each tag represents your customers favorite holiday. How can you present the tags as a cloud tag being the valentines day as the biggest (with 50px font-size) and the liberation day as the smallest (with 12px font-size)?
• We will use the following variables, namely:a = the smallest count (or occurrence).b = the count of the tag being computed.c = the largest count.w = the smallest font-size.x = the font-size for the tag. It is the unknown.y = the largest font-size.
• Now let's substitute the given values to their respective variables. Assuming that we are solving for the "thanksgiving" font-size.a = 88b = 168c = 211w = 12x = ?y = 50
• And here's the formula:
x = (b-a) (y-w) ----------- + w (c-a)
x = ( ((168-88) * (50-12)) / (211-88) ) + 12x = 36.715446x = 37
• The thanksgiving tag should have 37px font-size in the tag cloud. birthday = 29pxchristmas = 18pxvalentines = 50pxthanksgiving = 37pxliberation = 12pxhaloween = 20pxnew year = 28px
SORTED TAG CLOUDS AND UNSORTED TAG CLOUDS• In sorted tag clouds, the clouds can be
sorted according to alphabet, frequency or similarity.
• In unsorted tag clouds, the clouds may not be sorted in some order, they may be in a clustered form.
TAG INDEX• In some cases tag clouds might be not the
best solution for precise content presentation.
• For instance, if visitors are looking for some specific topic they would prefer a search engine rather than “weighting” proportions of the tags.
TAG INDEX
An example of Tag Index
THANKYOU
top related