discovering the constraint- based association rules in an archive for unique bulgarian bells tihomir...

19
Discovering the constraint-based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics and Informatics, University of Veliko Tarnovo “St. Cyril and St. Methodius”, Bulgaria The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006 Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

Upload: ferdinand-baker

Post on 16-Jan-2016

229 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

Discovering the constraint-based association rules in an archive for unique Bulgarian bells

Tihomir Trifonov, Tsvetanka Georgieva

Department of Mathematics and Informatics,University of Veliko Tarnovo “St. Cyril and St. Methodius”, Bulgaria

The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

Page 2: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

This paper presents an application that discovers the constraint-based association rules.

It allows the association analysis of the different characteristics of the bells.

Detailed information about the examined bells is maintained in an audio and video archive of unique Bulgarian bells.

The application is realized with Java and SQL and provides the possibility for finding the association rules of the data obtained after applying the methods of digital processing of signals for analysis of bell sounds.

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

2

Page 3: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

Discovering the association rules is a data mining task [3, 7] in which the goal is to find interesting relationships between the attributes of the analyzed data.

Once found, the association rules can be used for supporting decision making in different areas.

In numerous cases the algorithms generate a large number of association rules, often thousands or even millions.

It is almost impossible for the end users to encompass or validate such a large number of association rules, limiting the results of the data mining is therefore helpful.

Tenth International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2008

3

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 4: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

Let I = {I1, I2, … , In} be a set of n different values of attributes. Let R be a relation, where each tuple t has a unique identifier and contains a set of items, such that t I. An association rule is an implication of the form X → Y, where X, Y I are sets of items with XY = . The set X is called an antecedent, and Y –a consequent.

There are two parameters associated with a rule:

the support of the association rule X → Y is the proportion (in percentages) of the number of the tuples in R, which contain X Y to the total number of the tuples in the relation;

the confidence of the association rule X → Y is the proportion (in percentages) of the number of the tuples in R, which contain X Y to the number of the tuples, which contain X.

4

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Association Rules

Page 5: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

5

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

The task of association rules mining is to generate all association rules which have values of the parameters support and confidence, exceeding the previously given respectively minimal support min_supp and minimal confidence min_conf.

Therefore the discovery of the association rule requires finding the sets of items, which have a support, more than the previously defined minimal threshold min_supp. These sets are called frequent itemsets.

Association Rules

Page 6: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

To increase the efficiency of existing algorithms for data mining, during the mining process constraints are applied with the goal for these association rules, of which only those interesting to the user are generated, instead of all association rules.

The constraint-based association rule mining aims to find all rules from given dataset, which satisfy the constraints required from the users.

For discovering only the rules corresponding to the specific patterns, in [4] the meta-rules are applied. The format of the interesting rules is defined by using a template, the algorithm generates only these rules, which correspond to this template.

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

6The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Constraint-based Association Rules

Page 7: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

A meta-rule [4] is a rule template from the following type

P1 P2 … Pm → Q1 Q2 … Ql

where Pi (i = 1, …, m) and Qj (j = 1, …, l) are instantiated predicates or predicate variables, p = m + l is the number of the predicates in the rule.

This paper presents an application, which allows the user to set constraints for searched rules and finds constraint-based association rules.

The application is used for performing the association analysis on the different characteristics of the bells, the information for which is kept in an archive produced for the goal.

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

Constraint-based Association Rules

7The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 8: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

Detailed data about the analysis of bells is stored in an audio and video archive of the unique Bulgarian bells [10]. The data of the archive is accessible from http://www.math.bas.bg/bells/belleng.html

For each bell, information is maintained for its unique identifier, location, type, geometrical dimensions, weight, material, condition, creator, year or period of creation, description, estimation of its historical value, digital photos, sound and video files, spectrograms.

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

8The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Discovering the constraint-based association rules in an archive for unique Bulgarian bells

Page 9: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

A program is realized with MatLab [11] for analysis of the sounds of the bells by using various methods for digital signal processing (DSP) – spectral analysis by means of the Discrete Fourier Transform (DFT), digital filter, wavelet analysis.

The partials of the sounds of the bells are found by applying the digital filter.

The information about the previously calculated partials of the sounds of the different bells is stored in the archive.

The represented application discovers the constraint-based association rules in an archive for unique Bulgarian bells. It is realized by using the languages Java and SQL.

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

Discovering the constraint-based association rules in an archive for unique Bulgarian bells

9The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 10: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

To the user that starts the application, the following possibilities, are provided:

Setting the attributes, being subject to analysis; Setting the minimal value of the support min_supp and

the minimal value of the confidence min_conf; Setting the conditions (Boolean expression) for the

values of the attributes, which can participate or not in the antecedent and the consequence of the searched rules.

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

Discovering the constraint-based association rules in an archive for unique Bulgarian bells

10The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 11: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

Choosing the attributes; defining the minimal support, the minimal confidence, the conditions for the values of the attributes; outputting the found rules

13The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 12: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

The figure shows an example result from the execution of the realized program with given values of the minimal support, minimal confidence and conditions for the values of the attributes. For instance, let the following rule be generated from the archive for unique Bulgarian bells:

SecondParial(“600 Hz”) → FirstPartial(“320 Hz”)with values of the support s = 0.025 and the confidence c = 0.5.

This rule means, that for monastery bells with second partial 600 Hz one of the most frequent values of the first partial is 320 Hz (with 50.00% confidence) and the monastery bells with value 600 Hz of their second partial represent 2.5% from all bells, included in the study.

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

Discovering the constraint-based association rules in an archive for unique Bulgarian bells

14The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 13: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

When representing the association rules in a tabular view all found rules are exposed in a table, where each row matches to a rule and provides information for the support and the confidence of this rule.

All rules can be showed in different orders – according to the values of the attributes, participating in the antecedents and the consequents of the discovered rules; according to the values of the parameters support and confidence in ascending or descending order.

In this manner the user has a more clear and complete view of the rules and can more easily locate a special rule. The tabular view facilitates adopting a large number of rules.

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

Discovering the constraint-based association rules in an archive for unique Bulgarian bells

15The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 14: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

Monastery “St. Transfiguration” near the town of Veliko Tarnovo

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

16The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 15: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

Church “St. Nikolay” in Veliko Tarnovo, View close to the bell

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

17The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 16: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

The connection between BellDB and MatLab provides possibility for analyzing the sounds of the bells and to search a concrete bell by a sample of its sound.

Real sound of the given bell and its 3D spectrogram, computed in MatLab

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

18The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 17: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

Data acquisition of the experimental data (PULSE 11, B&K)

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

19The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 18: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

References1. R. Agrawal, T. Imielinski, A. Swami, Mining Association Rules between Sets of Items in Large Databases, In Proc. of

the ACM SIGMOD International Conference on Management of Data,1993.2. R. Agrawal, R. Srikant, Fast Algorithms for Mining Association Rules, Proc. of the Int. Conf. on Very Large Databases,

1994.3. A. A. Barsegyan, M. S. Kupriyanov, V. V. Stepanenko, I. I. Holod, Technologies for data analysis: Data Mining, Visual

Mining, Text Mining, OLAP, BHV-Peterburg, 2008 (in Russian).4. Y. Fu and J. Han, Meta-rule-guided mining of association rules in relational databases, In Proc. of the Int. Workshop

on Integration of Knowledge Discovery with Deductive and Object-Oriented Databases, 1995.5. T. Georgieva, Discovering Branching and Fractional Dependencies in Databases, Data and Knowledge Engineering,

Elsevier, vol. 66, № 2, 2008.6. M. Kamber, J. Han, J. Chiang, Using Data Cubes for Metarule-Guided Mining of Multi-Dimensional Association Rules,

Technical Report, CMPT–TR–97–10, School of Computing Sciences, Simon Fraser University, 1997.7. M. Kantardzic, Data Mining: Concepts, Models, Methods, and Algorithms, John Wiley & Sons, 2003.8. S. Kotsiantis and D. Kanellopoulos, Association Rules Mining: A Recent Overview, GESTS International Transactions

on Computer Science and Engineering, Vol.32 (1), 2006.9. R. Ng, L. Lakshmanan, J. Han, A. Pang, Exploratory Mining and Pruning Optimizations of Constrained Association

Rules, In Proceedings of the ACM SIGMOD Conference on Management of Data, 1998.10. T. Trifonov ,T. Georgieva, Web based approach to managing audio and video archive for unique Bulgarian bells, In

Proc. of the Tenth Int. Conf. on science and technology "System Analysis and Information Technologies", Kiev, 2008.11. T. Trifonov ,T. Georgieva, The bell chime – an acoustical, mathematical and technological challenge, In Proceedings

of the National Scientific Conference on Acoustics, Varna, 2008.12. The Bell Project “Research and Identification of Valuable Bells of the Historic and Culture Heritage of Bulgaria and

Development of Audio and Video Archive with Advanced Technologies” Website, http://www.math.bas.bg/bells/belleng.html

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009

20The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006

Page 19: Discovering the constraint- based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics

Благодаря за вниманието!

Thank you for your attention!

Спасибо за внимание!

The work was supported partially by the Bulgarian National Science Fund under Grant

KIN-1009/2006

21

Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009