genetic algorithms for topical web search: a study of different mutation rates

15
Algorithms for Topical Web Search: A Study of Different Mutation Rates Roc ́ıo L. Cecchini†, Carlos M. Lorenzetti‡, Ana G. Maguitman‡ and N ́elida Beatriz Brignole†§

Upload: aman

Post on 23-Feb-2016

39 views

Category:

Documents


0 download

DESCRIPTION

Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates . Roc ́ ıo L. Cecchini †, Carlos M. Lorenzetti‡, Ana G. Maguitman ‡ and N ́ elida Beatriz Brignole †§ . Agenda. INTRODUCTION GENETIC ALGORITHMS GENETIC ALGORITHMS FOR EXPLORING QUERY SPACE - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

Genetic Algorithms for

Topical Web Search: A Study

of Different Mutation Rates Roc ́ıo L. Cecchini†, Carlos M. Lorenzetti‡, Ana G.

Maguitman‡ and N ́elida Beatriz Brignole†§

Page 2: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

Agenda INTRODUCTION GENETIC ALGORITHMS GENETIC ALGORITHMS FOR EXPLORING

QUERY SPACE SYSTEM ARCHITECTURE THE EFFECT OF DIFFERENT MUTATION

RATES CONCLUDING REMARKS

Page 3: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

INTRODUCTION Harvesting topical content is a process that

can be done by formulating topic-relevant queries and submitting them to a search engine.

In this scenario, selecting good query terms can be seen as an optimization problem where the objective function to be optimized is based on the effectiveness of a query to retrieve relevant material.

Page 4: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

INTRODUCTION Solving this optimization problem is

challenging because the query space has a huge number of dimensions.

To use GA to solve it, because following characteristics:

1. It is usually sufficient to identify suboptimal solutions

2. we may be interested in finding many high-quality queries rather than a single one.

Page 5: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

GENETIC ALGORITHMS

Selection: Parents are selected to produce offspring, favoring those parents with highest values of the fitness function.

Crossover: Crossover of population members takes place by exchanging subparts of the parent chromosomes

Mutation: mutation is the result of a random perturbation of the chromosome (e.g., replacing a gene by another)

Page 6: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

GENETIC ALGORITHMS

Step 1: Start with a randomly generated population Step 2: Evaluate the fitness of each individual in the

population Step 3: Select individuals to reproduce based on their

fitness Step 4: Apply crossover with probability Pc Step 5: Apply mutation with probability Pm Step 6: Replace the population by the new generation

of individuals Step 7: Go to step 2

Page 7: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

Selection-fitness function

Page 8: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

Selection

Page 9: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

Crossover The crossover operator used in our proposal

is known as single-point. new queries: 1.the first n terms are contributed by

one parent 2. the remaining terms by the second

parent, where the crossover point n is chosen at random.

Page 10: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

Mutation These changes consist in replacing a

randomly selected query term t1 by another term t2. The term t2 is obtained from a mutation pool .

Mutation pool: is updated with new terms from the snippets returned by the search engine

Page 11: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

SYSTEM ARCHITECTURE

Page 12: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

THE EFFECT OF DIFFERENT

MUTATION RATES

Page 13: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

Evaluation of query quality

Page 14: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

Conclusion 1. The techniques presented in this paper

are applicable to any domain for which it is possible to generate term-based characterizations of a topic.

2. higher mutations rates induce a more thorough exploration of the search space

3. As part of our future work we expect to continue testing different settings for the GA parameters (population size, crossover probability, mutation probability) as well as other selection methods.

Page 15: Genetic Algorithms for Topical Web Search: A Study of Different Mutation Rates

Q&A