research - searching for researchers

24
Davide Eynard [email protected] ReSearch (because research without search is just “re”)

Upload: davide-eynard

Post on 27-Jan-2015

110 views

Category:

Technology


0 download

DESCRIPTION

... because research without search is just "re". A talk teaching the basics of searching, made for the PhD students of the Department of Electronics and Information at Politecnico di Milano

TRANSCRIPT

Page 1: ReSearch - Searching for Researchers

Davide Eynard

eynardeletpolimiit

ReSearch(because research without search is just ldquorerdquo)

2

Davide EynardReSearch - 20080606

Table of contents

Introduction (ellipsis left by purpose) Conclusions

3

Davide EynardReSearch - 20080606

This seminar is not

ldquoLe risorse elettroniche per la ricercardquo a transversal course for the PhD Students of Politecnico di

Milano This (June 2008) will be the fourth edition Very good material from previous editions is available at

httpwwwbibliopolimiitdocumenti Main topics

bull query languagesbull online libraries journals and ebooksbull tools to create and manage your bibliography bull search engines deep Web open archives advanced browsingbull social publishing (blogs and RSS) and social bookmarkingbull POLIsearchbull using the university proxy to access online resourcesbull notes on copyright issuesbull search techniques (like PICO and SPICE)

4

Davide EynardReSearch - 20080606

So why

Searching (and now in particular being able to effectively search on the Internet) is very important for our research and more generally in our lives

Even if they are interested some students skip the course as it does not give enough credits

If youre interested in these topics ask for a solution (ie increase the credits together with the teaching material)

5

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

6

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

7

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

8

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

BUT I also have something new to tell you I promise

9

Davide EynardReSearch - 20080606

The Web

[httpwwwsearchloresorg]

Search engines cover (at best) frac14 of the web

Different SE may return different results (as they overlap)

Quality of results in terms of precision and recall

See (for instance) here

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 2: ReSearch - Searching for Researchers

2

Davide EynardReSearch - 20080606

Table of contents

Introduction (ellipsis left by purpose) Conclusions

3

Davide EynardReSearch - 20080606

This seminar is not

ldquoLe risorse elettroniche per la ricercardquo a transversal course for the PhD Students of Politecnico di

Milano This (June 2008) will be the fourth edition Very good material from previous editions is available at

httpwwwbibliopolimiitdocumenti Main topics

bull query languagesbull online libraries journals and ebooksbull tools to create and manage your bibliography bull search engines deep Web open archives advanced browsingbull social publishing (blogs and RSS) and social bookmarkingbull POLIsearchbull using the university proxy to access online resourcesbull notes on copyright issuesbull search techniques (like PICO and SPICE)

4

Davide EynardReSearch - 20080606

So why

Searching (and now in particular being able to effectively search on the Internet) is very important for our research and more generally in our lives

Even if they are interested some students skip the course as it does not give enough credits

If youre interested in these topics ask for a solution (ie increase the credits together with the teaching material)

5

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

6

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

7

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

8

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

BUT I also have something new to tell you I promise

9

Davide EynardReSearch - 20080606

The Web

[httpwwwsearchloresorg]

Search engines cover (at best) frac14 of the web

Different SE may return different results (as they overlap)

Quality of results in terms of precision and recall

See (for instance) here

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 3: ReSearch - Searching for Researchers

3

Davide EynardReSearch - 20080606

This seminar is not

ldquoLe risorse elettroniche per la ricercardquo a transversal course for the PhD Students of Politecnico di

Milano This (June 2008) will be the fourth edition Very good material from previous editions is available at

httpwwwbibliopolimiitdocumenti Main topics

bull query languagesbull online libraries journals and ebooksbull tools to create and manage your bibliography bull search engines deep Web open archives advanced browsingbull social publishing (blogs and RSS) and social bookmarkingbull POLIsearchbull using the university proxy to access online resourcesbull notes on copyright issuesbull search techniques (like PICO and SPICE)

4

Davide EynardReSearch - 20080606

So why

Searching (and now in particular being able to effectively search on the Internet) is very important for our research and more generally in our lives

Even if they are interested some students skip the course as it does not give enough credits

If youre interested in these topics ask for a solution (ie increase the credits together with the teaching material)

5

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

6

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

7

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

8

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

BUT I also have something new to tell you I promise

9

Davide EynardReSearch - 20080606

The Web

[httpwwwsearchloresorg]

Search engines cover (at best) frac14 of the web

Different SE may return different results (as they overlap)

Quality of results in terms of precision and recall

See (for instance) here

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 4: ReSearch - Searching for Researchers

4

Davide EynardReSearch - 20080606

So why

Searching (and now in particular being able to effectively search on the Internet) is very important for our research and more generally in our lives

Even if they are interested some students skip the course as it does not give enough credits

If youre interested in these topics ask for a solution (ie increase the credits together with the teaching material)

5

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

6

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

7

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

8

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

BUT I also have something new to tell you I promise

9

Davide EynardReSearch - 20080606

The Web

[httpwwwsearchloresorg]

Search engines cover (at best) frac14 of the web

Different SE may return different results (as they overlap)

Quality of results in terms of precision and recall

See (for instance) here

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 5: ReSearch - Searching for Researchers

5

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

6

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

7

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

8

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

BUT I also have something new to tell you I promise

9

Davide EynardReSearch - 20080606

The Web

[httpwwwsearchloresorg]

Search engines cover (at best) frac14 of the web

Different SE may return different results (as they overlap)

Quality of results in terms of precision and recall

See (for instance) here

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 6: ReSearch - Searching for Researchers

6

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

7

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

8

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

BUT I also have something new to tell you I promise

9

Davide EynardReSearch - 20080606

The Web

[httpwwwsearchloresorg]

Search engines cover (at best) frac14 of the web

Different SE may return different results (as they overlap)

Quality of results in terms of precision and recall

See (for instance) here

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 7: ReSearch - Searching for Researchers

7

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

8

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

BUT I also have something new to tell you I promise

9

Davide EynardReSearch - 20080606

The Web

[httpwwwsearchloresorg]

Search engines cover (at best) frac14 of the web

Different SE may return different results (as they overlap)

Quality of results in terms of precision and recall

See (for instance) here

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 8: ReSearch - Searching for Researchers

8

Davide EynardReSearch - 20080606

So what

What is the real purpose of this lecture thenWhat are the contentsIs this a short version of the PhD course

NAH

Theres so much material about search that we could prepareten complementary PhD courses

Moreover I already had some material I wanted to recycle httpsearchloresorg ndash a precious source for seekers PowerBrowsing ndash an old project of mine

BUT I also have something new to tell you I promise

9

Davide EynardReSearch - 20080606

The Web

[httpwwwsearchloresorg]

Search engines cover (at best) frac14 of the web

Different SE may return different results (as they overlap)

Quality of results in terms of precision and recall

See (for instance) here

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 9: ReSearch - Searching for Researchers

9

Davide EynardReSearch - 20080606

The Web

[httpwwwsearchloresorg]

Search engines cover (at best) frac14 of the web

Different SE may return different results (as they overlap)

Quality of results in terms of precision and recall

See (for instance) here

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 10: ReSearch - Searching for Researchers

10

Davide EynardReSearch - 20080606

The Internet

EmailIRC

Usenet

P2P

IMBlogsWikisForumsFile sharingFolksonomies

EmuleBittorrent

The Web vs Not the Web

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 11: ReSearch - Searching for Researchers

11

Davide EynardReSearch - 20080606

Search engines

How are search engines used Mostly queries with one or few words

bull (which ones Give a look at zeitgeist) Mostly you look just at the first hits

bull (check here and here)

Main operators are available instead quotes allinanchor inurl filetype intitle related and of course boolean ones

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 12: ReSearch - Searching for Researchers

12

Davide EynardReSearch - 20080606

True or false

How true is boolean search (that is how truly boolean) ldquoI want this term or this other and not that onerdquo is fine but dont try to think in sets

semantic AND web

semantic

web

semantic

semantic AND semantic

but it doesnt work like this

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 13: ReSearch - Searching for Researchers

13

Davide EynardReSearch - 20080606

Vector Space Model

In the VSM documents are represented as vectors in a multidimensional Euclidean space

The coordinate of document d in axis t is given by dt = TF(dt) IDF(t)

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 14: ReSearch - Searching for Researchers

14

Davide EynardReSearch - 20080606

The epanaleptical approach

Some search engines are based on models that are much more similarto the VSM than to sets+boolean

Epanaleptical approach just repeat the word many times if its more that one word surround them with quotes

Examples (nice academic drawbacks) semantic web semantic web + collaborative systems slam performance evaluation

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 15: ReSearch - Searching for Researchers

15

Davide EynardReSearch - 20080606

To google or not to google

Use google to find anything ldquolocalrdquo searches can be run from google too try it with blogs forums wikis etc

bull phpbb trickbull mediawiki trick

Use alternative search engines search for relatedwwwgooglecom

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 16: ReSearch - Searching for Researchers

16

Davide EynardReSearch - 20080606

Search techniques

Word search (+ suffixes) Webbits (here and here)

bull (and the ldquoindex ofrdquo trick) Concept related search and specific search engines Arrows using communities of practice to enhance search

bull What are diy gtd seo slam etc Foster serendipity

bull check upper dirsbull follow linksbull look at the status bar

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 17: ReSearch - Searching for Researchers

17

Davide EynardReSearch - 20080606

Exploit collaboration

BlogsNews Ok I suppose you all know about RSS feeds

bull You can recognize thembull You can mash them upbull You can use them for other media

but how can you find interesting onesbull AideRSS techniquebull and a tutorial that explains you how to use it

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 18: ReSearch - Searching for Researchers

18

Davide EynardReSearch - 20080606

Exploit collaboration

Folksonomies delicious magnolia

Bibliography sharing bibsonomy CiteULike

Social networksgroups Ever searched for Facebook groups

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 19: ReSearch - Searching for Researchers

19

Davide EynardReSearch - 20080606

DIY

AKA Do It Yourself AKA means Also Known As

bull Also means well just jokin

In this case it means use a personal custom approach using readymade tools or creating new ones

How can you do it Know thy enemy

bull WWW HTTP HTML (see powerbrowsing)bull Human patternsbull PC patterns

Build models Exploit tools or regularities in contents

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 20: ReSearch - Searching for Researchers

20

Davide EynardReSearch - 20080606

Web Technologies

There are some things you should know to make a well-behavingbot

bull HTTP GET and POST Referer UserAgent Cookie Proxy

bull HTML Form Dynamically generated code

Give a look at this tutorial And to some DEI examples

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 21: ReSearch - Searching for Researchers

21

Davide EynardReSearch - 20080606

Tools and examples

Web toolsbull Program Committee Searcherbull Changedetectionbull Wayback machinebull Mashup toolsbull SpeakinAbout

Client toolsbull user agent switcherbull spidersscrapersbull custom made tools -)bull Firefox search plugins

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 22: ReSearch - Searching for Researchers

22

Davide EynardReSearch - 20080606

To conclude did you know

that we have people working on very interesting stuff about searching libraries and documents here (and in the real world about 100m from us)

that here you can find all the info you need to set up the university proxy so you can access restricted document libraries from anywhere

that on the OPAC you can find recent doctoral theses ready to read in pdf format

and that you have a lot of polimi-related news here

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 23: ReSearch - Searching for Researchers

23

Davide EynardReSearch - 20080606

Thats all folks

Thank you

Questions

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24
Page 24: ReSearch - Searching for Researchers

24

Davide EynardReSearch - 20080606

Contact Davide Eynard

eynardeletpolimiit

httpwwwdeipolimiitpeopleeynard

Tel 02 2399 4010

Fax 02 2399 3411

Back

  • Pagina 1
  • Pagina 2
  • Pagina 3
  • Pagina 4
  • Pagina 5
  • Pagina 6
  • Pagina 7
  • Pagina 8
  • Pagina 9
  • Pagina 10
  • Pagina 11
  • Pagina 12
  • Pagina 13
  • Pagina 14
  • Pagina 15
  • Pagina 16
  • Pagina 17
  • Pagina 18
  • Pagina 19
  • Pagina 20
  • Pagina 21
  • Pagina 22
  • Pagina 23
  • Pagina 24