Basics of Search Engines and Algorithms

Search Engine Optimization: How Search Engines Work. Presented by Mohammed Azharuddin

Upload: web-trainings-academy

Post on 06-May-2015


DESCRIPTION

Web Trainings Academy presents Part 1 of the SEO Training Series. Learn about the concepts of search engines, their architecture, the SERP, and search algorithm updates. Presented by Mohammed Azharuddin

TRANSCRIPT

Page 1: Basics of Search Engines and Algorithms

Search Engine Optimization

How Search Engines Work

Presented by Mohammed Azharuddin

Page 2: Basics of Search Engines and Algorithms

Contact Info

• Facebook: Md Azharuddin Barkati

• Twitter: mdazhar01

• Gmail: [email protected]

Page 3: Basics of Search Engines and Algorithms

History of Search

• 1990 – Archie Query Form – FTP based file search engine

• Feb 1993 – Excite.com – General word relation based search

• Oct 1993 – AliWeb – Manual submission engine

• Dec 1995 – AltaVista – First natural language search engine

Page 4: Basics of Search Engines and Algorithms

• Jan 1996 – BackRub – Started by Larry Page and Sergey Brin

• Sep 15 1997 – Google.com – First search engine with PageRank technology

• 1997 – Yandex.com – Russian search engine

• 1998 – MSN Search – Microsoft's rival to Google

Page 5: Basics of Search Engines and Algorithms

• 2000 – Baidu.com – Chinese search engine

• 2008 – duckduckgo.com – Non-tracking search engine

• 2009 – Bing.com – Microsoft's rival to Google

• 2010 – Blekko.com – Spam and virus free search

http://www.searchenginehistory.com/

http://www.google.co.in/about/company/history/

http://www.wordstream.com/articles/internet-search-engines-history

Page 6: Basics of Search Engines and Algorithms

The Google Story

Page 7: Basics of Search Engines and Algorithms

Search Engine Architecture

• Every search engine is based on the following

– Crawling

– Indexing

– Algorithms

– Results

– Fight Spam

Page 8: Basics of Search Engines and Algorithms

Google Architecture

http://infolab.stanford.edu/~backrub/google.html

Page 9: Basics of Search Engines and Algorithms

Search Engine Architecture

Diagram labels:

• WWW: 60 trillion pages (60 lakh crore)

• Crawler, Store, Indexer

• Indexes: 100 million GB

• Search Interface: Query, Algorithms (Programs), Results

• Results are sorted based on content / factors; unwanted pages go to trash

• Live Google example
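The crawl, index, and query stages named in the architecture can be sketched in a few lines of Python. This is a toy illustration only, not how any real search engine is built: the `CRAWLED_PAGES` dictionary stands in for the crawler's store, the URLs and texts are made up, and the ranking simply counts matching query words.

```python
from collections import defaultdict

# Hypothetical stand-in for the crawler's store: URL -> page text.
CRAWLED_PAGES = {
    "example.com/seo": "seo basics for search engines",
    "example.com/ppc": "ppc ads on search results",
    "example.com/news": "news about algorithms",
}

def build_index(pages):
    """Indexer: map every word to the set of URLs that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index

def search(index, query):
    """Algorithm: score each URL by how many query words it contains."""
    scores = defaultdict(int)
    for word in query.lower().split():
        for url in index.get(word, ()):
            scores[url] += 1
    # Highest score first; ties are broken alphabetically by URL.
    return sorted(scores, key=lambda url: (-scores[url], url))

index = build_index(CRAWLED_PAGES)
results = search(index, "search engines")
print(results)
```

Pages that match no query word never enter `scores`, which loosely mirrors the "trash" path in the diagram.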

Page 10: Basics of Search Engines and Algorithms

Algorithms

• Programs and formulas to get relevant results

– PageRank

– Spelling Check

– Synonym Check

– Auto Complete

– Query Understanding

– Safe Search

– User Context

Page 11: Basics of Search Engines and Algorithms

Page Rank Algorithm

• Google's first algorithm, which looks at the links between pages to determine their relevance.

• PR is a number generated for each page available in the Google index

• PR Toolbar range – NA to 10 (best rank); this is based on a log scale of 0 – 10

• Real PageRank is calculated based on the number of pages in the index and can range from 0.15 to trillions

Page 12: Basics of Search Engines and Algorithms

Toolbar Vs. Real PR

Toolbar PR    Real PR

0             0 – 10

1             100 – 1,000

2             1,000 – 10,000

3             10,000 – 100,000

4             100,000 – 1,000,000

5             1,000,000 – 10,000,000

http://www.webworkshop.net/pagerank_calculator.php3

Page 13: Basics of Search Engines and Algorithms

PR Formula

Updated Formula

Old Formula

D = Damping Factor; PR(N) = PR of Linking Site; L(N) = No. of Outbound Links
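Using the symbols in the legend, a minimal sketch of an iterative PageRank calculation is given here. It assumes the commonly cited form PR(A) = (1 - D) + D × sum of PR(N) / L(N) over every page N that links to A, with D = 0.85. The graph, the function name `pagerank`, and the iteration count are illustrative assumptions, not values taken from the slides.

```python
def pagerank(links, d=0.85, iterations=50):
    """links maps each page to the list of pages it links to.

    Assumed form: PR(A) = (1 - d) + d * sum(PR(N) / L(N)) over pages N
    linking to A, where L(N) is the number of outbound links on N.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    pr = {page: 1.0 for page in pages}  # starting guess for every page
    for _ in range(iterations):
        new_pr = {}
        for page in pages:
            incoming = sum(
                pr[src] / len(targets)
                for src, targets in links.items()
                if page in targets
            )
            new_pr[page] = (1 - d) + d * incoming
        pr = new_pr
    return pr

# Hypothetical four-page web: D has no inbound links.
graph = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(graph)
for page in sorted(ranks):
    print(page, round(ranks[page], 3))
```

In this sketch a page with no inbound links, such as D, settles at 1 - D = 0.15, which matches the lower bound quoted for real PageRank on the earlier slide.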

Page 14: Basics of Search Engines and Algorithms

Example

http://en.wikipedia.org/wiki/PageRank

http://www.cs.princeton.edu/~chazelle/courses/BIB/pagerank.htm

Page 15: Basics of Search Engines and Algorithms

Fighting Spam

• Spam refers to websites that use unethical practices to gain search rankings

• To fight spam, Google frequently releases changes called “Algorithm Updates”

• Google changes its search algorithm around 500 – 600 times every year

• Some of them are major updates; most are minor

Page 16: Basics of Search Engines and Algorithms

Major Updates

Page 17: Basics of Search Engines and Algorithms
Page 18: Basics of Search Engines and Algorithms

• Panda Update – February 23, 2011

– This algorithm targets sites with thin content, content farms, duplicate content, high ad-to-content ratios, and a number of other quality issues.

– Affected 12% of queries on launch

– Recent update: Panda 4 – May 19, 2014

Page 19: Basics of Search Engines and Algorithms
Page 20: Basics of Search Engines and Algorithms

• Penguin Update – April 24, 2012

– This algorithm targets sites that over-optimize their pages or use excessive links.

– Affected 3% of queries on launch

– Recent update: Penguin 2.1 – Oct 4, 2013

Page 21: Basics of Search Engines and Algorithms
Page 22: Basics of Search Engines and Algorithms

• Hummingbird Update – August 2013

– This algorithm understands the context of a query by analyzing the words in it

– It can automatically rewrite the query internally based on certain words such as “Near”, “Vs”, “How to”, “Where”, “Who is”, etc.

– Many queries are answered with “ONE BOX ANSWERS” that give a quick answer

Page 23: Basics of Search Engines and Algorithms

How It Works

User Query → Query Translator → Modified Query → Index
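The query translator step can be illustrated with a toy Python sketch. Google's actual rewriting logic is not public, so everything here is an assumption for illustration: the trigger patterns, the intent labels, and the function name `translate_query` are hypothetical and only loosely follow the example words on the Hummingbird slide.

```python
import re

# Hypothetical trigger patterns and intent labels (not Google's real rules).
PATTERNS = [
    (re.compile(r"^who is (.+)$"), "person_lookup"),
    (re.compile(r"^(.+) vs\.? (.+)$"), "comparison"),
    (re.compile(r"^(.+) near (.+)$"), "local_search"),
    (re.compile(r"^how to (.+)$"), "instructions"),
]

def translate_query(query):
    """Turn a raw user query into a modified query with an intent label."""
    text = query.strip().lower()
    for pattern, intent in PATTERNS:
        match = pattern.match(text)
        if match:
            return {"intent": intent, "terms": list(match.groups())}
    # No trigger word found: fall back to a plain keyword lookup.
    return {"intent": "keyword_search", "terms": text.split()}

print(translate_query("Banana Vs. Apple"))
print(translate_query("Who is wife of Bill Gates"))
print(translate_query("seo training"))
```

The modified query, rather than the raw text, is what would then be looked up in the index.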

Page 24: Basics of Search Engines and Algorithms

One Box Answers Queries

• When is Independence of India

• Time in India or Time in Toronto

• 1$ to INR

• 1 Mile to Kms

• Banana Vs. Apple

• Who is wife of Bill Gates

• What is my IP

• Who invented www

• Show me pictures of Taj Mahal

Page 25: Basics of Search Engines and Algorithms

Search Engine Results Page (SERP)

Page 26: Basics of Search Engines and Algorithms
Page 27: Basics of Search Engines and Algorithms

Types of Results

• Paid Results

– PPC Ads

– Comparison Ads

– Shopping Ads

• Non Paid Results

– Organic Web

– News Results

– Image Results

– Local Results

– Video Results

– Site Links

– Schema Data

Page 28: Basics of Search Engines and Algorithms

Click Through Rate (CTR)

• CTR measures how many of the users who see a site on the SERP click through to it

• CTR helps to understand the user response

• The top four positions, which are “above the fold” for many desktop users, receive 83% of first-page organic clicks.

CTR = (No. of Clicks / No. of Impressions) × 100
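The CTR formula can be checked with a short Python sketch. The click and impression counts are made-up numbers for illustration, and returning 0 when there are no impressions is an assumption, since the formula itself is undefined in that case.

```python
def click_through_rate(clicks, impressions):
    """Return CTR as a percentage: (clicks / impressions) x 100."""
    if impressions == 0:
        return 0.0  # assumption: report 0% when there are no impressions
    return 100.0 * clicks / impressions

# Hypothetical numbers: 25 clicks from 1,000 impressions.
ctr = click_through_rate(clicks=25, impressions=1000)
print(f"CTR: {ctr:.1f}%")  # prints "CTR: 2.5%"
```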

Page 29: Basics of Search Engines and Algorithms

2011

Page 30: Basics of Search Engines and Algorithms

2012 CTR Results

Page 31: Basics of Search Engines and Algorithms

Branded Vs. Unbranded

Page 32: Basics of Search Engines and Algorithms

Thank you

Give us your feedback